SpatialLM1.1-Qwen-0.5B-ARKitScenes-SFT

This is a finetuned version of SpatialLM1.1-Qwen-0.5B on ARKitScenes, using a new set of object categories and enable random, gravity-aligned scene orientations. During inference, only the z-axis needs to be kept as the up axis; the x and y axes do not need to be aligned.

Results

Here are some example results comparing ground truth (GT) oriented object bounding boxes with predictions (Pred) from the finetuned SpatialLM1.1-Qwen-0.5B model on the ARKitScenes dataset.

GT Pred
42446137_gt 42446137_pred
47430470_gt 47430470_pred
47334109_gt 47334109_pred
Downloads last month
23
Safetensors
Model size
0.6B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for ysmao/SpatialLM1.1-Qwen-0.5B-ARKitScenes-SFT

Base model

Qwen/Qwen2.5-0.5B
Finetuned
(3)
this model

Dataset used to train ysmao/SpatialLM1.1-Qwen-0.5B-ARKitScenes-SFT

Collection including ysmao/SpatialLM1.1-Qwen-0.5B-ARKitScenes-SFT