โจ Upload an image and ask questions about spatial relationships, locations, and orientations! โจ
๐ Paper | ๐ Github | ๐ค Spatial-SSRL-7B Model | ๐ค Spatial-SSRL-81k | ๐ฐ Daily Paper
When enabled, the predefined format prompt is automatically concatenated to your question.
Click on an example below to load it:
This demo showcases spatial reasoning capabilities of vision-language models. The model can:
If you find this project useful, please kindly cite: