🌍 Spatial-SSRL: Spatial Reasoning with Vision-Language Models

✨ Upload an image and ask questions about spatial relationships, locations, and orientations! ✨

Input Image

Question

Apply format prompt (default on)

When enabled, the predefined format prompt is automatically concatenated to your question.
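As a rough illustration of what this toggle does, the sketch below builds the final prompt text. It is only a sketch under stated assumptions: `FORMAT_PROMPT`, `build_prompt`, and the placeholder string are hypothetical names, and the demo's actual format prompt and whether it is appended or prepended are not shown on this page. Here it is assumed to be appended after the question.

```python
# Hypothetical placeholder: the demo's real format prompt is not shown on this page.
FORMAT_PROMPT = "<predefined format prompt>"


def build_prompt(question: str,
                 apply_format_prompt: bool = True,
                 format_prompt: str = FORMAT_PROMPT) -> str:
    """Return the text that would be sent to the model.

    Assumption: the format prompt is appended after the question,
    separated by a single space.
    """
    question = question.strip()
    if not apply_format_prompt:
        # Toggle off: the question is passed through unchanged.
        return question
    # Toggle on (the default): concatenate the format prompt to the question.
    return f"{question} {format_prompt}"
```

With the toggle off, `build_prompt("Where is the cup?", apply_format_prompt=False)` returns the question as typed, which can be useful if you want to supply your own output instructions.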

Answer

Generated Tokens


This demo showcases the spatial reasoning capabilities of vision-language models. Given an image, the model can answer questions about spatial relationships, locations, and orientations.

If you find this project useful, please cite:

BibTeX Citation