MonoSR: Open-Vocabulary Spatial Reasoning from Monocular Images


This is a companion discussion topic for the original entry at https://arxiv.org/abs/2511.19119