Great point! Thatβs actually a pretty active area of research as well, enabling models to adopt flexible perspective-taking depending on the task. It would be exciting to see a convergence between the advancements in LLMs and VLMs for spatial reasoning.