MetricGround: Bridging the Dimensionality Gap for Metric-Aware Embodied Vision-Language Models - Official implementation of the paper
embodied-ai vision-language-models spatial-question-answering metric-grounding dimensionality-gap cross-modal-distillation 3d-euclidean-reasoning occlusion-ambiguity
-
Updated
Jan 16, 2026 - Python