ECCV 2024

SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding