2024 ECCV ECCV 2024

ViGoR: Improving Visual Grounding of Large Vision Language Models with Fine-Grained Reward Modeling