Automated Evaluation of Large Vision-Language Models on Self-Driving Corner Cases

Kai Chen; Yanze Li; Wenhua Zhang; Yanxin Liu; Pengxiang Li; Ruiyuan Gao; Lanqing Hong; Meng Tian; Xinhai Zhao; Zhenguo Li; Dit-Yan Yeung; Huchuan Lu; Xu Jia

2025 WACV WACV 2025

Automated Evaluation of Large Vision-Language Models on Self-Driving Corner Cases

Abstract

Large Vision-Language Models (LVLMs) have received widespread attentions for advancing the interpretable self-driving. Existing evaluations of LVLMs primarily focus on multi-faceted capabilities in natural circumstances lacking automated and quantifiable assessment for self-driving let alone the severe road corner cases. In this work we propose CODA-LM the very first benchmark for the automatic evaluation of LVLMs for self-driving corner cases. We adopt a hierarchical data structure and prompt powerful LVLMs to analyze complex driving scenes and generate high-quality pre-annotations for the human annotators while for LVLM evaluation we show that using the text-only large language models (LLMs) as judges reveals even better alignment with human preferences than the LVLM judges. Moreover with our CODA-LM we build CODA-VLM a new driving LVLM surpassing all open-sourced counterparts on CODA-LM. Our CODA-VLM performs comparably with GPT-4V even surpassing GPT-4V by +21.42% on the regional perception task. We hope CODA-LM can become the catalyst to promote interpretable self-driving empowered by LVLMs.

🧭 Keyword Pioneer — corner case

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Kai Chen , Yanze Li , Wenhua Zhang , Yanxin Liu , Pengxiang Li , Ruiyuan Gao , Lanqing Hong , Meng Tian , Xinhai Zhao , Zhenguo Li , Dit-Yan Yeung , Huchuan Lu , Xu Jia

Topics

Artificial Intelligence > Core AI > Autonomous Vehicles

Keywords

benchmark evaluation large vision-language model corner case interpretable self-driving

Download PDF

Related papers

Neural Graph Map: Dense Mapping with Efficient Loop Closure Integration 2025

ELMGS: Enhancing Memory and Computation Scalability through Compression for 3D Gaussian Splatting 2025

Feature Fusion Transferability Aware Transformer for Unsupervised Domain Adaptation 2025

Uncertainty-Aware Online Extrinsic Calibration: A Conformal Prediction Approach 2025

Disentangling Spatio-Temporal Knowledge for Weakly Supervised Object Detection and Segmentation in Surgical Video 2025