ICML 2025

Reasoning Limitations of Multimodal Large Language Models. A Case Study of Bongard Problems