2024 ECCV ECCV 2024

Improving Vision and Language Concepts Understanding with Multimodal Counterfactual Samples