2024 EMNLP EMNLP 2024

Eliciting In-Context Learning in Vision-Language Models for Videos Through Curated Data Distributional Properties