2024 ECCV ECCV 2024

Rethinking Video-Text Understanding: Retrieval from Counterfactually Augmented Data