2024 ECCV ECCV 2024

Learning Trimodal Relation for Audio-Visual Question Answering with Missing Modality