2024 ICML ICML 2024

From Vision to Audio and Beyond: A Unified Model for Audio-Visual Representation and Generation