ECCV 2024

Can Textual Semantics Mitigate Sounding Object Segmentation Preference?

The Questioner