2024
ICML
ICML 2024
video-SALMONN: Speech-Enhanced Audio-Visual Large Language Models
Authors
Guangzhi Sun
,
Wenyi Yu
,
Changli Tang
,
Xianzhao Chen
,
Tian Tan
,
Wei Li
,
Lu Lu
,
Zejun Ma
,
Yuxuan Wang
,
Chao Zhang