Papers

8,761 papers found
2024 INTERSPEECH
AVR: synergizing foundation models for audio-visual humor detection
Sarthak Sharma, Orchid Chetia Phukan, Drishti Singh et al.
2024 INTERSPEECH
Backchannel prediction, based on who, when and what
Yo-Han Park, Wencke Liermann, Yong-Seok Choi et al.
2024 INTERSPEECH
Beam-search SIEVE for low-memory speech recognition
Martino Ciaperoni, Athanasios Katsamanis, Aristides Gionis et al.
2024 INTERSPEECH
2024 INTERSPEECH
BESST Dataset: A Multimodal Resource for Speech-based Stress Detection and Analysis
Jan Pešán, Vojtěch Juřík, Martin Karafiát et al.
2024 INTERSPEECH
2024 INTERSPEECH
2024 INTERSPEECH
2024 INTERSPEECH
Binaural Selective Attention Model for Target Speaker Extraction
Hanyu Meng, Qiquan Zhang, Xiangyu Zhang et al.
2024 INTERSPEECH
Bird Whisperer: Leveraging Large Pre-trained Acoustic Model for Bird Call Classification
Muhammad Umer Sheikh, Hassan Abid, Bhuiyan Sanjid Shafique et al.
2024 INTERSPEECH
Boosting CTC-based ASR using inter-layer attention-based CTC loss
Keigo Hojo, Yukoh Wakabayashi, Kengo Ohta et al.
2024 INTERSPEECH