Research Explorer

CodecFake: Enhancing Anti-Spoofing Models Against Deepfake Audios from Codec-Based Speech Synthesis Systems

Haibin Wu, Yuan Tseng, Hung-yi Lee

2024 INTERSPEECH

Cognitive Insights Across Languages: Enhancing Multimodal Interview Analysis

David Ortiz-Perez, Jose Garcia-Rodriguez, David Tomás

2024 INTERSPEECH

CogniVoice: Multimodal and Multilingual Fusion Networks for Mild Cognitive Impairment Assessment from Spontaneous Speech

Jiali Cheng, Mohamed Elgaar, Nidhi Vakil et al.

2024 INTERSPEECH

Collaborative Contrastive Learning for Hypothesis Domain Adaptation

Jen-Tzung Chien, I-Ping Yeh, Man-Wai Mak

2024 INTERSPEECH

Collecting Mandible Movement in Brazilian Portuguese

Donna Erickson, Albert Rilliard, Malin Svensson Lundmark et al.

2024 INTERSPEECH

CoLM-DSR: Leveraging Neural Codec Language Modeling for Multi-Modal Dysarthric Speech Reconstruction

Xueyuan Chen, Dongchao Yang, Dingdong Wang et al.

2024 INTERSPEECH

Combining Acoustic Feature Sets for Detecting Mild Cognitive Impairment in the Interspeech'24 TAUKADIAL Challenge

Gábor Gosztolya, László Tóth

2024 INTERSPEECH

ComFeAT: combination of neural and spectral features for improved depression detection

Orchid Chetia Phukan, Sarthak Jain, Shubham Singh et al.

2024 INTERSPEECH

Comparative Analysis of Personalized Voice Activity Detection Systems: Assessing Real-World Effectiveness

Sai Srujana Buddi, Satyam Kumar, Utkarsh Sarawgi et al.

2024 INTERSPEECH

Comparing ambulatory voice measures during daily life with brief laboratory assessments in speakers with and without vocal hyperfunction

Daryush D. Mehta, Jarrad H. Van Stan, Hamzeh Ghasemzadeh et al.

2024 INTERSPEECH

Comparing ASR Systems in the Context of Speech Disfluencies

Maria Teleki, Xiangjue Dong, Soohwan Kim et al.

2024 INTERSPEECH

Comparing Discrete and Continuous Space LLMs for Speech Recognition

Yaoxun Xu, Shi-Xiong Zhang, Jianwei Yu et al.

2024 INTERSPEECH

Complex Image-Generative Diffusion Transformer for Audio Denoising

Junhui Li, Pu Wang, Jialu Li et al.

2024 INTERSPEECH

Confidence-aware Hypothesis Transfer Networks for Source-Free Cross-Corpus Speech Emotion Recognition

Jincen Wang, Yan Zhao, Cheng Lu et al.

2024 INTERSPEECH

Confidence Estimation for Automatic Detection of Depression and Alzheimer’s Disease Based on Clinical Interviews

Wen Wu, Chao Zhang, Philip C. Woodland

2024 INTERSPEECH

Conformer without Convolutions

Matthijs Van keirsbilck, Alexander Keller

2024 INTERSPEECH

Connected Speech-Based Cognitive Assessment in Chinese and English

Saturnino Luz, Sofia De La Fuente Garcia, Fasih Haider et al.

2024 INTERSPEECH

ConnecTone: a modular AAC system prototype with contextual generative text prediction and style-adaptive conversational TTS

Juliana Francis, Éva Székely, Joakim Gustafson

2024 INTERSPEECH

ConsistencyTTA: Accelerating Diffusion-Based Text-to-Audio Generation with Consistency Distillation

Yatong Bai, Trung Dang, Dung Tran et al.

2024 INTERSPEECH

Contemplative Mechanism for Speech Recognition: Speech Encoders can Think

Tien-Ju Yang, Andrew Rosenberg, Bhuvana Ramabhadran

2024 INTERSPEECH

Context-Aware Speech Recognition Using Prompts for Language Learners

Jian Cheng

2024 INTERSPEECH

Contextual Biasing Speech Recognition in Speech-enhanced Large Language Model

Xun Gong, Anqi Lv, Zhiming Wang et al.

2024 INTERSPEECH

Contextual Biasing with Confidence-based Homophone Detector for Mandarin End-to-End Speech Recognition

Chengxu Yang, Lin Zheng, Sanli Tian et al.

2024 INTERSPEECH

Contextual Biasing with the Knuth-Morris-Pratt Matching Algorithm

Weiran Wang, Zelin Wu, Diamantino Caseiro et al.

2024 INTERSPEECH

Contextual Interactive Evaluation of TTS Models in Dialogue Systems

Siyang Wang, Éva Székely, Joakim Gustafson

2024 INTERSPEECH
