Papers
8,761 papers found
CodecFake: Enhancing Anti-Spoofing Models Against Deepfake Audios from Codec-Based Speech Synthesis Systems
Haibin Wu, Yuan Tseng, Hung-yi Lee
Cognitive Insights Across Languages: Enhancing Multimodal Interview Analysis
David Ortiz-Perez, Jose Garcia-Rodriguez, David Tomás
CogniVoice: Multimodal and Multilingual Fusion Networks for Mild Cognitive Impairment Assessment from Spontaneous Speech
Jiali Cheng, Mohamed Elgaar, Nidhi Vakil et al.
Collaborative Contrastive Learning for Hypothesis Domain Adaptation
Jen-Tzung Chien, I-Ping Yeh, Man-Wai Mak
Collecting Mandible Movement in Brazilian Portuguese
Donna Erickson, Albert Rilliard, Malin Svensson Lundmark et al.
CoLM-DSR: Leveraging Neural Codec Language Modeling for Multi-Modal Dysarthric Speech Reconstruction
Xueyuan Chen, Dongchao Yang, Dingdong Wang et al.
Combining Acoustic Feature Sets for Detecting Mild Cognitive Impairment in the Interspeech'24 TAUKADIAL Challenge
Gábor Gosztolya, László Tóth
ComFeAT: combination of neural and spectral features for improved depression detection
Orchid Chetia Phukan, Sarthak Jain, Shubham Singh et al.
Comparative Analysis of Personalized Voice Activity Detection Systems: Assessing Real-World Effectiveness
Sai Srujana Buddi, Satyam Kumar, Utkarsh Sarawgi et al.
Comparing ambulatory voice measures during daily life with brief laboratory assessments in speakers with and without vocal hyperfunction
Daryush D. Mehta, Jarrad H. Van Stan, Hamzeh Ghasemzadeh et al.
Comparing ASR Systems in the Context of Speech Disfluencies
Maria Teleki, Xiangjue Dong, Soohwan Kim et al.
Comparing Discrete and Continuous Space LLMs for Speech Recognition
Yaoxun Xu, Shi-Xiong Zhang, Jianwei Yu et al.
Complex Image-Generative Diffusion Transformer for Audio Denoising
Junhui Li, Pu Wang, Jialu Li et al.
Confidence-aware Hypothesis Transfer Networks for Source-Free Cross-Corpus Speech Emotion Recognition
Jincen Wang, Yan Zhao, Cheng Lu et al.
Confidence Estimation for Automatic Detection of Depression and Alzheimer’s Disease Based on Clinical Interviews
Wen Wu, Chao Zhang, Philip C. Woodland
Conformer without Convolutions
Matthijs Van keirsbilck, Alexander Keller
Connected Speech-Based Cognitive Assessment in Chinese and English
Saturnino Luz, Sofia De La Fuente Garcia, Fasih Haider et al.
ConnecTone: a modular AAC system prototype with contextual generative text prediction and style-adaptive conversational TTS
Juliana Francis, Éva Székely, Joakim Gustafson
ConsistencyTTA: Accelerating Diffusion-Based Text-to-Audio Generation with Consistency Distillation
Yatong Bai, Trung Dang, Dung Tran et al.
Contemplative Mechanism for Speech Recognition: Speech Encoders can Think
Tien-Ju Yang, Andrew Rosenberg, Bhuvana Ramabhadran
Contextual Biasing Speech Recognition in Speech-enhanced Large Language Model
Xun Gong, Anqi Lv, Zhiming Wang et al.
Contextual Biasing with Confidence-based Homophone Detector for Mandarin End-to-End Speech Recognition
Chengxu Yang, Lin Zheng, Sanli Tian et al.
Contextual Biasing with the Knuth-Morris-Pratt Matching Algorithm
Weiran Wang, Zelin Wu, Diamantino Caseiro et al.
Contextual Interactive Evaluation of TTS Models in Dialogue Systems
Siyang Wang, Éva Székely, Joakim Gustafson