Papers
8,761 papers found
1000 African Voices: Advancing inclusive multi-speaker multi-accent speech synthesis
Sewade Ogun, Abraham T. Owodunni, Tobi Olatunji et al.
2.5D Vocal Tract Modeling: Bridging Low-Dimensional Efficiency with 3D Accuracy
Debasish Ray Mohapatra, Victor Zappi, Sidney Fels
Acceleration of Posteriorgram-based DTW by Distilling the Class-to-class Distances Encoded in the Classifier Used to Calculate Posteriors
Haitong Sun, Jaehyun Choi, Nobuaki Minematsu et al.
Accent Conversion with Articulatory Representations
Yashish M. Siriwardena, Nathan Swedlow, Audrey Howard et al.
A ChatGPT-based oral Q&A practice system for first-time student participants in international conferences
Mayuko Aiba, Daisuke Saito, Nobuaki Minematsu
A Cluster-based Personalized Federated Learning Strategy for End-to-End ASR of Dementia Patients
Wei-Tung Hsu, Chin-Po Chen, Yun-Shao Lin et al.
A Comparative Analysis of Bilingual and Trilingual Wav2Vec Models for Automatic Speech Recognition in Multilingual Oral History Archives
Jan Lehečka, Josef V. Psutka, Lubos Smidl et al.
A Comparative Analysis of Federated Learning for Speech-Based Cognitive Decline Detection
Stefan Kalabakov, Monica Gonzalez-Machorro, Florian Eyben et al.
A comparative analysis of sequential models that integrate syllable dependency for automatic syllable stress detection
Jhansi Mallela, Sai Harshitha Aluru, Chiranjeevi Yarra
A comparative study of the impact of voiceless alveolar and palato-alveolar sibilants in English on lip aperture and protrusion during VCV production
Chetan Sharma, Vaishnavi Chandwanshi, Prasanta Kumar Ghosh
A comparison of voice similarity through acoustics, human perception and deep neural network (DNN) speaker verification systems
Suyuan Liu, Molly Babel, Jian Zhu
A Comprehensive Investigation on Speaker Augmentation for Speaker Recognition
Zhenyu Zhou, Shibiao Xu, Shi Yin et al.
A Contrastive Learning Approach to Mitigate Bias in Speech Models
Alkis Koudounas, Flavio Giobergia, Eliana Pastor et al.
Acoustical analysis of the initial phones in speech-laugh
Ryo Setoguchi, Yoshiko Arimoto
Acoustic changes in speech prosody produced by children with autism after robot-assisted speech training
Si Chen, Bruce Xiao Wang, Yitian Hong et al.
Acoustic Effects of Facial Feminisation Surgery on Speech and Singing: A Case Study
Cliodhna Hughes, Guy Brown, Ning Ma et al.
Acoustic Feature Mixup for Balanced Multi-aspect Pronunciation Assessment
Heejin Do, Wonjun Lee, Gary Geunbae Lee
Acquisition of high vowel devoicing in Japanese: A production experiment with three and four year olds
Hyun Kyung Hwang, Manami Hirayama
A Cross-Attention Layer coupled with Multimodal Fusion Methods for Recognizing Depression from Spontaneous Speech
Loukas Ilias, Dimitris Askounis
Active Speaker Detection in Fisheye Meeting Scenes with Scene Spatial Spectrums
Xinghao Huang, Weiwei Jiang, Long Rao et al.
Adapter Learning from Pre-trained Model for Robust Spoof Speech Detection
Haochen Wu, Wu Guo, Shengyu Peng et al.
Adapter pre-training for improved speech recognition in unseen domains using low resource adapter tuning of self-supervised models
Sathvik Udupa, Jesuraj Bandekar, Saurabh Kumar et al.
AdaRA: Adaptive Rank Allocation of Residual Adapters for Speech Foundation Model
Zhouyuan Huo, Dongseong Hwang, Gan Song et al.
A data-driven model of acoustic speech intelligibility for optimization-based models of speech production
Benjamin Elie, Juraj Simko, Alice Turk