Research Explorer

1000 African Voices: Advancing inclusive multi-speaker multi-accent speech synthesis

Sewade Ogun, Abraham T. Owodunni, Tobi Olatunji et al.

2024 INTERSPEECH

2.5D Vocal Tract Modeling: Bridging Low-Dimensional Efficiency with 3D Accuracy

Debasish Ray Mohapatra, Victor Zappi, Sidney Fels

2024 INTERSPEECH

2DP-2MRC: 2-Dimensional Pointer-based Machine Reading Comprehension Method for Multimodal Moment Retrieval

Jiajun He, Tomoki Toda

2024 INTERSPEECH

Acceleration of Posteriorgram-based DTW by Distilling the Class-to-class Distances Encoded in the Classifier Used to Calculate Posteriors

Haitong Sun, Jaehyun Choi, Nobuaki Minematsu et al.

2024 INTERSPEECH

Accent Conversion with Articulatory Representations

Yashish M. Siriwardena, Nathan Swedlow, Audrey Howard et al.

2024 INTERSPEECH

A ChatGPT-based oral Q&A practice system for first-time student participants in international conferences

Mayuko Aiba, Daisuke Saito, Nobuaki Minematsu

2024 INTERSPEECH

A Cluster-based Personalized Federated Learning Strategy for End-to-End ASR of Dementia Patients

Wei-Tung Hsu, Chin-Po Chen, Yun-Shao Lin et al.

2024 INTERSPEECH

A Comparative Analysis of Bilingual and Trilingual Wav2Vec Models for Automatic Speech Recognition in Multilingual Oral History Archives

Jan Lehečka, Josef V. Psutka, Luboš Šmídl et al.

2024 INTERSPEECH

A Comparative Analysis of Federated Learning for Speech-Based Cognitive Decline Detection

Stefan Kalabakov, Monica Gonzalez-Machorro, Florian Eyben et al.

2024 INTERSPEECH

A comparative analysis of sequential models that integrate syllable dependency for automatic syllable stress detection

Jhansi Mallela, Sai Harshitha Aluru, Chiranjeevi Yarra

2024 INTERSPEECH

A comparative study of the impact of voiceless alveolar and palato-alveolar sibilants in English on lip aperture and protrusion during VCV production

Chetan Sharma, Vaishnavi Chandwanshi, Prasanta Kumar Ghosh

2024 INTERSPEECH

A comparison of voice similarity through acoustics, human perception and deep neural network (DNN) speaker verification systems

Suyuan Liu, Molly Babel, Jian Zhu

2024 INTERSPEECH

A Comprehensive Investigation on Speaker Augmentation for Speaker Recognition

Zhenyu Zhou, Shibiao Xu, Shi Yin et al.

2024 INTERSPEECH

A Contrastive Learning Approach to Mitigate Bias in Speech Models

Alkis Koudounas, Flavio Giobergia, Eliana Pastor et al.

2024 INTERSPEECH

Acoustical analysis of the initial phones in speech-laugh

Ryo Setoguchi, Yoshiko Arimoto

2024 INTERSPEECH

Acoustic changes in speech prosody produced by children with autism after robot-assisted speech training

Si Chen, Bruce Xiao Wang, Yitian Hong et al.

2024 INTERSPEECH

Acoustic Effects of Facial Feminisation Surgery on Speech and Singing: A Case Study

Cliodhna Hughes, Guy Brown, Ning Ma et al.

2024 INTERSPEECH

Acoustic Feature Mixup for Balanced Multi-aspect Pronunciation Assessment

Heejin Do, Wonjun Lee, Gary Geunbae Lee

2024 INTERSPEECH

Acquisition of high vowel devoicing in Japanese: A production experiment with three and four year olds

Hyun Kyung Hwang, Manami Hirayama

2024 INTERSPEECH

A Cross-Attention Layer coupled with Multimodal Fusion Methods for Recognizing Depression from Spontaneous Speech

Loukas Ilias, Dimitris Askounis

2024 INTERSPEECH

Active Speaker Detection in Fisheye Meeting Scenes with Scene Spatial Spectrums

Xinghao Huang, Weiwei Jiang, Long Rao et al.

2024 INTERSPEECH

Adapter Learning from Pre-trained Model for Robust Spoof Speech Detection

Haochen Wu, Wu Guo, Shengyu Peng et al.

2024 INTERSPEECH

Adapter pre-training for improved speech recognition in unseen domains using low resource adapter tuning of self-supervised models

Sathvik Udupa, Jesuraj Bandekar, Saurabh Kumar et al.

2024 INTERSPEECH

AdaRA: Adaptive Rank Allocation of Residual Adapters for Speech Foundation Model

Zhouyuan Huo, Dongseong Hwang, Gan Song et al.

2024 INTERSPEECH

A data-driven model of acoustic speech intelligibility for optimization-based models of speech production

Benjamin Elie, Juraj Šimko, Alice Turk

2024 INTERSPEECH
