Papers
8,761 papers found
Analysis and Visualization of Directional Diversity in Listening Fluency of World Englishes Speakers in the Framework of Mutual Shadowing
Yu Tomita, Yingxiang Gao, Nobuaki Minematsu et al.
Analysis of articulatory setting for L1 and L2 English speakers using MRI data
Kevin Huang, Jack Goldberg, Louis Goldstein et al.
Analyzing Multimodal Features of Spontaneous Voice Assistant Commands for Mild Cognitive Impairment Detection
Nana Lin, Youxiang Zhu, Xiaohui Liang et al.
Analyzing Speech Motor Movement using Surface Electromyography in Minimally Verbal Adults with Autism Spectrum Disorder
Wazeer Zulfikar, Nishat Protyasha, Camila Canales et al.
An Analysis of the Variance of Diffusion-based Speech Enhancement
Bunlong Lay, Timo Gerkmann
An Attribute Interpolation Method in Speech Synthesis by Model Merging
Masato Murata, Koichi Miyazaki, Tomoki Koriyama
An Effective Local Prototypical Mapping Network for Speech Emotion Recognition
Yuxuan Xi, Yan Song, Lirong Dai et al.
An efficient text augmentation approach for contextualized Mandarin speech recognition
Naijun Zheng, Xucheng Wan, Kai Liu et al.
An End-to-End Approach for Chord-Conditioned Song Generation
Shuochen Gao, Shun Lei, Fan Zhuo et al.
An End-to-End Speech Summarization Using Large Language Model
Hengchao Shang, Zongyao Li, Jiaxin Guo et al.
A New Approach to Voice Authenticity
Nicolas M. Müller, Piotr Kawa, Shen Hu et al.
An Exploration of Length Generalization in Transformer-Based Speech Enhancement
Qiquan Zhang, Hongxu Zhu, Xinyuan Qian et al.
ANIMAL-CLEAN – A Deep Denoising Toolkit for Animal-Independent Signal Enhancement
Alexander Barnhill, Elmar Noeth, Andreas Maier et al.
An inclusive approach to creating a palette of synthetic voices for gender diversity
Eva Szekely, Maxwell Hope
An Initial Investigation of Language Adaptation for TTS Systems under Low-resource Scenarios
Cheng Gong, Erica Cooper, Xin Wang et al.
An Inter-Speaker Fairness-Aware Speech Emotion Regression Framework
Hsing-Hang Chou, Woan-Shiuan Chien, Ya-Tse Wu et al.
An Investigation of Group versus Individual Fairness in Perceptually Fair Speech Emotion Recognition
Woan-Shiuan Chien, Chi-Chun Lee
An Investigation of Noise Robustness for Flow-Matching-Based Zero-Shot TTS
Xiaofei Wang, Sefik Emre Eskimez, Manthan Thakker et al.
Anonymising Elderly and Pathological Speech: Voice Conversion Using DDSP and Query-by-Example
Suhita Ghosh, Melanie Jouaiti, Arnab Das et al.
AnoPatch: Towards Better Consistency in Machine Anomalous Sound Detection
Anbai Jiang, Bing Han, Zhiqiang Lv et al.
A novel experimental design for the study of listener-to-listener convergence in phoneme categorization
Qingye Shen, Leonardo Lancia, Noel Nguyen
Anti-spoofing Ensembling Model: Dynamic Weight Allocation in Ensemble Models for Improved Voice Biometrics Security
Eros Rosello, Angel M. Gomez, Iván López-Espejo et al.
An Uyghur Extension to the MASSIVE Multi-lingual Spoken Language Understanding Corpus with Comprehensive Evaluations
Ainikaerjiang Aimaiti, Di Wu, Liting Jiang et al.
A Parameter-efficient Language Extension Framework for Multilingual ASR
Wei Liu, Jingyong Hou, Dong Yang et al.