Papers
8,761 papers found
Attention-augmented X-vectors for the Evaluation of Mimicked Speech Using Sparse Autoencoder-LSTM framework
Bhasi K. C., Rajeev Rajan, Noumida A
Attentive Merging of Hidden Embeddings from Pre-trained Speech Model for Anti-spoofing Detection
Zihan Pan, Tianchi Liu, Hardik B. Sailor et al.
ATTEST: an analytics tool for the testing and evaluation of speech technologies
Dmitrii Obukhov, Marcel de Korte, Andrey Adaschik
Audio-conditioned phonemic and prosodic annotation for building text-to-speech models from unlabeled speech data
Yuma Shirahata, Byeongseon Park, Ryuichi Yamamoto et al.
Audio Editing with Non-Rigid Text Prompts
Francesco Paissan, Luca Della Libera, Zhepei Wang et al.
Audio Enhancement from Multiple Crowdsourced Recordings: A Simple and Effective Baseline
Shiran Aziz, Yossi Adi, Shmuel Peleg
Audio Fingerprinting with Holographic Reduced Representations
Yusuke Fujita, Tatsuya Komatsu
Audio Mamba: Selective State Spaces for Self-Supervised Audio Representations
Sarthak Yadav, Zheng-Hua Tan
Audio-text Retrieval with Transformer-based Hierarchical Alignment and Disentangled Cross-modal Representation
Yifei Xin, Zhihong Zhu, Xuxin Cheng et al.
Auditory Attention Decoding in Four-Talker Environment with EEG
Yujie Yan, Xiran Xu, Haolin Zhu et al.
Auditory Spatial Attention Detection Based on Feature Disentanglement and Brain Connectivity-Informed Graph Neural Networks
Yixiang Niu, Ning Chen, Hongqing Zhu et al.
A Unified Approach to Multilingual Automatic Speech Recognition with Improved Language Identification for Indic Languages
Nikhil Jakhar, Sudhanshu Srivastava, Arun Baby
Automated content assessment and feedback for Finnish L2 learners in a picture description speaking task
Nhan Phan, Anna von Zansen, Maria Kautonen et al.
Automated Human-Readable Label Generation in Open Intent Discovery
Grant Anderson, Emma Hart, Dimitra Gkatzia et al.
Automatic Assessment of Dysarthria using Speech and synthetically generated Electroglottograph signal
Fathima Zaheera, Supritha Shetty, Gayadhar Pradhan et al.
Automatic Assessment of Speech Production Skills for Children with Cochlear Implants Using Wav2Vec2.0 Acoustic Embeddings
Seonwoo Lee, Sunhee Kim, Minhwa Chung
Automatic Children Speech Sound Disorder Detection with Age and Speaker Bias Mitigation
Gahye Kim, Yunjung Eom, Selina S. Sung et al.
Automatic Classification of News Subjects in Broadcast News: Application to a Gender Bias Representation Analysis
Valentin Pelloin, Léna Dodson, Émile Chapuis et al.
Automatic Detection of Hearing Loss from Children's Speech using wav2vec 2.0 Features
Jessica Monaghan, Arun Sebastian, Nicky Chong-White et al.
Automatic Evaluation of a Sentence Memory Test for Preschool Children
Ilja Baumann, Nicole Unger, Dominik Wagner et al.
Automatic Longitudinal Investigation of Multiple Sclerosis Subjects
Gábor Gosztolya, Veronika Svindt, Judit Bóna et al.
Automatic pitch accent classification through image classification
Na Hu, Hugo Schnack, Amalia Arvaniti
Automatic Prediction of Amyotrophic Lateral Sclerosis Progression using Longitudinal Speech Transformer
Liming Wang, Yuan Gong, Nauman Dawalatabad et al.
Automatic recognition and detection of aphasic natural speech
Mara Barberis, Pieter De Clercq, Bastiaan Tamm et al.
Automatic Speech Recognition with parallel L1 and L2 acoustic phone models to evaluate /l/ allophony in L2 English speech production
Anisia Popescu, Lori Lamel, Ioana Vasilescu et al.