Research Explorer

Attention-augmented X-vectors for the Evaluation of Mimicked Speech Using Sparse Autoencoder-LSTM framework

Bhasi K. C., Rajeev Rajan, Noumida A

2024 INTERSPEECH

Attentive Merging of Hidden Embeddings from Pre-trained Speech Model for Anti-spoofing Detection

Zihan Pan, Tianchi Liu, Hardik B. Sailor et al.

2024 INTERSPEECH

ATTEST: an analytics tool for the testing and evaluation of speech technologies

Dmitrii Obukhov, Marcel de Korte, Andrey Adaschik

2024 INTERSPEECH

Audio-conditioned phonemic and prosodic annotation for building text-to-speech models from unlabeled speech data

Yuma Shirahata, Byeongseon Park, Ryuichi Yamamoto et al.

2024 INTERSPEECH

Audio Editing with Non-Rigid Text Prompts

Francesco Paissan, Luca Della Libera, Zhepei Wang et al.

2024 INTERSPEECH

Audio Enhancement from Multiple Crowdsourced Recordings: A Simple and Effective Baseline

Shiran Aziz, Yossi Adi, Shmuel Peleg

2024 INTERSPEECH

Audio Fingerprinting with Holographic Reduced Representations

Yusuke Fujita, Tatsuya Komatsu

2024 INTERSPEECH

Audio Mamba: Selective State Spaces for Self-Supervised Audio Representations

Sarthak Yadav, Zheng-Hua Tan

2024 INTERSPEECH

Audio-text Retrieval with Transformer-based Hierarchical Alignment and Disentangled Cross-modal Representation

Yifei Xin, Zhihong Zhu, Xuxin Cheng et al.

2024 INTERSPEECH

Auditory Attention Decoding in Four-Talker Environment with EEG

Yujie Yan, Xiran Xu, Haolin Zhu et al.

2024 INTERSPEECH

Auditory Spatial Attention Detection Based on Feature Disentanglement and Brain Connectivity-Informed Graph Neural Networks

Yixiang Niu, Ning Chen, Hongqing Zhu et al.

2024 INTERSPEECH

A Unified Approach to Multilingual Automatic Speech Recognition with Improved Language Identification for Indic Languages

Nikhil Jakhar, Sudhanshu Srivastava, Arun Baby

2024 INTERSPEECH

Automated content assessment and feedback for Finnish L2 learners in a picture description speaking task

Nhan Phan, Anna von Zansen, Maria Kautonen et al.

2024 INTERSPEECH

Automated Human-Readable Label Generation in Open Intent Discovery

Grant Anderson, Emma Hart, Dimitra Gkatzia et al.

2024 INTERSPEECH

Automatic Assessment of Dysarthria using Speech and synthetically generated Electroglottograph signal

Fathima Zaheera, Supritha Shetty, Gayadhar Pradhan et al.

2024 INTERSPEECH

Automatic Assessment of Speech Production Skills for Children with Cochlear Implants Using Wav2Vec2.0 Acoustic Embeddings

Seonwoo Lee, Sunhee Kim, Minhwa Chung

2024 INTERSPEECH

Automatic Children Speech Sound Disorder Detection with Age and Speaker Bias Mitigation

Gahye Kim, Yunjung Eom, Selina S. Sung et al.

2024 INTERSPEECH

Automatic Classification of News Subjects in Broadcast News: Application to a Gender Bias Representation Analysis

Valentin Pelloin, Léna Dodson, Émile Chapuis et al.

2024 INTERSPEECH

Automatic Detection of Hearing Loss from Children's Speech using wav2vec 2.0 Features

Jessica Monaghan, Arun Sebastian, Nicky Chong-White et al.

2024 INTERSPEECH

Automatic Evaluation of a Sentence Memory Test for Preschool Children

Ilja Baumann, Nicole Unger, Dominik Wagner et al.

2024 INTERSPEECH

Automatic Longitudinal Investigation of Multiple Sclerosis Subjects

Gábor Gosztolya, Veronika Svindt, Judit Bóna et al.

2024 INTERSPEECH

Automatic pitch accent classification through image classification

Na Hu, Hugo Schnack, Amalia Arvaniti

2024 INTERSPEECH

Automatic Prediction of Amyotrophic Lateral Sclerosis Progression using Longitudinal Speech Transformer

Liming Wang, Yuan Gong, Nauman Dawalatabad et al.

2024 INTERSPEECH

Automatic recognition and detection of aphasic natural speech

Mara Barberis, Pieter De Clercq, Bastiaan Tamm et al.

2024 INTERSPEECH

Automatic Speech Recognition with parallel L1 and L2 acoustic phone models to evaluate /l/ allophony in L2 English speech production

Anisia Popescu, Lori Lamel, Ioana Vasilescu et al.

2024 INTERSPEECH

Papers