Papers

A Dataset and Two-pass System for Reading Miscue Detection

Raj Gothi, Rahul Kumar, Mildred Pereira et al.

2024 INTERSPEECH

Adding User Feedback To Enhance CB-Whisper

Raul Monteiro

2024 INTERSPEECH

A demonstrator for articulation-based command word recognition

Joao Vitor Possamai de Menezes, Arne-Lukas Fietkau, Tom Diener et al.

2024 INTERSPEECH

A dual task learning approach to fine-tune a multilingual semantic speech encoder for Spoken Language Understanding

Gaëlle Laperrière, Sahar Ghannay, Bassam Jabaian et al.

2024 INTERSPEECH

Adversarial Robustness Analysis in Automatic Pathological Speech Detection Approaches

Mahdi Amiri, Ina Kodrasi

2024 INTERSPEECH

Aerodynamics of Sakata labial-velar oral stops

Lorenzo Maselli, Véronique Delvaux

2024 INTERSPEECH

Affricates in Lushootseed

Ted Kye

2024 INTERSPEECH

AFL-Net: Integrating Audio, Facial, and Lip Modalities with a Two-step Cross-attention for Robust Speaker Diarization in the Wild

YongKang Yin, Xu Li, Ying Shan et al.

2024 INTERSPEECH

A Framework for Phoneme-Level Pronunciation Assessment Using CTC

Xinwei Cao, Zijian Fan, Torbjørn Svendsen et al.

2024 INTERSPEECH

A Functional Trade-off between Prosodic and Semantic Cues in Conveying Sarcasm

Zhu Li, Xiyuan Gao, Yuqing Zhang et al.

2024 INTERSPEECH

Age-related Differences in Acoustic Cues for the Perception of Checked Syllables in Shengzhou Wu

Bingliang Zhao, Jiangping Kong, Xiyu Wu

2024 INTERSPEECH

AG-LSEC: Audio Grounded Lexical Speaker Error Correction

Rohit Paturi, Xiang Li, Sundararajan Srinivasan

2024 INTERSPEECH

A Human-in-the-Loop Approach to Improving Cross-Text Prosody Transfer

Himanshu Maurya, Atli Sigurgeirsson

2024 INTERSPEECH

A Joint Noise Disentanglement and Adversarial Training Framework for Robust Speaker Verification

Xujiang Xing, Mingxing Xu, Thomas Fang Zheng

2024 INTERSPEECH

A Language Modeling Approach to Diacritic-Free Hebrew TTS

Amit Roth, Arnon Turetzky, Yossi Adi

2024 INTERSPEECH

A Layer-Anchoring Strategy for Enhancing Cross-Lingual Speech Emotion Recognition

Shreya G. Upadhyay, Carlos Busso, Chi-Chun Lee

2024 INTERSPEECH

A layer-wise analysis of Mandarin and English suprasegmentals in SSL speech models

Anton de la Fuente, Dan Jurafsky

2024 INTERSPEECH

AlignNet: Learning dataset score alignment functions to enable better training of speech quality estimators

Jaden Pieper, Stephen Voran

2024 INTERSPEECH

All Ears: Building Self-Supervised Learning based ASR models for Indian Languages at scale

Vasista Sai Lodagala, Abhishek Biswas, Shoutrik Das et al.

2024 INTERSPEECH

All Neural Low-latency Directional Speech Extraction

Ashutosh Pandey, Sanha Lee, Juan Azcarreta et al.

2024 INTERSPEECH

A Low-Bitrate Neural Audio Codec Framework with Bandwidth Reduction and Recovery for High-Sampling-Rate Waveforms

Yang Ai, Ye-Xin Lu, Xiao-Hang Jiang et al.

2024 INTERSPEECH

A multimodal analysis of different types of laughter expression in conversational dialogues

Kexin Wang, Carlos Ishi, Ryoko Hayashi

2024 INTERSPEECH

A multimodal approach to study the nature of coordinative patterns underlying speech rhythm

Jinyu Li, Leonardo Lancia

2024 INTERSPEECH

A Multimodal Framework for the Assessment of the Schizophrenia Spectrum

Gowtham Premananth, Yashish M. Siriwardena, Philip Resnik et al.

2024 INTERSPEECH

A Multitask Training Approach to Enhance Whisper with Open-Vocabulary Keyword Spotting

Yuang Li, Min Zhang, Chang Su et al.

2024 INTERSPEECH