Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Core AI
Artificial Intelligence
›
Core AI
›
Speech Processing
181 directly classified papers
Papers per year
2015: 1
2016: 10
2017: 12
2018: 6
2019: 15
2020: 16
2021: 19
2022: 23
2023: 20
2024: 24
2025: 35
Papers
Can LLMs Understand Unvoiced Speech? Exploring EMG-to-Text Conversion with LLMs
ACL 2025
Mind the Gap: Static and Interactive Evaluations of Large Audio Models
ACL 2025
Towards Reliable Large Audio Language Model
ACL 2025
Context-Aware Lexical Stress Prediction and Phonemization for Ukrainian TTS Systems
ACL 2025
Investigating Prosodic Signatures via Speech Pre-Trained Models for Audio Deepfake Source Attribution
ACL 2025
Phonotomizer: A Compact, Unsupervised, Online Training Approach to Real-Time, Multilingual Phonetic Segmentation
ACL 2025
PACHAT: Persona-Aware Speech Assistant for Multi-party Dialogue
EMNLP 2025
BRSpeech-DF: A Deep Fake Synthetic Speech Dataset for Portuguese Zero-Shot TTS
EMNLP 2025
MockConf: A Student Interpretation Dataset: Analysis, Word- and Span-level Alignment and Baselines
ACL 2025
Spoken Conversational Agents with Large Language Models
EMNLP 2025
XLAVS-R: Cross-Lingual Audio-Visual Speech Representation Learning for Noise-Robust Speech Perception
ACL 2024
Speech vs. Transcript: Does It Matter for Human Annotators in Speech Summarization?
ACL 2024
LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models
ACL 2024
StyleDubber: Towards Multi-Scale Style Learning for Movie Dubbing
ACL 2024
“Allot?” is “A Lot!” Towards Developing More Generalized Speech Recognition System for Accessible Communication
AAAI 2024
Enhancing Zero-Shot Multi-Speaker TTS with Negated Speaker Representations
AAAI 2024
Cross-Talk Reduction
IJCAI 2024
WavLLM: Towards Robust and Adaptive Speech Large Language Model
EMNLP 2024
sign.mt: Real-Time Multilingual Sign Language Translation Application
EMNLP 2024
Textless Speech-to-Speech Translation With Limited Parallel Data
EMNLP 2024
Optimizing Rare Word Accuracy in Direct Speech Translation with a Retrieval-and-Demonstration Approach
EMNLP 2024
CTC-based Non-autoregressive Textless Speech-to-Speech Translation
ACL 2024
Simul-MuST-C: Simultaneous Multilingual Speech Translation Corpus Using Large Language Model
EMNLP 2024
Uncovering Syllable Constituents in the Self-Attention-Based Speech Representations of Whisper
EMNLP 2024
Growing Trees on Sounds: Assessing Strategies for End-to-End Dependency Parsing of Speech
ACL 2024
<
1
2
3
4
5
…
8
>