Papers
16,685 papers found
ETLT 2021: Shared Task on Automatic Speech Recognition for Non-Native Children’s Speech
R. Gretter, Marco Matassoni, D. Falavigna et al.
Speech-Aware Long Context Pruning and Integration for Contextualized Automatic Speech Recognition
Yiming Rong, Yixin Zhang, Ziyi Wang et al.
Automatic Speech Recognition and Topic Identification from Speech for Almost-Zero-Resource Languages
Matthew Wiesner, Chunxi Liu, Lucas Ondel et al.
CyclicAugment: Speech Data Random Augmentation with Cosine Annealing Scheduler for Automatic Speech Recognition
Zhihan Wang, Feng Hou, Yuanhang Qiu et al.
Automatic Speech Disentanglement for Voice Conversion using Rank Module and Speech Augmentation
Zhonghua Liu, Shijun Wang, Ning Chen
Exploiting Diversity of Automatic Transcripts from Distinct Speech Recognition Techniques for Children’s Speech
Christopher Gebauer, Lars Rumberg, Hanna Ehlert et al.
Resource-Efficient Fine-Tuning Strategies for Automatic MOS Prediction in Text-to-Speech for Low-Resource Languages
Phat Do, Matt Coler, Jelske Dijkstra et al.
Training Data Augmentation for Dysarthric Automatic Speech Recognition by Text-to-Dysarthric-Speech Synthesis
Wing-Zin Leung, Mattias Cross, Anton Ragni et al.
Unsupervised Phonetic and Word Level Discovery for Speech to Speech Translation for Unwritten Languages
Steven Hillis, Anushree Prasanna Kumar, Alan W. Black
Investigating Speech Reconstruction for Laryngectomees for Silent Speech Interfaces
Beiming Cao, Nordine Sebkhi, Arpan Bhavsar et al.
Transcription-Free Fine-Tuning of Speech Separation Models for Noisy and Reverberant Multi-Speaker Automatic Speech Recognition
William Ravenscroft, George Close, Stefan Goetze et al.
The Effectiveness of Time Stretching for Enhancing Dysarthric Speech for Improved Dysarthric Speech Recognition
Luke Prananta, Bence Halpern, Siyuan Feng et al.
The NPU-MSXF Speech-to-Speech Translation System for IWSLT 2023 Speech-to-Speech Translation Task
Kun Song, Yi Lei, Peikun Chen et al.
Strategies for developing a Conversational Speech Dataset for Text-To-Speech Synthesis
Adaeze O. Adigwe, Esther Klabbers
Using the Outputs of Different Automatic Speech Recognition Paradigms for Acoustic- and BERT-Based Alzheimer’s Dementia Detection Through Spontaneous Speech
Yilin Pan, Bahman Mirheidari, Jennifer M. Harris et al.
Large Margin Hidden Markov Models for Automatic Speech Recognition
Fei Sha, Lawrence K. Saul
FastCorrect: Fast Error Correction with Edit Alignment for Automatic Speech Recognition
Yichong Leng, Xu Tan, Linchen Zhu et al.
Squeezeformer: An Efficient Transformer for Automatic Speech Recognition
Sehoon Kim, Amir Gholami, Albert Shaw et al.
SoftCorrect: Error Correction with Soft Detection for Automatic Speech Recognition
Yichong Leng, Xu Tan, Wenjie Liu et al.
Fine-tuning pre-trained models for Automatic Speech Recognition, experiments on a fieldwork corpus of Japhug (Trans-Himalayan family)
Séverine Guillaume, Guillaume Wisniewski, Cécile Macaire et al.
Listen Again and Choose the Right Answer: A New Paradigm for Automatic Speech Recognition with Large Language Models
Yuchen Hu, Chen Chen, Chengwei Qin et al.
DANCER: Entity Description Augmented Named Entity Corrector for Automatic Speech Recognition
Yi-Cheng Wang, Hsin-Wei Wang, Bi-Cheng Yan et al.
Fine-Tuning a Pre-Trained Wav2Vec2 Model for Automatic Speech Recognition- Experiments with De Zahrar Sproche
Andrea Gulli, Francesco Costantini, Diego Sidraschi et al.
Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition
Yash Jain, David M. Chan, Pranav Dheram et al.