CACTAS - Collaborative Audio Categorization and Transcription for ASR Systems

Mithul Mathivanan; Kinnera Saranu; Abhishek Pandey; Jithendra Vepa

INTERSPEECH 2018

Abstract

We present a web-based tool for collaborative analysis and/or transcription of audio in the context of Automatic Speech Recognition (ASR) systems. The tool presents a web page listing audio recordings together with their reference transcripts and the ASR hypotheses, which are obtained offline. Additional information and features allow users to categorize the recordings and correct the references collaboratively and efficiently, almost 10 times faster, without requiring prior knowledge of speech or ASR systems. The resulting analysis can then be summarized and acted upon to improve or triage the ASR system.
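
As a rough illustration of the workflow described above, the following minimal sketch models one entry on such a page (audio, reference, hypothesis, category) and summarizes the assigned categories. It is not the tool's actual implementation: the names `Utterance`, `correct_reference`, and `summarize`, the file names, and the category label `asr_error` are all assumptions made for this example.

```python
from collections import Counter
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Utterance:
    """One row on the page (hypothetical structure, not from the paper)."""
    audio_path: str                  # hypothetical path to the audio file
    reference: str                   # human transcript, editable by annotators
    hypothesis: str                  # ASR output obtained offline
    category: Optional[str] = None   # label assigned by an annotator, if any


def correct_reference(utt: Utterance, new_reference: str) -> None:
    """Replace the reference transcript with an annotator's correction."""
    utt.reference = new_reference.strip()


def summarize(utterances: List[Utterance]) -> Counter:
    """Count utterances per category; unlabeled ones are grouped together."""
    return Counter(u.category or "uncategorized" for u in utterances)


# Example usage with made-up data.
utterances = [
    Utterance("a.wav", "turn on the light", "turn on the light"),
    Utterance("b.wav", "call mum", "call tom"),
]
utterances[1].category = "asr_error"          # hypothetical category label
correct_reference(utterances[1], "call mom ")
summary = summarize(utterances)
```

A per-category count like `summary` is one simple way the collected analysis could be summarized before deciding where to focus ASR improvements.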

Topics

Speech & Audio > Recognition > Automatic Speech Recognition

Keywords

automatic speech recognition, audio analysis, collaborative transcription
