Overview of the Third Shared Task on Speech Recognition for Vulnerable Individuals in Tamil

Bharathi B; Bharathi Raja Chakravarthi; Sripriya N; Rajeswari Natarajan; Suhasini S

EACL 2024

Abstract

This paper presents an overview of the shared task on speech recognition for vulnerable individuals in Tamil (LT-EDI-2024). The task comes with a Tamil dataset gathered from elderly individuals who identify as male, female, or transgender. The audio samples were recorded in public places such as marketplaces, vegetable shops, and hospitals. The dataset was released in two phases, one for training and one for testing. Participants were asked to process the audio signals using various models and techniques and to submit their results as transcriptions of the provided test samples. The participants' results were assessed using word error rate (WER). Participants employed transformer-based approaches to perform automatic speech recognition. This overview paper discusses the findings and the various pre-trained transformer-based models that the participants employed.
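The abstract names WER as the evaluation metric without defining it. As a minimal sketch, assuming the standard definition (word-level substitutions, deletions, and insertions divided by the number of reference words) and simple whitespace tokenization, it could be computed as follows. The function name `word_error_rate` is illustrative and not taken from the shared task; the organizers' actual scoring script and any text normalization they applied are not described here.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length.

    Assumption: words are separated by whitespace and compared exactly,
    with no normalization of punctuation or spelling variants.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1        # reference word missing from hypothesis
            insertion = curr[j - 1] + 1   # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr

    return prev[-1] / len(ref)
```

Under this definition a lower score is better, and the value can exceed 1.0 when the hypothesis contains many inserted words.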

Topics

Machine Learning > Application Areas > Efficient Computing
Speech & Audio > Recognition > Automatic Speech Recognition
Speech & Audio > Recognition > Speech Recognition
Speech & Audio > Analysis > Clinical Speech Analysis

Keywords

automatic speech recognition, multilingual speech processing, word error rate, Tamil language, vulnerable population, transformer model
