Detecting Extraneous Content in Podcasts

Sravana Reddy; Yongze Yu; Aasish Pappu; Aswin Sivaraman; Rezvaneh Rezapour; Rosie Jones

2021 EACL EACL 2021

Detecting Extraneous Content in Podcasts

Abstract

AbstractPodcast episodes often contain material extraneous to the main content, such as advertisements, interleaved within the audio and the written descriptions. We present classifiers that leverage both textual and listening patterns in order to detect such content in podcast descriptions and audio transcripts. We demonstrate that our models are effective by evaluating them on the downstream task of podcast summarization and show that we can substantively improve ROUGE scores and reduce the extraneous content generated in the summaries.

🌉 Interdisciplinary Bridge — Machine Learning and Natural Language Processing and Speech & Audio

🧭 Keyword Pioneer — content detection

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Security & Privacy, Speech & Audio

Authors

Sravana Reddy , Yongze Yu , Aasish Pappu , Aswin Sivaraman , Rezvaneh Rezapour , Rosie Jones

Topics

Machine Learning > Core Methods > Classification Natural Language Processing > Applications > Text Classification Speech & Audio > Analysis > Speech Analysis Machine Learning > Learning Types > Classification

Keywords

text classification audio classification content classification content detection

Download PDF

Related papers

Joint Coreference Resolution and Character Linking for Multiparty Conversation 2021

Progressively Pretrained Dense Corpus Index for Open-Domain Question Answering 2021

Crisscrossed Captions: Extended Intramodal and Intermodal Semantic Similarity Judgments for MS-COCO 2021

Representations for Question Answering from Documents with Tables and Text 2021

Gender and Racial Fairness in Depression Research using Social Media 2021