Evaluating Automatic Topic Segmentation as a Segment Retrieval Task

Abdessalam Bouchekif; Delphine Charlet; Geraldine Damnati; Nathalie Camelin; Yannick Estève

2017 INTERSPEECH INTERSPEECH 2017

Evaluating Automatic Topic Segmentation as a Segment Retrieval Task

Abstract

Several evaluation metrics have been proposed for topic segmentation. Most of them rely on the paradigm that segmentation is mainly a task that detects boundaries, and thus are oriented on boundary detection evaluation. Nevertheless, this paradigm is not appropriate to get homogeneous chapters, which is one of the major applications of topic segmentation. For instance on Broadcast News, topic segmentation enables users to watch a chapter independently of the others. We propose to consider segmentation as a task that detects homogeneous segments, and we propose evaluation metrics oriented on segment retrieval. The proposed metrics are experimented on various TV shows from different channels. Results are analysed and discussed, highlighting their relevance.

🧭 Keyword Pioneer — broadcast news

🐣 Hot Topic Early Bird — boundary detection

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Security & Privacy, Speech & Audio

Authors

Abdessalam Bouchekif , Delphine Charlet , Geraldine Damnati , Nathalie Camelin , Yannick Estève

Topics

Machine Learning > Core Methods > Classification

Keywords

boundary detection evaluation metric evaluation metrics topic segmentation broadcast news segment retrieval

Download PDF

Related papers

Description of the Munich-Passau Snore Sound Corpus (MPSSC) 2017

A Study on Replay Attack and Anti-Spoofing for Automatic Speaker Verification 2017

Binaural Reverberant Speech Separation Based on Deep Neural Networks 2017

Building Audio-Visual Phonetically Annotated Arabic Corpus for Expressive Text to Speech 2017

A Comparison of Danish Listeners’ Processing Cost in Judging the Truth Value of Norwegian, Swedish, and English Sentences 2017