2017
EACL
EACL 2017
Age Group Classification with Speech and Metadata Multimodality Fusion
Abstract
AbstractChildren comprise a significant proportion of TV viewers and it is worthwhile to customize the experience for them. However, identifying who is a child in the audience can be a challenging task. We present initial studies of a novel method which combines utterances with user metadata. In particular, we develop an ensemble of different machine learning techniques on different subsets of data to improve child detection. Our initial results show an 9.2% absolute improvement over the baseline, leading to a state-of-the-art performance.
🌱
Topic Pioneer
— Analysis
🌉
Interdisciplinary Bridge
— Interdisciplinary and Machine Learning and Speech & Audio
🧭
Keyword Pioneer
— age group classification
🐣
Hot Topic Early Bird
— multi-modal learning
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio