Self-Assessed Affect Recognition Using Fusion of Attentional BLSTM and Static Acoustic Features

Bo-Hao Su; Sung-Lin Yeh; Ming-Ya Ko; Huan-Yu Chen; Shun-Chang Zhong; Jeng-Lin Li; Chi-Chun Lee

2018 INTERSPEECH INTERSPEECH 2018

Self-Assessed Affect Recognition Using Fusion of Attentional BLSTM and Static Acoustic Features

Abstract

In this study, we present a computational framework to participate in the Self-Assessed Affect Sub-Challenge in the INTERSPEECH 2018 Computation Paralinguistics Challenge. The goal of this sub-challenge is to classify the valence scores given by the speaker themselves into three different levels, i.e., low, medium and high. We explore fusion of Bi-directional LSTM with baseline SVM models to improve the recognition accuracy. In specifics, we extract frame-level acoustic LLDs as input to the BLSTM with a modified attention mechanism and separate SVMs are trained using the standard ComParE_16 baseline feature sets with minority class upsampling. These diverse prediction results are then further fused using a decision-level score fusion scheme to integrate all of the developed models. Our proposed approach achieves a 62.94% and 67.04% unweighted average recall (UAR), which is an 6.24% and 1.04% absolute improvement over the best baseline provided by the challenge organizer. We further provide a detailed comparison analysis between different models.

🌉 Interdisciplinary Bridge — Deep Learning and Machine Learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Bo-Hao Su , Sung-Lin Yeh , Ming-Ya Ko , Huan-Yu Chen , Shun-Chang Zhong , Jeng-Lin Li , Chi-Chun Lee

Topics

Machine Learning > Core Methods > Classification Deep Learning > Architectures > Neural Networks

Keywords

support vector machine bidirectional lstm acoustic feature score fusion affect recognition valence classification

Download PDF

Related papers

HoloCompanion: An MR Friend for EveryOne 2018

Estimation of the Vocal Tract Length of Vowel Sounds Based on the Frequency of the Significant Spectral Valley 2018

Deep Learning Techniques for Koala Activity Detection 2018

An Exploration of Local Speaking Rate Variations in Mandarin Read Speech 2018

Acoustic Analysis of Whispery Voice Disguise in Mandarin Chinese 2018