Confusion Detection for Adaptive Conversational Strategies of An Oral Proficiency Assessment Interview Agent

Mao Saeki; Kotoka Miyagi; Shinya Fujie; Shungo Suzuki; Tetsuji Ogawa; Tetsunori Kobayashi; Yoichi Matsuyama

2022 INTERSPEECH INTERSPEECH 2022

Confusion Detection for Adaptive Conversational Strategies of An Oral Proficiency Assessment Interview Agent

Abstract

In this study, we present a model to detect user confusion in an online interview dialogue using conversational agents. Conversational agents have gained attention for reliable assessment of language learners' oral skills in interviews. Learners often face confusion, where they fail to understand what the system has said, and may end up unable to respond, leading to a conversational breakdown. It is thus crucial for the system to detect such a state and keep the interview going forward by repeating or rephrasing the previous system utterance. To this end, we first collected a dataset of user confusion using a psycholinguistic experimental approach and identified seven multimodal signs of confusion, some of which were unique to an online conversation. With the corresponding features, we trained a classification model of user confusion. An ablation study showed that the features related to self-talk and gaze direction were most predictive. We discuss how this model can assist a conversational agent to detect and resolve user confusion in real-time.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Machine Learning and Natural Language Processing

🧭 Keyword Pioneer — confusion detection

🐣 Hot Topic Early Bird — conversational agent

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Mao Saeki , Kotoka Miyagi , Shinya Fujie , Shungo Suzuki , Tetsuji Ogawa , Tetsunori Kobayashi , Yoichi Matsuyama

Topics

Artificial Intelligence > Core AI > Agent Systems Artificial Intelligence > Core AI > Human-AI Interaction Machine Learning > Core Methods > Classification Natural Language Processing > Applications > Dialogue Systems Machine Learning > Learning Types > Multi-Modal Learning

Keywords

multimodal classification conversational agent dialogue system adaptive strategy confusion detection oral proficiency oral proficiency assessment

Download PDF

Related papers

Example-based Explanations with Adversarial Attacks for Respiratory Sound Analysis 2022

Which Model is Best: Comparing Methods and Metrics for Automatic Laughter Detection in a Naturalistic Conversational Dataset 2022

Evidence of Onset and Sustained Neural Responses to Isolated Phonemes from Intracranial Recordings in a Voice-based Cursor Control Task 2022

Pre-trained Speech Representations as Feature Extractors for Speech Quality Assessment in Online Conferencing Applications 2022

Exploring the influence of fine-tuning data on wav2vec 2.0 model for blind speech quality prediction 2022