2018
EMNLP
EMNLP 2018
A Knowledge-Grounded Multimodal Search-Based Conversational Agent
Abstract
AbstractMultimodal search-based dialogue is a challenging new task: It extends visually grounded question answering systems into multi-turn conversations with access to an external database. We address this new challenge by learning a neural response generation system from the recently released Multimodal Dialogue (MMD) dataset (Saha et al., 2017). We introduce a knowledge-grounded multimodal conversational model where an encoded knowledge base (KB) representation is appended to the decoder input. Our model substantially outperforms strong baselines in terms of text-based similarity measures (over 9 BLEU points, 3 of which are solely due to the use of additional information from the KB).
🌉
Interdisciplinary Bridge
— Artificial Intelligence and Deep Learning and Knowledge & Reasoning and Machine Learning and Natural Language Processing
📈
Trend Setter
— Retrieval-Augmented Generation
🧭
Keyword Pioneer
— search-based conversation
🐣
Hot Topic Early Bird
— conversational agent
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio
Authors
Topics
Natural Language Processing > Generation > Dialogue Systems
Natural Language Processing > Applications > Information Retrieval
Knowledge & Reasoning > Representation > Knowledge Graphs
Natural Language Processing > Applications > Dialogue Systems
Machine Learning > Learning Types > Multi-Modal Learning
Artificial Intelligence > Core AI > Multi-Modal Learning
Deep Learning > Learning Types > Retrieval-Augmented Generation
Artificial Intelligence > Core AI > Dialogue Systems