2018
ACL
ACL 2018
Think Visually: Question Answering through Virtual Imagery
Abstract
AbstractIn this paper, we study the problem of geometric reasoning (a form of visual reasoning) in the context of question-answering. We introduce Dynamic Spatial Memory Network (DSMN), a new deep network architecture that specializes in answering questions that admit latent visual representations, and learns to generate and reason over such representations. Further, we propose two synthetic benchmarks, FloorPlanQA and ShapeIntersection, to evaluate the geometric reasoning capability of QA systems. Experimental results validate the effectiveness of our proposed DSMN for visual thinking tasks.
🌉
Interdisciplinary Bridge
— Computer Vision and Deep Learning and Natural Language Processing
📈
Trend Setter
— Visual Question Answering
🧭
Keyword Pioneer
— virtual imagery
🐣
Hot Topic Early Bird
— visual reasoning
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio