StepGame: A New Benchmark for Robust Multi-Hop Spatial Reasoning in Texts

Zhengxiang Shi; Qiang Zhang; Aldo Lipani

2022 AAAI AAAI 2022

StepGame: A New Benchmark for Robust Multi-Hop Spatial Reasoning in Texts

Abstract

Abstract Inferring spatial relations in natural language is a crucial ability an intelligent system should possess. The bAbI dataset tries to capture tasks relevant to this domain (task 17 and 19). However, these tasks have several limitations. Most importantly, they are limited to fixed expressions, they are limited in the number of reasoning steps required to solve them, and they fail to test the robustness of models to input that contains irrelevant or redundant information. In this paper, we present a new Question-Answering dataset called StepGame for robust multi-step spatial reasoning in texts. Our experiments demonstrate that state-of-the-art models on the bAbI dataset struggle on the StepGame dataset. Moreover, we propose a Tensor-Product based Memory-Augmented Neural Network (TP-MANN) specialized for spatial reasoning tasks. Experimental results on both datasets show that our model outperforms all the baselines with superior generalization and robustness performance.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning and Knowledge & Reasoning and Machine Learning and Natural Language Processing

📈 Trend Setter — Reasoning

🧭 Keyword Pioneer — question-answering benchmark

🐣 Hot Topic Early Bird — spatial reasoning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Zhengxiang Shi , Qiang Zhang , Aldo Lipani

Topics

Artificial Intelligence > Core AI > Memory Natural Language Processing > Applications > Question Answering Knowledge & Reasoning > Reasoning > Automated Reasoning Artificial Intelligence > Core AI > Reasoning Deep Learning > Learning Types > Representation Learning Machine Learning > Learning Types > Reasoning

Keywords

question answering benchmark dataset spatial reasoning multi-hop reasoning tensor product question-answering benchmark memory-augmented neural network multi-hop spatial reasoning spatial relation inference tensor-product representation

Download PDF

Related papers

Dynamic Spatial Propagation Network for Depth Completion 2022

FedFR: Joint Optimization Federated Framework for Generic and Personalized Face Recognition 2022

Memory-Guided Semantic Learning Network for Temporal Sentence Grounding 2022

AnchorFace: Boosting TAR@FAR for Practical Face Recognition 2022

Parallel and High-Fidelity Text-to-Lip Generation 2022