Jump to better conclusions: SCAN both left and right

Jasmijn Bastings; Marco Baroni; Jason Weston; Kyunghyun Cho; Douwe Kiela

2018 EMNLP EMNLP 2018

Jump to better conclusions: SCAN both left and right

Abstract

AbstractLake and Baroni (2018) recently introduced the SCAN data set, which consists of simple commands paired with action sequences and is intended to test the strong generalization abilities of recurrent sequence-to-sequence models. Their initial experiments suggested that such models may fail because they lack the ability to extract systematic rules. Here, we take a closer look at SCAN and show that it does not always capture the kind of generalization that it was designed for. To mitigate this we propose a complementary dataset, which requires mapping actions back to the original commands, called NACS. We show that models that do well on SCAN do not necessarily do well on NACS, and that NACS exhibits properties more closely aligned with realistic use-cases for sequence-to-sequence models.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning and Natural Language Processing

🧭 Keyword Pioneer — scan dataset

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Jasmijn Bastings , Marco Baroni , Jason Weston , Kyunghyun Cho , Douwe Kiela

Topics

Natural Language Processing > Applications > Semantic Parsing Artificial Intelligence > Core AI > Natural Language Processing Deep Learning > Architectures > Recurrent Neural Networks

Keywords

natural language inference semantic parsing recurrent neural network sequence-to-sequence model scan dataset

Download PDF

Related papers

Speeding Up Neural Machine Translation Decoding by Cube Pruning 2018

Limitations in learning an interpreted language with recurrent models 2018

Results of the sixth edition of the BioASQ Challenge 2018

Neural Segmental Hypergraphs for Overlapping Mention Recognition 2018

Hybrid Neural Attention for Agreement/Disagreement Inference in Online Debates 2018