Mapping natural language commands to web elements

Panupong Pasupat; Tian-Shun Jiang; Evan Liu; Kelvin Guu; Percy Liang

2018 EMNLP EMNLP 2018

Mapping natural language commands to web elements

Abstract

AbstractThe web provides a rich, open-domain environment with textual, structural, and spatial properties. We propose a new task for grounding language in this environment: given a natural language command (e.g., “click on the second article”), choose the correct element on the web page (e.g., a hyperlink or text box). We collected a dataset of over 50,000 commands that capture various phenomena such as functional references (e.g. “find who made this site”), relational reasoning (e.g. “article by john”), and visual reasoning (e.g. “top-most article”). We also implemented and analyzed three baseline models that capture different phenomena present in the dataset.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Computer Vision and Machine Learning and Natural Language Processing

🧭 Keyword Pioneer — element grounding

🐣 Hot Topic Early Bird — visual reasoning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Speech & Audio

Authors

Panupong Pasupat , Tian-Shun Jiang , Evan Liu , Kelvin Guu , Percy Liang

Topics

Machine Learning > Core Methods > Classification Natural Language Processing > Applications > Intent Classification Computer Vision > Core AI > Multimodal Learning Machine Learning > Learning Types > Multi-Modal Learning Artificial Intelligence > Core AI > Language

Keywords

visual reasoning language grounding relational reasoning natural language command element grounding element classification web element

Download PDF

Related papers

Speeding Up Neural Machine Translation Decoding by Cube Pruning 2018

Limitations in learning an interpreted language with recurrent models 2018

Results of the sixth edition of the BioASQ Challenge 2018

Neural Segmental Hypergraphs for Overlapping Mention Recognition 2018

Hybrid Neural Attention for Agreement/Disagreement Inference in Online Debates 2018