Maximum Margin Reward Networks for Learning from Explicit and Implicit Supervision

Haoruo Peng; Ming-Wei Chang; Wen-tau Yih

2017 EMNLP EMNLP 2017

Maximum Margin Reward Networks for Learning from Explicit and Implicit Supervision

Abstract

AbstractNeural networks have achieved state-of-the-art performance on several structured-output prediction tasks, trained in a fully supervised fashion. However, annotated examples in structured domains are often costly to obtain, which thus limits the applications of neural networks. In this work, we propose Maximum Margin Reward Networks, a neural network-based framework that aims to learn from both explicit (full structures) and implicit supervision signals (delayed feedback on the correctness of the predicted structure). On named entity recognition and semantic parsing, our model outperforms previous systems on the benchmark datasets, CoNLL-2003 and WebQuestionsSP.

🌉 Interdisciplinary Bridge — Deep Learning and Machine Learning and Natural Language Processing

🧭 Keyword Pioneer — reward network

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Haoruo Peng , Ming-Wei Chang , Wen-tau Yih

Topics

Machine Learning > Core Methods > Classification Machine Learning > Optimization & Theory > Optimization Natural Language Processing > Understanding > Named Entity Recognition Natural Language Processing > Applications > Named Entity Recognition Deep Learning > Learning Types > Reinforcement Learning Natural Language Processing > Applications > Semantic Parsing

Keywords

structured prediction named entity recognition semantic parsing maximum margin structured output maximum margin learning reward network implicit supervision

Download PDF

Related papers

Reinforced Video Captioning with Entailment Rewards 2017

Cross-lingual Character-Level Neural Morphological Tagging 2017

Inter-Weighted Alignment Network for Sentence Pair Modeling 2017

Investigating Different Syntactic Context Types and Context Representations for Learning Word Embeddings 2017

An Empirical Analysis of Edit Importance between Document Versions 2017