A Joint Many-Task Model: Growing a Neural Network for Multiple NLP Tasks

Kazuma Hashimoto; Caiming Xiong; Yoshimasa Tsuruoka; Richard Socher

2017 EMNLP EMNLP 2017

A Joint Many-Task Model: Growing a Neural Network for Multiple NLP Tasks

Abstract

AbstractTransfer and multi-task learning have traditionally focused on either a single source-target pair or very few, similar tasks. Ideally, the linguistic levels of morphology, syntax and semantics would benefit each other by being trained in a single model. We introduce a joint many-task model together with a strategy for successively growing its depth to solve increasingly complex tasks. Higher layers include shortcut connections to lower-level task predictions to reflect linguistic hierarchies. We use a simple regularization term to allow for optimizing all model weights to improve one task’s loss without exhibiting catastrophic interference of the other tasks. Our single end-to-end model obtains state-of-the-art or competitive results on five different tasks from tagging, parsing, relatedness, and entailment tasks.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning and Machine Learning and Natural Language Processing

📈 Trend Setter — Transfer Learning

🧭 Keyword Pioneer — joint many-task model

🐣 Hot Topic Early Bird — sequence tagging

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Kazuma Hashimoto , Caiming Xiong , Yoshimasa Tsuruoka , Richard Socher

Topics

Artificial Intelligence > Learning Paradigms > Transfer Learning Machine Learning > Core Methods > Representation Learning Deep Learning > Architectures > Neural Networks Machine Learning > Learning Types > Multi-Task Learning Natural Language Processing > Resources & Methods > Transfer Learning Machine Learning > Learning Paradigms > Multi-Task Learning Deep Learning > Learning Types > Transfer Learning Deep Learning > Learning Types > Multi-Task Learning Natural Language Processing > Applications > Natural Language Understanding

Keywords

representation learning multi-task learning transfer learning natural language processing sequence tagging semantic similarity neural network joint many-task model

Download PDF

Related papers

Reinforced Video Captioning with Entailment Rewards 2017

Cross-lingual Character-Level Neural Morphological Tagging 2017

Inter-Weighted Alignment Network for Sentence Pair Modeling 2017

Investigating Different Syntactic Context Types and Context Representations for Learning Word Embeddings 2017

An Empirical Analysis of Edit Importance between Document Versions 2017