Retrofitting Structure-aware Transformer Language Model for End Tasks

Hao Fei; Yafeng Ren; Donghong Ji

2020 EMNLP EMNLP 2020

Retrofitting Structure-aware Transformer Language Model for End Tasks

Abstract

AbstractWe consider retrofitting structure-aware Transformer language model for facilitating end tasks by proposing to exploit syntactic distance to encode both the phrasal constituency and dependency connection into the language model. A middle-layer structural learning strategy is leveraged for structure integration, accomplished with main semantic task training under multi-task learning scheme. Experimental results show that the retrofitted structure-aware Transformer language model achieves improved perplexity, meanwhile inducing accurate syntactic phrases. By performing structure-aware fine-tuning, our model achieves significant improvements for both semantic- and syntactic-dependent tasks.

🌉 Interdisciplinary Bridge — Deep Learning and Machine Learning and Natural Language Processing

🧭 Keyword Pioneer — structure-aware fine-tuning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Hao Fei , Yafeng Ren , Donghong Ji

Topics

Machine Learning > Core Methods > Representation Learning Machine Learning > Learning Types > Self-Supervised Learning Deep Learning > Architectures > Transformers Natural Language Processing > Understanding > Syntax Machine Learning > Learning Types > Multi-Task Learning Natural Language Processing > Resources & Methods > Language Modeling Machine Learning > Learning Paradigms > Multi-Task Learning Deep Learning > Models > Transformers

Keywords

multi-task learning structure learning syntactic parsing language model transformer language model syntactic structure structural representation syntactic distance structure-aware fine-tuning

Download PDF

Related papers

Fast semantic parsing with well-typedness guarantees 2020

Detecting Objectifying Language in Online Professor Reviews 2020

Analogous Process Structure Induction for Sub-event Sequence Prediction 2020

Aspect Sentiment Classification with Aspect-Specific Opinion Spans 2020

Robust and Interpretable Grounding of Spatial References with Relation Networks 2020