A Multi-lingual Multi-task Architecture for Low-resource Sequence Labeling

Ying Lin; Shengqi Yang; Veselin Stoyanov; Heng Ji

ACL 2018

Abstract

We propose a multi-lingual multi-task architecture to develop supervised models with a minimal amount of labeled data for sequence labeling. In this new architecture, we combine various transfer models using two layers of parameter sharing. On the first layer, we construct the basis of the architecture to provide universal word representation and feature extraction capability for all models. On the second layer, we adopt different parameter sharing strategies for different transfer schemes. This architecture proves to be particularly effective in low-resource settings, where there are fewer than 200 training sentences for the target task. Using Name Tagging as the target task, our approach achieved 4.3%-50.5% absolute F-score gains compared to the mono-lingual single-task baseline model.
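
The two layers of parameter sharing described in the abstract can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration: the `Param` stand-in, the `build_models` and `train_step` helpers, the language and task labels (including the auxiliary `pos_tagging` task), and the choice of which blocks are shared per language or per task. The abstract does not specify these details, so this is not the authors' implementation.

```python
from itertools import product


class Param:
    """Hypothetical stand-in for a block of trainable weights."""

    def __init__(self, name):
        self.name = name
        self.updates = 0  # number of training steps that touched this block


def build_models(languages, tasks):
    """Build one model per (language, task) pair with two levels of sharing."""
    # First level: a single encoder object reused by every model.
    shared_encoder = Param("encoder")
    # Second level (assumed split): one block per language, one per task.
    per_language = {lang: Param("lang:" + lang) for lang in languages}
    per_task = {task: Param("task:" + task) for task in tasks}

    models = {}
    for lang, task in product(languages, tasks):
        models[(lang, task)] = {
            "encoder": shared_encoder,       # shared by all models
            "language": per_language[lang],  # shared across tasks in one language
            "output": per_task[task],        # shared across languages for one task
        }
    return models


def train_step(model):
    """Pretend to apply one gradient step to every block the model uses."""
    for param in model.values():
        param.updates += 1


# Hypothetical labels; only name tagging is mentioned in the abstract.
models = build_models(["target_lang", "related_lang"],
                      ["name_tagging", "pos_tagging"])
for model in models.values():
    train_step(model)

target = models[("target_lang", "name_tagging")]
print(target["encoder"].updates, target["language"].updates, target["output"].updates)
```

In this sketch, sharing is simply object identity: a block that appears in several models receives updates from all of them, which is how scarce target-task data can benefit from training signal in the other models.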

Topics

Machine Learning > Learning Types > Semi-Supervised Learning
Machine Learning > Application Areas > Efficient Computing
Machine Learning > Learning Paradigms > Transfer Learning
Natural Language Processing > Applications > Named Entity Recognition
Deep Learning > Learning Types > Multi-Task Learning

Keywords

multi-task learning, transfer learning, sequence labeling, parameter sharing, name tagging, multi-lingual learning
