Character-level Supervision for Low-resource POS Tagging

Katharina Kann; Johannes Bjerva; Isabelle Augenstein; Barbara Plank; Anders Søgaard

2018 ACL ACL 2018

Character-level Supervision for Low-resource POS Tagging

Abstract

AbstractNeural part-of-speech (POS) taggers are known to not perform well with little training data. As a step towards overcoming this problem, we present an architecture for learning more robust neural POS taggers by jointly training a hierarchical, recurrent model and a recurrent character-based sequence-to-sequence network supervised using an auxiliary objective. This way, we introduce stronger character-level supervision into the model, which enables better generalization to unseen words and provides regularization, making our encoding less prone to overfitting. We experiment with three auxiliary tasks: lemmatization, character-based word autoencoding, and character-based random string autoencoding. Experiments with minimal amounts of labeled data on 34 languages show that our new architecture outperforms a single-task baseline and, surprisingly, that, on average, raw text autoencoding can be as beneficial for low-resource POS tagging as using lemma information. Our neural POS tagger closes the gap to a state-of-the-art POS tagger (MarMoT) for low-resource scenarios by 43%, even outperforming it on languages with templatic morphology, e.g., Arabic, Hebrew, and Turkish, by some margin.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning and Machine Learning and Natural Language Processing

🧭 Keyword Pioneer — character-level supervision

🐣 Hot Topic Early Bird — low-resource learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Katharina Kann , Johannes Bjerva , Isabelle Augenstein , Barbara Plank , Anders Søgaard

Topics

Artificial Intelligence > Learning Paradigms > Transfer Learning Natural Language Processing > Understanding > Part-of-Speech Tagging Machine Learning > Learning Types > Transfer Learning Deep Learning > Learning Types > Representation Learning Deep Learning > Learning Types > Multi-Task Learning Machine Learning > Core Methods > Sequence Labeling Natural Language Processing > Applications > Part-of-Speech Tagging

Keywords

multi-task learning transfer learning part-of-speech tagging low-resource learning low-resource language recurrent neural network auxiliary task character-level model neural network character-level supervision

Download PDF

Related papers

Economic Event Detection in Company-Specific News Text 2018

Investigating Effective Parameters for Fine-tuning of Word Embeddings Using Only a Small Corpus 2018

SemAxis: A Lightweight Framework to Characterize Domain-Specific Word Semantics Beyond Sentiment 2018

Fighting Offensive Language on Social Media with Unsupervised Text Style Transfer 2018

Affordances in Grounded Language Learning 2018