Cross-lingual tagger evaluation without test data

Željko Agić; Barbara Plank; Anders Søgaard

2017 EACL EACL 2017

Cross-lingual tagger evaluation without test data

Abstract

AbstractWe address the challenge of cross-lingual POS tagger evaluation in absence of manually annotated test data. We put forth and evaluate two dictionary-based metrics. On the tasks of accuracy prediction and system ranking, we reveal that these metrics are reliable enough to approximate test set-based evaluation, and at the same time lean enough to support assessment for truly low-resource languages.

🌉 Interdisciplinary Bridge — Machine Learning and Natural Language Processing

🧭 Keyword Pioneer — cross-lingual evaluation

🐣 Hot Topic Early Bird — low-resource language

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Security & Privacy, Speech & Audio

Authors

Željko Agić , Barbara Plank , Anders Søgaard

Topics

Machine Learning > Learning Types > Zero-Shot Learning Natural Language Processing > Understanding > Part-of-Speech Tagging Natural Language Processing > Applications > Text Classification Natural Language Processing > Resources & Methods > Multilingual NLP Machine Learning > Learning Types > Transfer Learning Natural Language Processing > Applications > Named Entity Recognition

Keywords

part-of-speech tagging low-resource language cross-lingual evaluation cross-lingual pos tagger dictionary-based metric accuracy prediction system ranking dictionary-based evaluation

Download PDF

Related papers

Cross-Lingual Dependency Parsing with Late Decoding for Truly Low-Resource Languages 2017

Learning and Knowledge Transfer with Memory Networks for Machine Comprehension 2017

Is this a Child, a Girl or a Car? Exploring the Contribution of Distributional Similarity to Learning Referential Word Meanings 2017

Building Web-Interfaces for Vector Semantic Models with the WebVectors Toolkit 2017

Assessing Convincingness of Arguments in Online Debates with Limited Number of Features 2017