WikiCREM: A Large Unsupervised Corpus for Coreference Resolution

Vid Kocijan; Oana-Maria Camburu; Ana-Maria Cretu; Yordan Yordanov; Phil Blunsom; Thomas Lukasiewicz

2019 EMNLP EMNLP 2019

WikiCREM: A Large Unsupervised Corpus for Coreference Resolution

Abstract

AbstractPronoun resolution is a major area of natural language understanding. However, large-scale training sets are still scarce, since manually labelling data is costly. In this work, we introduce WikiCREM (Wikipedia CoREferences Masked) a large-scale, yet accurate dataset of pronoun disambiguation instances. We use a language-model-based approach for pronoun resolution in combination with our WikiCREM dataset. We compare a series of models on a collection of diverse and challenging coreference resolution problems, where we match or outperform previous state-of-the-art approaches on 6 out of 7 datasets, such as GAP, DPR, WNLI, PDP, WinoBias, and WinoGender. We release our model to be used off-the-shelf for solving pronoun disambiguation.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Machine Learning and Natural Language Processing

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Vid Kocijan , Oana-Maria Camburu , Ana-Maria Cretu , Yordan Yordanov , Phil Blunsom , Thomas Lukasiewicz

Topics

Artificial Intelligence > Learning Paradigms > Transfer Learning Machine Learning > Learning Types > Unsupervised Learning Natural Language Processing > Understanding > Coreference Resolution Natural Language Processing > Resources & Methods > Language Modeling Natural Language Processing > Applications > Coreference Resolution

Keywords

unsupervised learning named entity recognition coreference resolution language model named entity pronoun resolution pronoun disambiguation

Download PDF

Related papers

Read, Attend and Comment: A Deep Architecture for Automatic News Comment Generation 2019

Chains-of-Reasoning at TextGraphs 2019 Shared Task: Reasoning over Chains of Facts for Explainable Multi-hop Inference 2019

A Boundary-aware Neural Model for Nested Named Entity Recognition 2019

Iterative Dual Domain Adaptation for Neural Machine Translation 2019

A Multi-Pairwise Extension of Procrustes Analysis for Multilingual Word Translation 2019