On Generalization in Coreference Resolution

Shubham Toshniwal; Patrick Xia; Sam Wiseman; Karen Livescu; Kevin Gimpel

EMNLP 2021

Abstract

While coreference resolution is defined independently of dataset domain, most models for performing coreference resolution do not transfer well to unseen domains. We consolidate a set of 8 coreference resolution datasets targeting different domains to evaluate the off-the-shelf performance of models. We then mix three datasets for training; even though their domain, annotation guidelines, and metadata differ, we propose a method for jointly training a single model on this heterogeneous data mixture by using data augmentation to account for annotation differences and sampling to balance the data quantities. We find that in a zero-shot setting, models trained on a single dataset transfer poorly while joint training yields improved overall performance, leading to better generalization in coreference resolution models. This work contributes a new benchmark for robust coreference resolution and multiple new state-of-the-art results.
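As a rough illustration of "sampling to balance the data quantities", the sketch below caps how many documents each dataset contributes to one training epoch, so that a large corpus does not swamp smaller ones. It is a minimal sketch under stated assumptions, not the authors' implementation: the function name `sample_balanced_epoch`, the `cap` parameter, and the toy dataset names and sizes are all hypothetical, and the abstract does not say which sampling scheme the paper actually uses.

```python
import random


def sample_balanced_epoch(datasets, cap, seed=0):
    """Draw at most `cap` documents from each dataset, then shuffle the mixture.

    `datasets` maps a dataset name to a list of documents. The per-dataset
    cap is an assumed balancing scheme, chosen only for illustration.
    """
    rng = random.Random(seed)
    mixture = []
    for name, docs in datasets.items():
        k = min(cap, len(docs))
        # Sample without replacement so no document repeats within an epoch.
        picked = rng.sample(docs, k)
        mixture.extend((name, doc) for doc in picked)
    # Interleave the datasets so batches are not grouped by source.
    rng.shuffle(mixture)
    return mixture


# Toy stand-ins for three training corpora of very different sizes.
datasets = {
    "large": [f"large_doc_{i}" for i in range(1000)],
    "medium": [f"medium_doc_{i}" for i in range(100)],
    "small": [f"small_doc_{i}" for i in range(10)],
}
epoch = sample_balanced_epoch(datasets, cap=50, seed=1)
```

Calling the function with a different seed for each epoch would let the larger corpora contribute a fresh subset every time, while the smallest corpus is seen in full on every pass.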

Topics

Machine Learning > Application Areas > Data Augmentation
Natural Language Processing > Understanding > Coreference Resolution
Machine Learning > Learning Types > Transfer Learning
Machine Learning > Learning Types > Domain Generalization
Deep Learning > Learning Types > Zero-Shot Learning
Artificial Intelligence > Core AI > Natural Language Processing

Keywords

zero-shot learning, domain generalization, data augmentation, coreference resolution, heterogeneous data, joint training, neural network
