Does the Objective Matter? Comparing Training Objectives for Pronoun Resolution

Yordan Yordanov; Oana-Maria Camburu; Vid Kocijan; Thomas Lukasiewicz

2020 EMNLP EMNLP 2020

Does the Objective Matter? Comparing Training Objectives for Pronoun Resolution

Abstract

AbstractHard cases of pronoun resolution have been used as a long-standing benchmark for commonsense reasoning. In the recent literature, pre-trained language models have been used to obtain state-of-the-art results on pronoun resolution. Overall, four categories of training and evaluation objectives have been introduced. The variety of training datasets and pre-trained language models used in these works makes it unclear whether the choice of training objective is critical. In this work, we make a fair comparison of the performance and seed-wise stability of four models that represent the four categories of objectives. Our experiments show that the objective of sequence ranking performs the best in-domain, while the objective of semantic similarity between candidates and pronoun performs the best out-of-domain. We also observe a seed-wise instability of the model using sequence ranking, which is not the case when the other objectives are used.

❓ The Questioner

🌉 Interdisciplinary Bridge — Deep Learning and Machine Learning and Natural Language Processing

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Yordan Yordanov , Oana-Maria Camburu , Vid Kocijan , Thomas Lukasiewicz

Topics

Machine Learning > Optimization & Theory > Learning Theory Natural Language Processing > Understanding > Coreference Resolution Natural Language Processing > Understanding > Semantic Analysis Natural Language Processing > Applications > Natural Language Inference Machine Learning > Learning Types > Evaluation Deep Learning > Learning Types > Representation Learning

Keywords

coreference resolution semantic similarity language model training objective commonsense reasoning pronoun resolution winograd schema sequence ranking

Download PDF

Related papers

Fast semantic parsing with well-typedness guarantees 2020

Detecting Objectifying Language in Online Professor Reviews 2020

Analogous Process Structure Induction for Sub-event Sequence Prediction 2020

Aspect Sentiment Classification with Aspect-Specific Opinion Spans 2020

Robust and Interpretable Grounding of Spatial References with Relation Networks 2020