Towards Debiasing Fact Verification Models

Tal Schuster; Darsh Shah; Yun Jie Serene Yeo; Daniel Roberto Filizzola Ortiz; Enrico Santus; Regina Barzilay

2019 EMNLP EMNLP 2019

Towards Debiasing Fact Verification Models

Abstract

AbstractFact verification requires validating a claim in the context of evidence. We show, however, that in the popular FEVER dataset this might not necessarily be the case. Claim-only classifiers perform competitively with top evidence-aware models. In this paper, we investigate the cause of this phenomenon, identifying strong cues for predicting labels solely based on the claim, without considering any evidence. We create an evaluation set that avoids those idiosyncrasies. The performance of FEVER-trained models significantly drops when evaluated on this test set. Therefore, we introduce a regularization method which alleviates the effect of bias in the training data, obtaining improvements on the newly created test set. This work is a step towards a more sound evaluation of reasoning capabilities in fact verification models.

🌉 Interdisciplinary Bridge — Deep Learning and Machine Learning and Natural Language Processing

🧭 Keyword Pioneer — reasoning capability

🐣 Hot Topic Early Bird — fact verification

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Security & Privacy, Speech & Audio

Authors

Tal Schuster , Darsh Shah , Yun Jie Serene Yeo , Daniel Roberto Filizzola Ortiz , Enrico Santus , Regina Barzilay

Topics

Machine Learning > Application Areas > Fairness Natural Language Processing > Applications > Fact-Checking Deep Learning > Learning Types > Transfer Learning

Keywords

fact verification bias detection reasoning capability fever dataset claim-only classification claim-only classifier

Download PDF

Related papers

Read, Attend and Comment: A Deep Architecture for Automatic News Comment Generation 2019

Chains-of-Reasoning at TextGraphs 2019 Shared Task: Reasoning over Chains of Facts for Explainable Multi-hop Inference 2019

A Boundary-aware Neural Model for Nested Named Entity Recognition 2019

Iterative Dual Domain Adaptation for Neural Machine Translation 2019

A Multi-Pairwise Extension of Procrustes Analysis for Multilingual Word Translation 2019