HILDIF: Interactive Debugging of NLI Models Using Influence Functions

Hugo Zylberajch; Piyawat Lertvittayakumjorn; Francesca Toni

2021 ACL ACL 2021

HILDIF: Interactive Debugging of NLI Models Using Influence Functions

Abstract

AbstractBiases and artifacts in training data can cause unwelcome behavior in text classifiers (such as shallow pattern matching), leading to lack of generalizability. One solution to this problem is to include users in the loop and leverage their feedback to improve models. We propose a novel explanatory debugging pipeline called HILDIF, enabling humans to improve deep text classifiers using influence functions as an explanation method. We experiment on the Natural Language Inference (NLI) task, showing that HILDIF can effectively alleviate artifact problems in fine-tuned BERT models and result in increased model generalizability.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Natural Language Processing

🧭 Keyword Pioneer — explanatory debugging

🐣 Hot Topic Early Bird — influence function

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Security & Privacy, Speech & Audio

Authors

Hugo Zylberajch , Piyawat Lertvittayakumjorn , Francesca Toni

Topics

Artificial Intelligence > Core AI > Interpretability Natural Language Processing > Resources & Methods > Natural Language Inference

Keywords

natural language inference influence function explanatory debugging

Download PDF

Related papers

Out-of-Scope Intent Detection with Self-Supervision and Discriminative Training 2021

A Non-Autoregressive Edit-Based Approach to Controllable Text Simplification 2021

How Did This Get Funded?! Automatically Identifying Quirky Scientific Achievements 2021

Exploring Discourse Structures for Argument Impact Classification 2021

Language Embeddings for Typology and Cross-lingual Transfer Learning 2021