A Survey on Natural Language Counterfactual Generation

Yongjie Wang; Xiaoqi Qiu; Yu Yue; Xu Guo; Zhiwei Zeng; Yuhong Feng; Zhiqi Shen

EMNLP 2024

Abstract

Natural language counterfactual generation aims to minimally modify a given text such that the modified text will be classified into a different class. The generated counterfactuals provide insight into the reasoning behind a model’s predictions by highlighting which words significantly influence the outcomes. Additionally, they can be used to detect model fairness issues and augment the training data to enhance the model’s robustness. A substantial amount of research has been conducted to generate counterfactuals for various NLP tasks, employing different models and methodologies. With the rapid growth of studies in this field, a systematic review is crucial to guide future researchers and developers. To bridge this gap, this survey provides a comprehensive overview of textual counterfactual generation methods, particularly those based on Large Language Models. We propose a new taxonomy that systematically categorizes the generation methods into four groups and summarizes the metrics for evaluating the generation quality. Finally, we discuss ongoing research challenges and outline promising directions for future work.

Topics

Artificial Intelligence > Core AI > Interpretability; Machine Learning > Application Areas > Fairness; Natural Language Processing > Applications > Text Classification; Artificial Intelligence > Core AI > Fairness; Deep Learning > Learning Types > Adversarial Learning; Artificial Intelligence > Core AI > Natural Language Processing

Keywords

adversarial learning; natural language processing; text classification; model interpretability; counterfactual generation; model fairness; large language model; robustness augmentation
