CausalQA: A Benchmark for Causal Question Answering

Alexander Bondarenko; Magdalena Wolska; Stefan Heindorf; Lukas Blübaum; Axel-Cyrille Ngonga Ngomo; Benno Stein; Pavel Braslavski; Matthias Hagen; Martin Potthast

COLING 2022

Abstract

At least 5% of questions submitted to search engines ask about cause-effect relationships in some way. To support the development of tailored approaches that can answer such questions, we construct Webis-CausalQA-22, a benchmark corpus of 1.1 million causal questions with answers. We distinguish different types of causal questions using a novel typology derived from a data-driven, manual analysis of questions from ten large question answering (QA) datasets. Using high-precision lexical rules, we extract causal questions of each type from these datasets to create our corpus. As an initial baseline, the state-of-the-art QA model UnifiedQA achieves a ROUGE-L F1 score of 0.48 on our new benchmark.
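For readers unfamiliar with the metric behind the reported baseline, the following is a minimal sketch of how a ROUGE-L F1 score can be computed for a single predicted answer against a single reference answer. It uses the longest common subsequence (LCS) of tokens to derive precision and recall. The lowercased whitespace tokenization and the function names `lcs_length` and `rouge_l_f1` are assumptions made for illustration; the abstract does not specify the evaluation setup, so this should not be read as the authors' implementation.

```python
def lcs_length(a, b):
    """Length of the longest common subsequence of two token lists."""
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            if x == y:
                # Extend the subsequence that ended before both tokens.
                curr.append(prev[j - 1] + 1)
            else:
                # Carry over the best result seen so far.
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def rouge_l_f1(prediction, reference):
    """ROUGE-L F1 for one prediction/reference pair.

    Assumption: tokens are lowercased, whitespace-separated words.
    """
    pred_tokens = prediction.lower().split()
    ref_tokens = reference.lower().split()
    if not pred_tokens or not ref_tokens:
        return 0.0
    lcs = lcs_length(pred_tokens, ref_tokens)
    if lcs == 0:
        return 0.0
    precision = lcs / len(pred_tokens)  # share of predicted tokens in the LCS
    recall = lcs / len(ref_tokens)      # share of reference tokens in the LCS
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    # Hypothetical example pair, not taken from the corpus.
    print(rouge_l_f1("smoking causes cancer", "smoking causes lung cancer"))
```

A corpus-level score such as the one quoted above would typically be an average of per-question scores, though the exact aggregation is not stated in the abstract.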

Topics

Artificial Intelligence > Core AI > Causal Inference
Natural Language Processing > Applications > Question Answering
Artificial Intelligence > Core AI > Reasoning
Machine Learning > Learning Types > Retrieval-Augmented Generation

Keywords

causal inference, causal reasoning, question answering, information retrieval, benchmark dataset, causal question answering, cause-effect relationship, question answering benchmark, search engine query
