The student becomes the master: Outperforming GPT3 on Scientific Factual Error Correction

Dhananjay Ashok; Atharva Kulkarni; Hai Pham; Barnabás Póczos

2023 EMNLP EMNLP 2023

The student becomes the master: Outperforming GPT3 on Scientific Factual Error Correction

Abstract

AbstractDue to the prohibitively high cost of creating error correction datasets, most Factual Claim Correction methods rely on a powerful verification model to guide the correction process. This leads to a significant drop in performance in domains like Scientific Claim Correction, where good verification models do not always exist. In this work we introduce SciFix, a claim correction system that does not require a verifier but is able to outperform existing methods by a considerable margin — achieving correction accuracy of 84% on the SciFact dataset, 77% on SciFact-Open and 72.75% on the CovidFact dataset, compared to next best accuracies of 7.6%, 5% and 15% on the same datasets respectively. Our method leverages the power of prompting with LLMs during training to create a richly annotated dataset that can be used for fully supervised training and regularization. We additionally use a claim-aware decoding procedure to improve the quality of corrected claims. Our method outperforms the very LLM that was used to generate the annotated dataset — with FewShot Prompting on GPT3.5 achieving 58%, 61% and 64% on the respective datasets, a consistently lower correction accuracy, despite using nearly 800 times as many parameters as our model.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning and Machine Learning and Natural Language Processing

🧭 Keyword Pioneer — scientific claim correction

🐣 Hot Topic Early Bird — fact checking

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Dhananjay Ashok , Atharva Kulkarni , Hai Pham , Barnabás Póczos

Topics

Machine Learning > Application Areas > Knowledge Distillation Deep Learning > Techniques > Pretraining Natural Language Processing > Generation > Text Generation Natural Language Processing > Applications > Fact-Checking Artificial Intelligence > Core AI > Large Language Models Deep Learning > Learning Types > Knowledge Distillation Deep Learning > Learning Types > Generative Models

Keywords

factual error correction knowledge distillation text generation few-shot prompting fact checking scientific text large language model supervised training scientific claim correction claim correction

Download PDF

Related papers

Exploring Linguistic Probes for Morphological Generalization 2023

NameGuess: Column Name Expansion for Tabular Data 2023

Vision-Enhanced Semantic Entity Recognition in Document Images via Visually-Asymmetric Consistency Learning 2023

Improving Conversational Recommendation Systems via Bias Analysis and Language-Model-Enhanced Data Augmentation 2023

On the Calibration of Large Language Models and Alignment 2023