Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback

Paul Roit; Johan Ferret; Lior Shani; Roee Aharoni; Geoffrey Cideron; Robert Dadashi; Matthieu Geist; Sertan Girgin; Leonard Hussenot; Orgad Keller; Nikola Momchev; Sabela Ramos Garea; Piotr Stańczyk; Nino Vieillard; Olivier Bachem; Gal Elidan; Avinatan Hassidim; Olivier Pietquin; Idan Szpektor

2023 ACL ACL 2023

Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback

Abstract

AbstractDespite the seeming success of contemporary grounded text generation systems, they often tend to generate factually inconsistent text with respect to their input. This phenomenon is emphasized in tasks like summarization, in which the generated summaries should be corroborated by their source article. In this work we leverage recent progress on textual entailment models to directly address this problem for abstractive summarization systems. We use reinforcement learning with reference-free, textual-entailment rewards to optimize for factual consistency and explore the ensuing trade-offs, as improved consistency may come at the cost of less informative or more extractive summaries. Our results, according to both automatic metrics and human evaluation, show that our method considerably improves the faithfulness, salience and conciseness of the generated summaries.

🌉 Interdisciplinary Bridge — Deep Learning and Machine Learning and Natural Language Processing and Reinforcement Learning

🧭 Keyword Pioneer — reference-free reward

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Paul Roit , Johan Ferret , Lior Shani , Roee Aharoni , Geoffrey Cideron , Robert Dadashi , Matthieu Geist , Sertan Girgin , Leonard Hussenot , Orgad Keller , Nikola Momchev , Sabela Ramos Garea , Piotr Stańczyk , Nino Vieillard , Olivier Bachem , Gal Elidan , Avinatan Hassidim , Olivier Pietquin , Idan Szpektor

Topics

Machine Learning > Application Areas > Domain Adaptation Reinforcement Learning > Methods > Deep RL Natural Language Processing > Applications > Natural Language Inference Natural Language Processing > Applications > Summarization Deep Learning > Learning Types > Reinforcement Learning

Keywords

reinforcement learning textual entailment reference-free evaluation abstractive summarization factual consistency reference-free reward

Download PDF

History Semantic Graph Enhanced Conversational KBQA with Temporal Information Modeling 2023

Efficient Transformers with Dynamic Token Pooling 2023

HHU at SemEval-2023 Task 3: An Adapter-based Approach for News Genre Classification 2023

NAP at SemEval-2023 Task 3: Is Less Really More? (Back-)Translation as Data Augmentation Strategies for Detecting Persuasion Techniques 2023

Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback

Abstract

Authors

Topics

Keywords

Related papers