To Err Is Human, but Llamas Can Learn It Too

Agnes Luhtaru; Taido Purason; Martin Vainikko; Maksym Del; Mark Fishel

EMNLP 2024

Abstract

This study explores enhancing grammatical error correction (GEC) through automatic error generation (AEG) using language models (LMs). Specifically, we fine-tune Llama 2 LMs for error generation and find that this approach yields synthetic errors akin to human errors. Next, we train GEC Llama models using these artificial errors and outperform previous state-of-the-art error correction models, with gains ranging between 0.8 and 6 F0.5 points across all tested languages (German, Ukrainian, and Estonian). Moreover, we demonstrate that generating errors by fine-tuning smaller sequence-to-sequence models and prompting large commercial LMs (GPT3.5 and GPT4) also results in synthetic errors beneficially affecting error generation models. We openly release trained models for error generation and correction as well as all the synthesized error datasets for the covered languages.
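The gains above are reported in F0.5, the F-measure with beta = 0.5, which weights precision more heavily than recall. A minimal sketch of that formula follows, assuming the score is computed from counts of correct, spurious, and missed edits; the function name `f_beta` and the example counts are illustrative and do not come from the paper or its evaluation tooling.

```python
def f_beta(tp: int, fp: int, fn: int, beta: float = 0.5) -> float:
    """F-beta from edit counts (hypothetical helper, not the paper's scorer).

    tp: proposed edits that match the reference
    fp: proposed edits that do not match the reference
    fn: reference edits the system missed
    beta < 1 weights precision above recall.
    """
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision == 0.0 and recall == 0.0:
        return 0.0
    b2 = beta ** 2
    return (1 + b2) * precision * recall / (b2 * precision + recall)


# Made-up counts: precision 0.75, recall 0.60
score = f_beta(tp=60, fp=20, fn=40)
print(f"F0.5 = {100 * score:.1f}")
```

With these made-up counts, precision exceeds recall, so F0.5 lands above the balanced F1 computed from the same counts. Scores quoted in "points" are on a 0 to 100 scale, hence the multiplication by 100.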

Topics

Machine Learning > Learning Types > Self-Supervised Learning

Natural Language Processing > Generation > Text Generation

Artificial Intelligence > Core AI > Natural Language Processing

Deep Learning > Learning Types > Fine-Tuning

Natural Language Processing > Applications > Text Processing

Keywords

multilingual NLP, grammatical error correction, language model, fine-tuning, sequence-to-sequence model, error generation, synthetic error, synthetic error generation
