Mini-DA: Improving Your Model Performance through Minimal Data Augmentation using LLM

Shuangtao Yang; Xiaoyi Liu; Xiaozheng Dong; Bo Fu

NAACL 2024

Abstract

When performing data augmentation with large language models (LLMs), the common approach is to generate a large number of new samples directly from the original dataset and then train the model on the combination of the augmented and original datasets. However, data generation demands extensive computational resources. In this study, we propose Mini-DA, a minimized data augmentation method that uses feedback from the target model during training to select only the most challenging samples from the validation set for augmentation. Our experimental results on text classification show that, using as little as 13 percent of the original augmentation volume, Mini-DA achieves performance comparable to full data augmentation on the intent detection task, significantly improving the efficiency of data and computational resource use.
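
The selection step described in the abstract can be illustrated with a minimal sketch. The abstract does not state how "most challenging" is measured, so this example assumes per-sample cross-entropy loss under the target model's current predictions as the difficulty score. The function name `select_hardest`, its parameters, and the default `fraction` are hypothetical and chosen only for illustration; they are not taken from the paper.

```python
import math


def select_hardest(probs, labels, fraction=0.13):
    """Return indices of the hardest validation samples.

    Assumption (not from the paper): difficulty is the cross-entropy
    loss of the target model's predicted distribution on each sample.

    probs    -- list of per-class probability lists, one per sample
    labels   -- list of gold class indices, one per sample
    fraction -- share of samples to keep for augmentation (hypothetical)
    """
    if len(probs) != len(labels):
        raise ValueError("probs and labels must have the same length")
    if not 0 < fraction <= 1:
        raise ValueError("fraction must be in (0, 1]")
    if not probs:
        return []

    # Per-sample cross-entropy; clamp to avoid log(0).
    losses = [-math.log(max(p[y], 1e-12)) for p, y in zip(probs, labels)]

    # Keep at least one sample, rounding the budget up.
    k = max(1, math.ceil(fraction * len(losses)))

    # Highest loss first: these are the samples the model handles worst.
    ranked = sorted(range(len(losses)), key=lambda i: losses[i], reverse=True)
    return ranked[:k]


if __name__ == "__main__":
    # Toy validation predictions for a two-class task.
    val_probs = [[0.9, 0.1], [0.2, 0.8], [0.6, 0.4], [0.05, 0.95]]
    val_labels = [0, 0, 1, 1]
    print(select_hardest(val_probs, val_labels, fraction=0.5))
```

Only the selected samples would then be passed to the LLM for augmentation, which is where the reduction in generation cost would come from. Other difficulty signals, such as misclassification or low prediction margin, could be substituted for the loss without changing the structure of the sketch.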

Authors

Shuangtao Yang, Xiaoyi Liu, Xiaozheng Dong, Bo Fu

Topics

Machine Learning > Application Areas > Data Augmentation
Natural Language Processing > Applications > Intent Classification
Natural Language Processing > Applications > Text Classification

Keywords

text classification, data augmentation, computational efficiency, intent detection, large language model

Related papers

Working Alliance Transformer for Psychotherapy Dialogue Classification 2024

Named Entity Recognition Under Domain Shift via Metric Learning for Life Sciences 2024

Assessing Logical Puzzle Solving in Large Language Models: Insights from a Minesweeper Case Study 2024

TelME: Teacher-leading Multimodal Fusion Network for Emotion Recognition in Conversation 2024

Extractive Summarization with Text Generator 2024