Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Learning Types
Machine Learning
›
Learning Types
›
Data Augmentation
429 directly classified papers
Papers per year
2013: 1
2017: 3
2018: 8
2019: 29
2020: 47
2021: 67
2022: 76
2023: 82
2024: 63
2025: 53
Papers
Towards Robust Universal Information Extraction: Dataset, Evaluation, and Solution
ACL 2025
TutorMind at BEA 2025 Shared Task: Leveraging Fine-Tuned LLMs and Data Augmentation for Mistake Identification
ACL 2025
Generate or Re-Weight? A Mutual-Guidance Method for Class-Imbalanced Graphs
IJCAI 2025
RegMixMatch: Optimizing Mixup Utilization in Semi-Supervised Learning
AAAI 2025
Enhancing Dialectal Arabic Intent Detection through Cross-Dialect Multilingual Input Augmentation
COLING 2025
MetaSynth: Meta-Prompting-Driven Agentic Scaffolds for Diverse Synthetic Data Generation
ACL 2025
UCSP Submission to the AmericasNLP 2025 Shared Task
NAACL 2025
Named Entity Recognition for the Irish Language
NAACL 2025
Diffusion-based Synthetic Data Generation for Visible-Infrared Person Re-Identification
AAAI 2025
Who’s the (Multi-)Fairest of Them All: Rethinking Interpolation-Based Data Augmentation Through the Lens of Multicalibration
AAAI 2025
Building a Family of Data Augmentation Models for Low-cost LLM Fine-tuning on the Cloud
COLING 2025
KCS: Diversify Multi-hop Question Generation with Knowledge Composition Sampling
EMNLP 2025
Investigating the Effect of Backtranslation for Indic Languages
COLING 2025
A Little Human Data Goes A Long Way
ACL 2025
DS2-ABSA: Dual-Stream Data Synthesis with Label Refinement for Few-Shot Aspect-Based Sentiment Analysis
ACL 2025
Cap2Aug: Caption Guided Image Data Augmentation
WACV 2025
What are the Essential Factors in Crafting Effective Long Context Multi-Hop Instruction Datasets? Insights and Best Practices
ACL 2025
The Impact of Code-switched Synthetic Data Quality is Task Dependent: Insights from MT and ASR
NAACL 2025
Scalable Data Synthesis through Human-like Cognitive Imitation and Data Recombination
EMNLP 2025
A Training-free Synthetic Data Selection Method for Semantic Segmentation
AAAI 2025
Evaluating the Effectiveness and Scalability of LLM-Based Data Augmentation for Retrieval
EMNLP 2025
Synthetic Tabular Data Generation for Imbalanced Classification: The Surprising Effectiveness of an Overlap Class
AAAI 2025
SNaRe: Domain-aware Data Generation for Low-Resource Event Detection
EMNLP 2025
DiffuPT: Class Imbalance Mitigation for Glaucoma Detection via Diffusion Based Generation and Model Pretraining
WACV 2025
Overcoming Data Scarcity in Named Entity Recognition: Synthetic Data Generation with Large Language Models
ACL 2025
<
1
2
3
4
5
…
18
>