Research Explorer
Data Augmentation
429 directly classified papers
Papers per year
2013: 1
2017: 3
2018: 8
2019: 29
2020: 47
2021: 67
2022: 76
2023: 82
2024: 63
2025: 53
Papers
RegMixMatch: Optimizing Mixup Utilization in Semi-Supervised Learning
AAAI 2025
IIITH-BUT system for IWSLT 2025 low-resource Bhojpuri to Hindi speech translation
ACL 2025
Bemba Speech Translation: Exploring a Low-Resource African Language
ACL 2025
Gender Swapping as a Data Augmentation Technique: Developing Gender-Balanced Datasets for Ukrainian Language Processing
ACL 2025
DiffuPT: Class Imbalance Mitigation for Glaucoma Detection via Diffusion Based Generation and Model Pretraining
WACV 2025
DS2-ABSA: Dual-Stream Data Synthesis with Label Refinement for Few-Shot Aspect-Based Sentiment Analysis
ACL 2025
The Impact of Code-switched Synthetic Data Quality is Task Dependent: Insights from MT and ASR
NAACL 2025
UCSP Submission to the AmericasNLP 2025 Shared Task
NAACL 2025
Diffusion-based Synthetic Data Generation for Visible-Infrared Person Re-Identification
AAAI 2025
What are the Essential Factors in Crafting Effective Long Context Multi-Hop Instruction Datasets? Insights and Best Practices
ACL 2025
Building a Family of Data Augmentation Models for Low-cost LLM Fine-tuning on the Cloud
COLING 2025
Investigating the Effect of Backtranslation for Indic Languages
COLING 2025
Generate or Re-Weight? A Mutual-Guidance Method for Class-Imbalanced Graphs
IJCAI 2025
Scalable Data Synthesis through Human-like Cognitive Imitation and Data Recombination
EMNLP 2025
A Training-free Synthetic Data Selection Method for Semantic Segmentation
AAAI 2025
Realistic Training Data Generation and Rule Enhanced Decoding in LLM for NameGuess
EMNLP 2025
Doubling Your Data in Minutes: Ultra-fast Tabular Data Generation via LLM-Induced Dependency Graphs
EMNLP 2025
Evaluating the Effectiveness and Scalability of LLM-Based Data Augmentation for Retrieval
EMNLP 2025
Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language
CVPR 2025
Navigating Towards Fairness with Data Selection
AAAI 2025
Explicit and Implicit Data Augmentation for Social Event Detection
ACL 2025
SNaRe: Domain-aware Data Generation for Low-Resource Event Detection
EMNLP 2025
Who’s the (Multi-)Fairest of Them All: Rethinking Interpolation-Based Data Augmentation Through the Lens of Multicalibration
AAAI 2025
Cap2Aug: Caption Guided Image Data Augmentation
WACV 2025
Priority on High-Quality: Selecting Instruction Data via Consistency Verification of Noise Injection
EMNLP 2025