Data Augmentation

429 directly classified papers

Papers

RegMixMatch: Optimizing Mixup Utilization in Semi-Supervised Learning AAAI 2025

IIITH-BUT system for IWSLT 2025 low-resource Bhojpuri to Hindi speech translation ACL 2025

Bemba Speech Translation: Exploring a Low-Resource African Language ACL 2025

Gender Swapping as a Data Augmentation Technique: Developing Gender-Balanced Datasets for Ukrainian Language Processing ACL 2025

DiffuPT: Class Imbalance Mitigation for Glaucoma Detection via Diffusion Based Generation and Model Pretraining WACV 2025

DS2-ABSA: Dual-Stream Data Synthesis with Label Refinement for Few-Shot Aspect-Based Sentiment Analysis ACL 2025

The Impact of Code-switched Synthetic Data Quality is Task Dependent: Insights from MT and ASR NAACL 2025

UCSP Submission to the AmericasNLP 2025 Shared Task NAACL 2025

Diffusion-based Synthetic Data Generation for Visible-Infrared Person Re-Identification AAAI 2025

What are the Essential Factors in Crafting Effective Long Context Multi-Hop Instruction Datasets? Insights and Best Practices ACL 2025

Building a Family of Data Augmentation Models for Low-cost LLM Fine-tuning on the Cloud COLING 2025

Investigating the Effect of Backtranslation for Indic Languages COLING 2025

Generate or Re-Weight? A Mutual-Guidance Method for Class-Imbalanced Graphs IJCAI 2025

Scalable Data Synthesis through Human-like Cognitive Imitation and Data Recombination EMNLP 2025

A Training-free Synthetic Data Selection Method for Semantic Segmentation AAAI 2025

Realistic Training Data Generation and Rule Enhanced Decoding in LLM for NameGuess EMNLP 2025

Doubling Your Data in Minutes: Ultra-fast Tabular Data Generation via LLM-Induced Dependency Graphs EMNLP 2025

Evaluating the Effectiveness and Scalability of LLM-Based Data Augmentation for Retrieval EMNLP 2025

Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language CVPR 2025

Navigating Towards Fairness with Data Selection AAAI 2025

Explicit and Implicit Data Augmentation for Social Event Detection ACL 2025

SNaRe: Domain-aware Data Generation for Low-Resource Event Detection EMNLP 2025

Who’s the (Multi-)Fairest of Them All: Rethinking Interpolation-Based Data Augmentation Through the Lens of Multicalibration AAAI 2025

Cap2Aug: Caption Guided Image Data Augmentation WACV 2025

Priority on High-Quality: Selecting Instruction Data via Consistency Verification of Noise Injection EMNLP 2025