Decomposed Distribution Matching in Dataset Condensation

Sahar Rahimi Malakshan; Mohammad Saeed Ebrahimi Saadabadi; Ali Dabouei; Nasser Nasrabadi

2025 WACV WACV 2025

Decomposed Distribution Matching in Dataset Condensation

Abstract

Dataset Condensation (DC) aims to reduce deep neural networks training efforts by synthesizing a small dataset such that it will be as effective as the original large dataset. Conventionally DC relies on a costly bi-level optimization which prohibits its practicality. Recent research formulates DC as a distribution matching problem which circumvents the costly bi-level optimization. However this efficiency sacrifices the DC performance. To investigate this performance degradation we decomposed the dataset distribution into content and style. Our observations indicate two major shortcomings of: 1) style discrepancy between original and condensed data and 2) limited intra-class diversity of condensed dataset. We present a simple yet effective method to match the style information between original and condensed data employing statistical moments of feature maps as well-established style indicators. Moreover we enhance the intra-class diversity by maximizing the Kullback-Leibler divergence within each synthetic class i.e. content. We demonstrate the efficacy of our method through experiments on diverse datasets of varying size and resolution achieving improvements of up to 4.1% on CIFAR10 4.2% on CIFAR100 4.3% on TinyImageNet 2.0% on ImageNet-1K 3.3% on ImageWoof 2.5% on ImageNette and 5.5% in continual learning accuracy.

🌉 Interdisciplinary Bridge — Deep Learning and Machine Learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Sahar Rahimi Malakshan , Mohammad Saeed Ebrahimi Saadabadi , Ali Dabouei , Nasser Nasrabadi

Topics

Machine Learning > Learning Types > Unsupervised Learning Machine Learning > Optimization & Theory > Optimization Deep Learning > Models > Generative Models

Keywords

continual learning style transfer distribution matching kullback-leibler divergence feature map dataset condensation

Download PDF

Related papers

Neural Graph Map: Dense Mapping with Efficient Loop Closure Integration 2025

ELMGS: Enhancing Memory and Computation Scalability through Compression for 3D Gaussian Splatting 2025

Feature Fusion Transferability Aware Transformer for Unsupervised Domain Adaptation 2025

Uncertainty-Aware Online Extrinsic Calibration: A Conformal Prediction Approach 2025

Disentangling Spatio-Temporal Knowledge for Weakly Supervised Object Detection and Segmentation in Surgical Video 2025