Dynamic Attention-Guided Diffusion for Image Super-Resolution

Brian B. Moser; Stanislav Frolov; Federico Raue; Sebastian Palacio; Andreas Dengel

2025 WACV WACV 2025

Dynamic Attention-Guided Diffusion for Image Super-Resolution

Abstract

Diffusion models in image Super-Resolution (SR) treat all image regions uniformly which risks compromising the overall image quality by potentially introducing artifacts during denoising of less-complex regions. To address this we propose "You Only Diffuse Areas" (YODA) a dynamic attention-guided diffusion process for image SR. YODA selectively focuses on spatial regions defined by attention maps derived from the low-resolution images and the current denoising time step. This time-dependent targeting enables a more efficient conversion to high-resolution outputs by focusing on areas that benefit the most from the iterative refinement process i.e. detail-rich objects. We empirically validate YODA by extending leading diffusion-based methods SR3 DiffBIR and SRDiff. Our experiments demonstrate new state-of-the-art performances in face and general SR tasks across PSNR SSIM and LPIPS metrics. As a side effect we find that YODA reduces color shift issues and stabilizes training with small batches.

🌉 Interdisciplinary Bridge — Computer Vision and Deep Learning and Machine Learning

🧭 Keyword Pioneer — spatial region

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Brian B. Moser , Stanislav Frolov , Federico Raue , Sebastian Palacio , Andreas Dengel

Topics

Machine Learning > Application Areas > Efficient Computing Deep Learning > Models > Diffusion Models Computer Vision > Processing > Image Restoration

Keywords

image restoration attention mechanism image super-resolution diffusion model denoising process attention map dynamic attention spatial region

Download PDF

Related papers

Neural Graph Map: Dense Mapping with Efficient Loop Closure Integration 2025

ELMGS: Enhancing Memory and Computation Scalability through Compression for 3D Gaussian Splatting 2025

Feature Fusion Transferability Aware Transformer for Unsupervised Domain Adaptation 2025

Uncertainty-Aware Online Extrinsic Calibration: A Conformal Prediction Approach 2025

Disentangling Spatio-Temporal Knowledge for Weakly Supervised Object Detection and Segmentation in Surgical Video 2025