Dual-Schedule Inversion: Training- and Tuning-Free Inversion for Real Image Editing

Jiancheng Huang; Yi Huang; Jianzhuang Liu; Donghao Zhou; Yifan Liu; Shifeng Chen

WACV 2025

Abstract

Text-conditional image editing is a practical AIGC task that has recently emerged with great commercial and academic value. For real image editing, most diffusion model-based methods use DDIM Inversion as the first stage before editing. However, DDIM Inversion often results in reconstruction failure, leading to unsatisfactory performance in downstream editing. To address this problem, we first analyze why reconstruction via DDIM Inversion fails. We then propose a new inversion and sampling method named Dual-Schedule Inversion. We also design a classifier to adaptively combine Dual-Schedule Inversion with different editing methods for user-friendly image editing. Our method achieves superior reconstruction and editing performance with the following advantages: 1) It can reconstruct real images perfectly without fine-tuning, and its reversibility is guaranteed mathematically. 2) The edited object/scene conforms to the semantics of the text prompt. 3) The unedited parts of the object/scene retain the original identity.

Topics

Deep Learning > Models > Diffusion Models; Deep Learning > Models > Generative Models; Computer Vision > Generation > Image Generation; Computer Vision > Processing > Image Editing; Artificial Intelligence > Core AI > Computer Vision

Keywords

image reconstruction; image editing; diffusion model; DDIM inversion; text-conditional generation
