LoRACLR: Contrastive Adaptation for Customization of Diffusion Models

Enis Simsar; Thomas Hofmann; Federico Tombari; Pinar Yanardag

2025 CVPR CVPR 2025

LoRACLR: Contrastive Adaptation for Customization of Diffusion Models

Abstract

Recent advances in text-to-image customization have enabled high-fidelity, context-rich generation of personalized images, allowing specific concepts to appear in a variety of scenarios. However, current methods struggle with combining multiple personalized models, often leading to attribute entanglement or requiring separate training to preserve concept distinctiveness. We present LoRACLR, a novel approach for multi-concept image generation that merges multiple LoRA models, each fine-tuned for a distinct concept, into a single, unified model without additional individual fine-tuning. LoRACLR uses a contrastive objective to align and merge the weight spaces of these models, ensuring compatibility while minimizing interference. By enforcing distinct yet cohesive representations for each concept, LoRACLR enables efficient, scalable model composition for high-quality, multi-concept image synthesis. Our results highlight the effectiveness of LoRACLR in accurately merging multiple concepts, advancing the capabilities of personalized image generation.

🌉 Interdisciplinary Bridge — Computer Vision and Deep Learning and Machine Learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Enis Simsar , Thomas Hofmann , Federico Tombari , Pinar Yanardag

Topics

Machine Learning > Learning Types > Contrastive Learning Machine Learning > Application Areas > Model Merging Deep Learning > Models > Diffusion Models Computer Vision > Generation > Image Generation Deep Learning > Techniques > Contrastive Learning Deep Learning > Learning Types > Contrastive Learning Deep Learning > Techniques > Knowledge Distillation

Keywords

contrastive learning personalized generation text-to-image generation model merging diffusion model low-rank adaptation multi-concept generation

Download PDF

Related papers

AnyCam: Learning to Recover Camera Poses and Intrinsics from Casual Videos 2025

SeriesBench: A Benchmark for Narrative-Driven Drama Series Understanding 2025

FADE: Frequency-Aware Diffusion Model Factorization for Video Editing 2025

Fast and Accurate Gigapixel Pathological Image Classification with Hierarchical Distillation Multi-Instance Learning 2025

Reversible Decoupling Network for Single Image Reflection Removal 2025