DreamStyler: Paint by Style Inversion with Text-to-Image Diffusion Models

Namhyuk Ahn; Junsoo Lee; Chunggi Lee; Kunhee Kim; Daesik Kim; Seung-Hun Nam; Kibeom Hong

2024 AAAI AAAI 2024

DreamStyler: Paint by Style Inversion with Text-to-Image Diffusion Models

Abstract

Abstract Recent progresses in large-scale text-to-image models have yielded remarkable accomplishments, finding various applications in art domain. However, expressing unique characteristics of an artwork (e.g. brushwork, colortone, or composition) with text prompts alone may encounter limitations due to the inherent constraints of verbal description. To this end, we introduce DreamStyle, a novel framework designed for artistic image synthesis, proficient in both text-to-image synthesis and style transfer. DreamStyle optimizes a multi-stage textual embedding with a context-aware text prompt, resulting in prominent image quality. In addition, with content and style guidance, DreamStyle exhibits flexibility to accommodate a range of style references. Experimental results demonstrate its superior performance across multiple scenarios, suggesting its promising potential in artistic product creation. Project page: https://nmhkahn.github.io/dreamstyler/

🌉 Interdisciplinary Bridge — Computer Vision and Deep Learning

🧭 Keyword Pioneer — artistic image synthesis

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Namhyuk Ahn , Junsoo Lee , Chunggi Lee , Kunhee Kim , Daesik Kim , Seung-Hun Nam , Kibeom Hong

Topics

Deep Learning > Models > Diffusion Models Deep Learning > Techniques > Pretraining Computer Vision > Generation > Image Generation Computer Vision > Generation > Image Translation Computer Vision > Processing > Image Editing Deep Learning > Learning Types > Generative Models

Keywords

style transfer text-to-image synthesis image synthesis diffusion model text-to-image diffusion textual embedding artistic image synthesis

Download PDF

Related papers

Goal Alignment: Re-analyzing Value Alignment Problems Using Human-Aware AI 2024

Meta-Inverse Reinforcement Learning for Mean Field Games via Probabilistic Context Variables 2024

Suppressing Uncertainty in Gaze Estimation 2024

Mask-Homo: Pseudo Plane Mask-Guided Unsupervised Multi-Homography Estimation 2024

Heterogeneous Test-Time Training for Multi-Modal Person Re-identification 2024