Fader Networks: Manipulating Images by Sliding Attributes

Guillaume Lample; Neil Zeghidour; Nicolas Usunier; Antoine Bordes; Ludovic Denoyer; Marc'Aurelio Ranzato

NIPS (NeurIPS) 2017

Abstract

This paper introduces a new encoder-decoder architecture that is trained to reconstruct images by disentangling the salient information of the image and the values of attributes directly in the latent space. As a result, after training, our model can generate different realistic versions of an input image by varying the attribute values. By using continuous attribute values, we can choose how much a specific attribute is perceivable in the generated image. This property could allow for applications where users can modify an image using sliding knobs, like faders on a mixing console, to change the facial expression of a portrait or to update the color of some objects. Compared to the state of the art, which mostly relies on training adversarial networks in pixel space by altering attribute values at train time, our approach results in much simpler training schemes and scales nicely to multiple attributes. We present evidence that our model can significantly change the perceived value of the attributes while preserving the naturalness of images.
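
The inference-time idea described in the abstract (encode an image once, then decode it with different attribute values) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the names `encode`, `decode`, and `slide_attribute`, the dimensions, and the random untrained linear layers are hypothetical stand-ins, not the authors' architecture or training procedure.

```python
import numpy as np

rng = np.random.default_rng(0)

IMG_DIM = 64      # size of a flattened toy "image" (hypothetical)
LATENT_DIM = 8    # size of the latent code (hypothetical)

# Random, untrained weights standing in for a trained encoder and decoder.
W_enc = rng.normal(size=(LATENT_DIM, IMG_DIM)) / np.sqrt(IMG_DIM)
W_dec = rng.normal(size=(IMG_DIM, LATENT_DIM + 1)) / np.sqrt(LATENT_DIM + 1)

def encode(x):
    """Map an image to a latent code (ideally free of attribute information)."""
    return np.tanh(W_enc @ x)

def decode(z, y):
    """Reconstruct an image from the latent code z and a scalar attribute value y."""
    return np.tanh(W_dec @ np.concatenate([z, [y]]))

def slide_attribute(x, values):
    """Encode x once, then decode it once per attribute value in `values`."""
    z = encode(x)
    return np.stack([decode(z, y) for y in values])

x = rng.normal(size=IMG_DIM)          # stand-in for an input image
faders = np.linspace(0.0, 1.0, 5)     # continuous attribute values, like a fader
versions = slide_attribute(x, faders)
print(versions.shape)                 # (5, 64)
```

In this sketch the attribute enters only at the decoder, so moving the value in `faders` changes the output while the latent code `z` stays fixed. Making `z` actually independent of the attribute is the job of training, which the sketch does not cover.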

Topics

Deep Learning > Models > Generative Models
Computer Science > Systems > Computer Graphics
Computer Vision > Generation > Image Editing

Keywords

image generation; disentangled representation; latent space; encoder-decoder architecture; attribute manipulation
