3D-Aware Image Synthesis via Learning Structural and Textural Representations

Yinghao Xu; Sida Peng; Ceyuan Yang; Yujun Shen; Bolei Zhou

2022 CVPR CVPR 2022

3D-Aware Image Synthesis via Learning Structural and Textural Representations

Abstract

Making generative models 3D-aware bridges the 2D image space and the 3D physical world yet remains challenging. Recent attempts equip a Generative Adversarial Network (GAN) with a Neural Radiance Field (NeRF), which maps 3D coordinates to pixel values, as a 3D prior. However, the implicit function in NeRF has a very local receptive field, making the generator hard to become aware of the global structure. Meanwhile, NeRF is built on volume rendering which can be too costly to produce high-resolution results, increasing the optimization difficulty. To alleviate these two problems, we propose a novel framework, termed as VolumeGAN, for high-fidelity 3D-aware image synthesis, through explicitly learning a structural representation and a textural representation. We first learn a feature volume to represent the underlying structure, which is then converted to a feature field using a NeRF-like model. The feature field is further accumulated into a 2D feature map as the textural representation, followed by a neural renderer for appearance synthesis. Such a design enables independent control of the shape and the appearance. Extensive experiments on a wide range of datasets confirm that, our approach achieves sufficiently higher image quality and better 3D control than the previous methods..

🌉 Interdisciplinary Bridge — Artificial Intelligence and Computer Vision and Deep Learning

📈 Trend Setter — 3D Vision

🐣 Hot Topic Early Bird — volume rendering

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Yinghao Xu , Sida Peng , Ceyuan Yang , Yujun Shen , Bolei Zhou

Topics

Artificial Intelligence > Core AI > Multimodal Learning Deep Learning > Architectures > Autoencoders Deep Learning > Models > Generative Models Computer Vision > Generation > Image Generation Computer Vision > Generation > 3D Vision

Keywords

volume rendering image generation image synthesis variational autoencoder generative adversarial network neural radiance field 3d-aware image synthesis feature volume 3d-aware synthesis

Download PDF

Related papers

UniCoRN: A Unified Conditional Image Repainting Network 2022

Why Discard if You Can Recycle?: A Recycling Max Pooling Module for 3D Point Cloud Analysis 2022

All-in-One Image Restoration for Unknown Corruption 2022

Stability-Driven Contact Reconstruction From Monocular Color Images 2022

Forecasting Characteristic 3D Poses of Human Actions 2022