Viewpoint-Agnostic Image Rendering

Hiroaki Aizawa; Hirokatsu Kataoka; Yutaka Satoh; Kunihito Kato

WACV 2021


Abstract

Rendering an image from an arbitrary viewpoint is extremely difficult for Generative Adversarial Networks (GANs). This is because conventional GANs do not understand the 3D information underlying a given viewpoint image, such as the object shape and the relationship between the viewpoint and the objects in 3D space. In this paper, we present Viewpoint-Agnostic Image Rendering (VAIR), which equips a conditional GAN with a mechanism to reconstruct the 3D information of the input view. VAIR realizes any-viewpoint image generation by manipulating the viewpoint in the 3D space where the reconstructed instance shape is placed. In addition, we convert the reconstructed 3D shape into a 2D representation for the image-based conditional GAN while preserving detailed 3D information. The representation consists of a depth image and 2D semantic keypoint images, which are obtained by rendering the shape from a viewpoint. In the experiments, we evaluate on the CUB-200-2011 dataset, which contains only a few samples whose viewpoints are biased, so that they cover only part of the target's appearance. As a result, VAIR clearly renders any-viewpoint images.
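The 2D representation described in the abstract (a depth image plus 2D semantic keypoints, rendered from a chosen viewpoint) can be illustrated with a toy example. This is only a minimal sketch under simplifying assumptions, not the authors' implementation: the shape is a handful of 3D points rather than a reconstructed mesh, the camera is orthographic and looks along the +z axis, and the viewpoint change is a rotation about the vertical axis. All function names (`rotate_y`, `to_pixel`, `render_depth`, `project_keypoints`) and the keypoint labels are hypothetical.

```python
import math

def rotate_y(points, azimuth_deg):
    """Rotate 3D points about the vertical (y) axis to emulate a viewpoint change."""
    a = math.radians(azimuth_deg)
    c, s = math.cos(a), math.sin(a)
    return [(c * x + s * z, y, -s * x + c * z) for x, y, z in points]

def to_pixel(x, y, size):
    """Map (x, y) in [-1, 1] to integer (row, col); row 0 is the top of the image."""
    col = int(round((x + 1.0) / 2.0 * (size - 1)))
    row = int(round((1.0 - (y + 1.0) / 2.0) * (size - 1)))
    return row, col

def render_depth(points, size=9):
    """Orthographic z-buffer: keep the nearest (smallest z) point per pixel.

    Pixels that no point projects to keep the value math.inf (background).
    """
    depth = [[math.inf] * size for _ in range(size)]
    for x, y, z in points:
        row, col = to_pixel(x, y, size)
        if 0 <= row < size and 0 <= col < size:
            depth[row][col] = min(depth[row][col], z)
    return depth

def project_keypoints(keypoints, size=9):
    """Project named 3D keypoints to 2D pixel locations (depth is dropped)."""
    return {name: to_pixel(x, y, size) for name, (x, y, _z) in keypoints.items()}

# Hypothetical toy shape: two labeled keypoints on the x-axis.
keypoints = {"beak": (0.5, 0.0, 0.0), "tail": (-0.5, 0.0, 0.0)}

# Original viewpoint: the two keypoints appear side by side.
front = project_keypoints(keypoints)

# New viewpoint: rotate the shape by 90 degrees, then re-render both maps.
names = list(keypoints)
rotated_pts = rotate_y([keypoints[n] for n in names], 90)
rotated = dict(zip(names, rotated_pts))
side = project_keypoints(rotated)
depth = render_depth(rotated_pts)
```

In this toy setup, rotating the shape makes both keypoints project to the same pixel, and the depth image records which one is nearer to the camera. A conditional GAN would then take such depth and keypoint images as its conditioning input; that part is outside the scope of this sketch.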



Topics

Deep Learning > Models > Generative Models
Computer Vision > Analysis > 3D Vision
Computer Vision > Generation > Image Generation

Keywords

3D reconstruction, image generation, depth estimation, conditional generative adversarial network, viewpoint synthesis, semantic keypoint

