3D Part Guided Image Editing for Fine-Grained Object Understanding

Zongdai Liu; Feixiang Lu; Peng Wang; Hui Miao; Liangjun Zhang; Ruigang Yang; Bin Zhou

2020 CVPR CVPR 2020

3D Part Guided Image Editing for Fine-Grained Object Understanding

Abstract

Holistically understanding an object with its 3D movable parts is essential for visual models of a robot to interact with the world. For example, only by understanding many possible part dynamics of other vehicles (e.g., door or trunk opening, taillight blinking for changing lane), a self-driving vehicle can be success in dealing with emergency cases. However, existing visual models tackle rarely on these situations, but focus on bounding box detection. In this paper, we fill this important missing piece in autonomous driving by solving two critical issues. First, for dealing with data scarcity, we propose an effective training data generation process by fitting a 3D car model with dynamic parts to cars in real images. This allows us to directly edit the real images using the aligned 3D parts, yielding effective training data for learning robust deep neural networks (DNNs). Secondly, to benchmark the quality of 3D part understanding, we collected a large dataset in real driving scenario with cars in uncommon states (CUS), i.e. with door or trunk opened etc., which demonstrates that our trained network with edited images largely outperforms other baselines in terms of 2D detection and instance segmentation accuracy.

🌉 Interdisciplinary Bridge — Computer Vision and Machine Learning

🧭 Keyword Pioneer — 3d part understanding

🐣 Hot Topic Early Bird — image editing

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Zongdai Liu , Feixiang Lu , Peng Wang , Hui Miao , Liangjun Zhang , Ruigang Yang , Bin Zhou

Topics

Machine Learning > Application Areas > Domain Adaptation Computer Vision > Analysis > Object Detection Computer Vision > Domain-Specific > Autonomous Driving Computer Vision > Analysis > Object Segmentation

Keywords

object detection autonomous driving 3d vision image editing instance segmentation synthetic data generation fine-grained recognition deep neural network 3d part understanding part guidance 3d car model

Download PDF

Related papers

Deep Polarization Cues for Transparent Object Segmentation 2020

HRank: Filter Pruning Using High-Rank Feature Map 2020

Panoptic-Based Image Synthesis 2020

Select, Supplement and Focus for RGB-D Saliency Detection 2020

ClusterVO: Clustering Moving Instances and Estimating Visual Odometry for Self and Surroundings 2020