Hierarchy Denoising Recursive Autoencoders for 3D Scene Layout Prediction

Yifei Shi; Angel X. Chang; Zhelun Wu; Manolis Savva; Kai Xu

2019 CVPR CVPR 2019

Hierarchy Denoising Recursive Autoencoders for 3D Scene Layout Prediction

Abstract

Indoor scenes exhibit rich hierarchical structure in 3D object layouts. Many tasks in 3D scene understanding can benefit from reasoning jointly about the hierarchical context of a scene, and the identities of objects. We present a variational denoising recursive autoencoder (VDRAE) that generates and iteratively refines a hierarchical representation of 3D object layouts, interleaving bottom-up encoding for context aggregation and top-down decoding for propagation. We train our VDRAE on large-scale 3D scene datasets to predict both instance-level segmentations and a 3D object detections from an over-segmentation of an input point cloud. We show that our VDRAE improves object detection performance on real-world 3D point cloud datasets compared to baselines from prior work.

🌉 Interdisciplinary Bridge — Computer Vision and Deep Learning

🧭 Keyword Pioneer — variational denoising autoencoder

🐣 Hot Topic Early Bird — 3d scene understanding

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Yifei Shi , Angel X. Chang , Zhelun Wu , Manolis Savva , Kai Xu

Topics

Deep Learning > Models > Variational Inference Computer Vision > Analysis > 3D Vision Computer Vision > Analysis > Scene Understanding Computer Vision > Domain-Specific > 3D Vision

Keywords

scene understanding object detection point cloud 3d scene understanding hierarchical representation 3d scene layout variational autoencoder recursive autoencoder variational denoising autoencoder

Download PDF

Related papers

Fast Single Image Reflection Suppression via Convex Optimization 2019

Learning Video Representations From Correspondence Proposals 2019

ATOM: Accurate Tracking by Overlap Maximization 2019

Visual Tracking via Adaptive Spatially-Regularized Correlation Filters 2019

Edge-Labeling Graph Neural Network for Few-Shot Learning 2019