Structured Attention Guided Convolutional Neural Fields for Monocular Depth Estimation

Dan Xu; Wei Wang; Hao Tang; Hong Liu; Nicu Sebe; Elisa Ricci

2018 CVPR CVPR 2018

Structured Attention Guided Convolutional Neural Fields for Monocular Depth Estimation

Abstract

Recent works have shown the benefit of integrating Conditional Random Fields (CRFs) models into deep architectures for improving pixel-level prediction tasks. Following this line of research, in this paper we introduce a novel approach for monocular depth estimation. Similarly to previous works, our method employs a continuous CRF to fuse multi-scale information derived from different layers of a front-end Convolutional Neural Network (CNN). Differently from past works, our approach benefits from a structured attention model which automatically regulates the amount of information transferred between corresponding features at different scales. Importantly, the proposed attention model is seamlessly integrated into the CRF, allowing end-to-end training of the entire architecture. Our extensive experimental evaluation demonstrates the effectiveness of the proposed method which is competitive with previous methods on the KITTI benchmark and outperforms the state of the art on the NYU Depth V2 dataset.

🌉 Interdisciplinary Bridge — Computer Vision and Deep Learning and Machine Learning

🧭 Keyword Pioneer — multi-scale information

🐣 Hot Topic Early Bird — monocular depth estimation

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Dan Xu , Wei Wang , Hao Tang , Hong Liu , Nicu Sebe , Elisa Ricci

Topics

Machine Learning > Optimization & Theory > Stochastic Processes Computer Vision > Analysis > Depth Estimation Deep Learning > Techniques > Attention Computer Vision > Processing > Depth Estimation Deep Learning > Learning Types > Multi-Scale Learning

Keywords

depth estimation monocular depth estimation convolutional neural network conditional random field structured attention monocular depth multi-scale information

Download PDF

Related papers

Multi-Shot Pedestrian Re-Identification via Sequential Decision Making 2018

Multi-Cue Correlation Filters for Robust Visual Tracking 2018

Pointwise Convolutional Neural Networks 2018

Learning Attentions: Residual Attentional Siamese Network for High Performance Online Visual Tracking 2018

Image Generation From Scene Graphs 2018