Learning to Autofocus

Charles Herrmann; Richard Strong Bowen; Neal Wadhwa; Rahul Garg; Qiurui He; Jonathan T. Barron; Ramin Zabih

CVPR 2020

Abstract

Autofocus is an important task for digital cameras, yet current approaches often exhibit poor performance. We propose a learning-based approach to this problem, and provide a realistic dataset of sufficient size for effective learning. Our dataset is labeled with per-pixel depths obtained from multi-view stereo, following [9]. Using this dataset, we apply modern deep classification models and an ordinal regression loss to obtain an efficient learning-based autofocus technique. We demonstrate that our approach provides a significant improvement compared with previous learned and non-learned methods: our model reduces the mean absolute error by a factor of 3.6 over the best comparable baseline algorithm. Our dataset and code are publicly available.
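To make the idea of an ordinal loss over discrete focus positions concrete, here is a minimal sketch in plain Python. It is not taken from the paper: the soft-label formulation, the function names, and the `scale` parameter are all assumptions chosen for illustration, and the paper's actual loss may differ. The sketch treats autofocus as classification over ordered focus indices, penalizes predictions by how far they land from the true index, and computes the mean absolute error used as the headline metric.

```python
import math

def soft_ordinal_targets(true_index, num_classes, scale=1.0):
    """Hypothetical soft labels: peak at true_index, decay with index distance."""
    logits = [-scale * abs(i - true_index) for i in range(num_classes)]
    peak = max(logits)
    exps = [math.exp(v - peak) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]

def log_softmax(logits):
    """Numerically stable log-softmax over a list of floats."""
    peak = max(logits)
    log_norm = peak + math.log(sum(math.exp(v - peak) for v in logits))
    return [v - log_norm for v in logits]

def ordinal_cross_entropy(logits, true_index, scale=1.0):
    """Cross-entropy against soft ordinal targets (an assumed formulation)."""
    targets = soft_ordinal_targets(true_index, len(logits), scale)
    log_probs = log_softmax(logits)
    return -sum(t * lp for t, lp in zip(targets, log_probs))

def mean_absolute_error(predicted, actual):
    """Mean absolute difference between predicted and true focus indices."""
    return sum(abs(p - a) for p, a in zip(predicted, actual)) / len(actual)

# A confident prediction one step from the truth should cost less
# than an equally confident prediction three steps away.
true_index = 3
near = [0.0, 0.0, 0.0, 0.0, 5.0, 0.0, 0.0]  # peaks at index 4
far = [5.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0]   # peaks at index 0
loss_near = ordinal_cross_entropy(near, true_index)
loss_far = ordinal_cross_entropy(far, true_index)
```

With one-hot targets, both predictions above would receive the same cross-entropy, since each puts equally little mass on the true index. The soft targets are what make the loss sensitive to ordering.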

Topics

Artificial Intelligence > Learning Paradigms > Few-Shot Learning

Machine Learning > Core Methods > Regression

Computer Vision > Processing > Image Processing

Deep Learning > Learning Types > Deep Learning

Computer Vision > Processing > Depth Estimation

Keywords

image classification, ordinal regression, depth estimation, multi-view stereo, depth prediction, deep classification

Related papers

Deep Polarization Cues for Transparent Object Segmentation 2020

HRank: Filter Pruning Using High-Rank Feature Map 2020

Panoptic-Based Image Synthesis 2020

Select, Supplement and Focus for RGB-D Saliency Detection 2020

ClusterVO: Clustering Moving Instances and Estimating Visual Odometry for Self and Surroundings 2020