DSNAS: Direct Neural Architecture Search Without Parameter Retraining

Shoukang Hu; Sirui Xie; Hehui Zheng; Chunxiao Liu; Jianping Shi; Xunying Liu; Dahua Lin

CVPR 2020

Abstract

If NAS methods are solutions, what is the problem? Most existing NAS methods require two-stage parameter optimization. However, the performance of the same architecture in the two stages correlates poorly. Based on this observation, we propose a new problem definition for NAS: task-specific end-to-end NAS. We argue that, given a computer vision task for which a NAS method is expected, this definition reduces the vaguely defined NAS evaluation to i) the accuracy on this task and ii) the total computation consumed to finally obtain a model with satisfactory accuracy. Seeing that most existing methods do not solve this problem directly, we propose DSNAS, an efficient differentiable NAS framework that simultaneously optimizes architecture and parameters with a low-bias Monte Carlo estimate. Child networks derived from DSNAS can be deployed directly without parameter retraining. Compared with two-stage methods, DSNAS successfully discovers networks with comparable accuracy (74.4%) on ImageNet in 420 GPU hours, reducing the total time by more than 34%.
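
To make the single-stage idea concrete, the following is a minimal toy sketch, not the DSNAS estimator itself. It assumes a generic score-function (REINFORCE-style) Monte Carlo gradient for the architecture distribution, and a hypothetical search space of three scalar "operations" (the names `FEATURES`, `search`, `alpha`, and the regression target y = 2x are all illustrative assumptions). At each step one operation is sampled, its weight and the architecture logits are updated from the same loss, and the selected operation is returned together with its already-trained weight, with no second training stage.

```python
import math
import random


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical candidate operations: op k computes w_k * FEATURES[k](x).
FEATURES = [
    lambda x: x,        # linear
    lambda x: x * x,    # quadratic
    lambda x: 1.0,      # constant
]


def search(steps=2000, lr_w=0.1, lr_alpha=0.1, seed=0):
    """Jointly train op weights and architecture logits in one loop."""
    rng = random.Random(seed)
    alpha = [0.0] * len(FEATURES)    # architecture logits
    weights = [0.0] * len(FEATURES)  # one trainable weight per op
    baseline = 0.0                   # running mean of the loss (variance reduction)

    for _ in range(steps):
        probs = softmax(alpha)
        # Monte Carlo sample of a single child "network" (one op).
        k = rng.choices(range(len(FEATURES)), weights=probs)[0]

        # Toy regression task (assumption): y = 2x on x in [-1, 1].
        x = rng.uniform(-1.0, 1.0)
        y = 2.0 * x
        feat = FEATURES[k](x)
        err = weights[k] * feat - y
        loss = err * err

        # Parameter update: only the sampled op receives a gradient.
        weights[k] -= lr_w * 2.0 * err * feat

        # Architecture update with the score-function estimate:
        # grad_alpha E[loss] ~= (loss - baseline) * (onehot(k) - probs).
        advantage = loss - baseline
        for j in range(len(alpha)):
            indicator = 1.0 if j == k else 0.0
            alpha[j] -= lr_alpha * advantage * (indicator - probs[j])

        baseline = 0.9 * baseline + 0.1 * loss

    # Derive the child directly: pick the most probable op and keep its weight.
    best = max(range(len(alpha)), key=lambda j: alpha[j])
    return best, weights[best], softmax(alpha)


if __name__ == "__main__":
    best, weight, probs = search()
    print("selected op index:", best)
    print("its trained weight:", weight)
    print("architecture probabilities:", probs)
```

The point of the sketch is the control flow: there is one optimization loop, and the returned weight belongs to the selected operation as trained during search. A two-stage method would instead discard the search-time weights and retrain the selected architecture from scratch.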

Topics

Machine Learning > Optimization & Theory > Neural Network Optimization
Machine Learning > Application Areas > Efficient Computing
Computer Vision > Analysis > Object Detection
Machine Learning > Learning Types > Deep Learning
Artificial Intelligence > Core AI > Efficient Computing
Deep Learning > Optimization & Theory > Neural Network Optimization
Deep Learning > Optimization & Theory > Efficient Computing
Machine Learning > Learning Types > Neural Architecture Search

Keywords

image classification; network architecture; neural architecture search; efficient computing; parameter optimization; Monte Carlo estimate; differentiable search; differentiable architecture search
