P2SGrad: Refined Gradients for Optimizing Deep Face Models

Xiao Zhang; Rui Zhao; Junjie Yan; Mengya Gao; Yu Qiao; Xiaogang Wang; hongsheng Li

2019 CVPR CVPR 2019

P2SGrad: Refined Gradients for Optimizing Deep Face Models

Abstract

Cosine-based softmax losses significantly improve the performance of deep face recognition networks. However, these losses always include sensitive hyper-parameters which can make training process unstable, and it is very tricky to set suitable hyper parameters for a specific dataset. This paper addresses this challenge by directly designing the gradients for training in an adaptive manner. We first investigate and unify previous cosine softmax losses from the perspective of gradients. This unified view inspires us to propose a novel gradient called P2SGrad (Probability-to-Similarity Gradient), which leverages a cosine similarity instead of classification probability to control the gradients for updating neural network parameters. P2SGrad is adaptive and hyper-parameter free, which makes training process more efficient and faster. We evaluate our P2SGrad on three face recognition benchmarks, LFW, MegaFace, and IJB-C. The results show that P2SGrad is stable in training, robust to noise, and achieves state-of-the-art performance on all the three benchmarks.

🌉 Interdisciplinary Bridge — Computer Vision and Deep Learning

🧭 Keyword Pioneer — cosine-based softmax

🐣 Hot Topic Early Bird — gradient optimization

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Xiao Zhang , Rui Zhao , Junjie Yan , Mengya Gao , Yu Qiao , Xiaogang Wang , hongsheng Li

Topics

Deep Learning > Techniques > Model Architecture Computer Vision > Analysis > Face Recognition Deep Learning > Learning Types > Deep Learning Deep Learning > Optimization & Theory > Loss Functions

Keywords

neural network training face recognition deep learning gradient optimization deep neural network cosine-based softmax cosine softmax loss

Download PDF

Related papers

Fast Single Image Reflection Suppression via Convex Optimization 2019

Learning Video Representations From Correspondence Proposals 2019

ATOM: Accurate Tracking by Overlap Maximization 2019

Visual Tracking via Adaptive Spatially-Regularized Correlation Filters 2019

Edge-Labeling Graph Neural Network for Few-Shot Learning 2019