Learning Globally Optimized Object Detector via Policy Gradient

Yongming Rao; Dahua Lin; Jiwen Lu; Jie Zhou

CVPR 2018

Abstract

In this paper, we propose a simple yet effective method for learning a globally optimized object detector. The method is a simple modification of the standard cross-entropy gradient, inspired by the REINFORCE algorithm. For each detection candidate, the cross-entropy gradient is adaptively adjusted according to the overall mean Average Precision (mAP) of the current state, which yields more effective gradients and global optimization of the detection results while adding no computational overhead. Benefiting from the more precise gradients produced by this global optimization, our framework significantly improves state-of-the-art object detectors. Furthermore, since the method operates only on scores and bounding boxes and does not modify the detector architecture, it can be easily applied to off-the-shelf modern object detection frameworks.
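The abstract does not give the exact update rule, so the following is only a minimal sketch of the general idea of a REINFORCE-style adjustment: each candidate's standard cross-entropy gradient (softmax probabilities minus the one-hot label) is scaled by a scalar reward. The function names and the `rewards` values are hypothetical; in the paper the adjustment is derived from the overall mAP of the current detection state, which is not computed here.

```python
import math

def softmax(logits):
    # Numerically stable softmax over one candidate's class logits.
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def cross_entropy_grad(logits, label):
    # Standard gradient of cross-entropy w.r.t. the logits:
    # softmax(logits) - one_hot(label).
    probs = softmax(logits)
    return [p - (1.0 if k == label else 0.0) for k, p in enumerate(probs)]

def reward_weighted_grads(batch_logits, labels, rewards):
    # ASSUMPTION: `rewards` holds one scalar per detection candidate,
    # standing in for an mAP-derived signal. This is an illustration,
    # not the paper's exact formulation.
    return [
        [r * g for g in cross_entropy_grad(logits, y)]
        for logits, y, r in zip(batch_logits, labels, rewards)
    ]

# Toy usage: two candidates, three classes, made-up rewards.
batch_logits = [[2.0, 0.5, -1.0], [0.1, 0.2, 0.3]]
labels = [0, 2]
rewards = [1.0, 0.25]
grads = reward_weighted_grads(batch_logits, labels, rewards)
```

With a reward of 1.0 the result reduces to the plain cross-entropy gradient, so the adjustment only rescales an update the detector already computes. That is consistent with the claim that no architectural change is needed.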

Topics

Computer Vision > Analysis > Object Detection
Reinforcement Learning > Methods > Deep RL
Machine Learning > Learning Types > Reinforcement Learning
Deep Learning > Optimization & Theory > Optimization

Keywords

reinforcement learning, policy gradient, object detection, global optimization, cross-entropy loss, mean average precision

Related papers

Multi-Shot Pedestrian Re-Identification via Sequential Decision Making 2018

Multi-Cue Correlation Filters for Robust Visual Tracking 2018

Pointwise Convolutional Neural Networks 2018

Learning Attentions: Residual Attentional Siamese Network for High Performance Online Visual Tracking 2018

Image Generation From Scene Graphs 2018