Visual Tracking via Hierarchical Deep Reinforcement Learning

Dawei Zhang; Zhonglong Zheng; Riheng Jia; Minglu Li

AAAI 2021

Abstract

Visual tracking has made great progress thanks to numerous algorithms. However, deep trackers based on classification or Siamese networks still have their own limitations. In this work, we show how to teach machines to track a generic object in videos the way humans do, using a few search steps to perform tracking. By constructing a Markov decision process in Deep Reinforcement Learning (DRL), our agents learn to make hierarchical decisions on tracking mode and motion estimation. Specifically, our Hierarchical DRL framework is composed of a Siamese-based observation network that models the motion information of an arbitrary target, a policy network for mode switching, and an actor-critic network for box regression. This tracking strategy is more in line with the human behavior paradigm, and it copes effectively and efficiently with fast motion, background clutter, and large deformations. Extensive experiments on the GOT-10k, OTB-100, UAV-123, VOT, and LaSOT tracking benchmarks demonstrate that the proposed tracker achieves state-of-the-art performance while running in real time.
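The hierarchical decision loop described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the mode names ("search" and "stop"), the step limit, the box parameterization, and the toy stand-ins for the three networks are all assumptions made for illustration. The sketch only shows the control flow in which a high-level policy picks a tracking mode and a low-level actor proposes a box adjustment, repeated for a few search steps per frame.

```python
from dataclasses import dataclass
from typing import Callable, Tuple

Box = Tuple[float, float, float, float]  # (cx, cy, w, h); hypothetical parameterization


@dataclass
class HierarchicalTracker:
    """Control-flow sketch only; the three callables stand in for learned networks."""
    observe: Callable[[Box], Tuple[float, ...]]            # stand-in for the observation network
    choose_mode: Callable[[Tuple[float, ...]], str]        # stand-in for the mode-switch policy
    regress: Callable[[Tuple[float, ...]], Tuple[float, ...]]  # stand-in for the actor
    max_steps: int = 5                                     # assumed cap on search steps per frame

    def track_frame(self, box: Box) -> Tuple[Box, int]:
        steps = 0
        while steps < self.max_steps:
            obs = self.observe(box)
            # High-level decision: keep searching or accept the current box.
            if self.choose_mode(obs) == "stop":
                break
            # Low-level decision: relative shift and scale change of the box.
            dx, dy, dw, dh = self.regress(obs)
            cx, cy, w, h = box
            box = (cx + dx * w, cy + dy * h, w * (1.0 + dw), h * (1.0 + dh))
            steps += 1
        return box, steps


def make_toy_tracker(target: Box) -> HierarchicalTracker:
    """Builds a tracker whose 'networks' are hand-written rules around a known target."""

    def observe(box: Box) -> Tuple[float, ...]:
        # Offset of the target relative to the current box, normalized by box size.
        cx, cy, w, h = box
        tx, ty, tw, th = target
        return ((tx - cx) / w, (ty - cy) / h, tw / w - 1.0, th / h - 1.0)

    def choose_mode(obs: Tuple[float, ...]) -> str:
        # Stop once every normalized offset is small; the threshold is arbitrary.
        return "stop" if max(abs(v) for v in obs) < 0.05 else "search"

    def regress(obs: Tuple[float, ...]) -> Tuple[float, ...]:
        # Move halfway toward the target on each search step.
        return tuple(0.5 * v for v in obs)

    return HierarchicalTracker(observe, choose_mode, regress, max_steps=10)
```

With a toy target at (50, 50, 20, 20) and a starting box at (40, 45, 20, 20), calling `make_toy_tracker(target).track_frame(start)` moves the box toward the target over a handful of search steps and then stops. In a learned tracker the three callables would be replaced by trained networks, and the stopping rule would come from the policy rather than a fixed threshold.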

Authors

Dawei Zhang, Zhonglong Zheng, Riheng Jia, Minglu Li

Topics

Artificial Intelligence > Core AI > Trajectory Prediction

Computer Vision > Analysis > Object Tracking

Reinforcement Learning > Methods > Deep RL

Artificial Intelligence > Core AI > Robotics

Keywords

deep reinforcement learning, visual object tracking, object tracking, hierarchical reinforcement learning, real-time tracking, visual tracking, Siamese network, hierarchical decision making
