Object-Centric Multiple Object Tracking

Zixu Zhao; Jiaze Wang; Max Horn; Yizhuo Ding; Tong He; Zechen Bai; Dominik Zietlow; Carl-Johann Simon-Gabriel; Bing Shuai; Zhuowen Tu; Thomas Brox; Bernt Schiele; Yanwei Fu; Francesco Locatello; Zheng Zhang; Tianjun Xiao

ICCV 2023

Abstract

Unsupervised object-centric learning methods partition scenes into entities without additional localization information, which makes them excellent candidates for reducing the annotation burden of multiple-object tracking (MOT) pipelines. Unfortunately, they lack two key properties: objects are often split into parts, and they are not tracked consistently over time. In fact, state-of-the-art models achieve pixel-level accuracy and temporal consistency by relying on supervised object detection with additional ID labels for association through time. This paper proposes a video object-centric model for MOT. It consists of an index-merge module that adapts the object-centric slots into detection outputs and an object memory module that builds complete object prototypes to handle occlusions. Benefiting from object-centric learning, we require only sparse detection labels (0%-6.25%) for object localization and feature binding. Relying on our self-supervised Expectation-Maximization-inspired loss for object association, our approach requires no ID labels. In our experiments, the approach significantly narrows the gap between the existing object-centric model and the fully supervised state of the art, and it outperforms several unsupervised trackers that also do not require ID labels.

Topics

Machine Learning > Learning Types > Self-Supervised Learning
Machine Learning > Learning Types > Unsupervised Learning
Computer Vision > Analysis > Object Detection
Computer Vision > Analysis > Object Tracking

Keywords

unsupervised learning; object detection; self-supervised learning; multiple object tracking; slot attention

Related papers

PVT++: A Simple End-to-End Latency-Aware Visual Tracking Framework 2023

Periodically Exchange Teacher-Student for Source-Free Object Detection 2023

Stable and Causal Inference for Discriminative Self-supervised Deep Visual Representations 2023

Minimal Solutions to Uncalibrated Two-view Geometry with Known Epipoles 2023

3D Neural Embedding Likelihood: Probabilistic Inverse Graphics for Robust 6D Pose Estimation 2023