Training Multi-Object Detector by Estimating Bounding Box Distribution for Input Image

Jaeyoung Yoo; Hojun Lee; Inseop Chung; Geonseok Seo; Nojun Kwak

ICCV 2021

Abstract

In multi-object detection using neural networks, the fundamental problem is: "How should the network learn a variable number of bounding boxes in different input images?" Previous methods train a multi-object detection network through a procedure that directly assigns the ground-truth bounding boxes to specific locations of the network's output. However, this procedure makes the training of a multi-object detection network overly heuristic and complicated. In this paper, we reformulate the multi-object detection task as a problem of density estimation of bounding boxes. Instead of assigning each ground truth to specific locations of the network's output, we train a network by estimating the probability density of bounding boxes in an input image using a mixture model. For this purpose, we propose a novel network for object detection called Mixture Density Object Detector (MDOD), and the corresponding objective function for the density-estimation-based training. We applied MDOD to the MS COCO dataset. Our proposed method not only approaches multi-object detection in a new way, but also improves detection performance. The code is available at https://github.com/yoojy31/MDOD.
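The density-estimation idea in the abstract can be illustrated with a small sketch. It is not the authors' implementation: the abstract does not say which component distribution MDOD uses, so the sketch assumes diagonal Gaussian components, and the names `mixture_nll`, `pi`, `mu`, and `sigma` are hypothetical. It shows how a fixed set of K mixture parameters can score any number of ground-truth boxes without assigning each box to an output location.

```python
import numpy as np

def mixture_nll(gt_boxes, pi, mu, sigma):
    """Mean negative log-likelihood of ground-truth boxes under a mixture.

    Assumption: diagonal Gaussian components (chosen for illustration only).

    gt_boxes: (N, 4) ground-truth boxes for one image; N may vary per image.
    pi:       (K,)   mixture weights, positive and summing to 1.
    mu:       (K, 4) component means, one box per component.
    sigma:    (K, 4) per-coordinate standard deviations, all positive.
    """
    # Standardized residual of every box against every component: (N, K, 4).
    diff = (gt_boxes[:, None, :] - mu[None, :, :]) / sigma[None, :, :]
    # Per-coordinate Gaussian log-density, summed over the 4 coordinates: (N, K).
    log_comp = -0.5 * diff**2 - np.log(sigma)[None] - 0.5 * np.log(2 * np.pi)
    log_comp = log_comp.sum(axis=-1)
    # Weighted mixture log-likelihood via a numerically stable log-sum-exp.
    log_mix = np.log(pi)[None, :] + log_comp
    m = log_mix.max(axis=1, keepdims=True)
    log_lik = m[:, 0] + np.log(np.exp(log_mix - m).sum(axis=1))
    return -log_lik.mean()

# Usage: two components score two ground-truth boxes; no box-to-output assignment.
gt = np.array([[0.1, 0.2, 0.5, 0.6], [0.4, 0.4, 0.9, 0.8]])
pi = np.array([0.5, 0.5])
mu = np.array([[0.1, 0.2, 0.5, 0.6], [0.4, 0.4, 0.9, 0.8]])
sigma = np.full((2, 4), 0.1)
loss = mixture_nll(gt, pi, mu, sigma)
```

In a real detector the mixture parameters would be produced by the network for each input image and the loss minimized by gradient descent; here they are plain arrays so the likelihood computation stays visible.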

Topics

Deep Learning > Architectures > Neural Networks
Computer Vision > Analysis > Object Detection
Machine Learning > Core Methods > Probabilistic Modeling
Machine Learning > Learning Types > Deep Learning
Mathematics & Optimization > Probability > Stochastic Processes

Keywords

density estimation; object detection; mixture model; bounding box; neural network; bounding box estimation
