Revisiting Weakly Supervised Pre-Training of Visual Perception Models

Mannat Singh; Laura Gustafson; Aaron Adcock; Vinicius de Freitas Reis; Bugra Gedik; Raj Prateek Kosaraju; Dhruv Mahajan; Ross Girshick; Piotr Dollár; Laurens van der Maaten

2022 CVPR CVPR 2022

Revisiting Weakly Supervised Pre-Training of Visual Perception Models

Abstract

Model pre-training is a cornerstone of modern visual recognition systems. Although fully supervised pre-training on datasets like ImageNet is still the de-facto standard, recent studies suggest that large-scale weakly supervised pre-training can outperform fully supervised approaches. This paper revisits weakly-supervised pre-training of models using hashtag supervision with modern versions of residual networks and the largest-ever dataset of images and corresponding hashtags. We study the performance of the resulting models in various transfer-learning settings including zero-shot transfer. We also compare our models with those obtained via large-scale self-supervised learning. We find our weakly-supervised models to be very competitive across all settings, and find they substantially outperform their self-supervised counterparts. We also include an investigation into whether our models learned potentially troubling associations or stereotypes. Overall, our results provide a compelling argument for the use of weakly supervised learning in the development of visual recognition systems. Our models, Supervised Weakly through hashtAGs (SWAG), are available publicly.

🌉 Interdisciplinary Bridge — Computer Vision and Deep Learning and Machine Learning

🐣 Hot Topic Early Bird — visual perception

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Mannat Singh , Laura Gustafson , Aaron Adcock , Vinicius de Freitas Reis , Bugra Gedik , Raj Prateek Kosaraju , Dhruv Mahajan , Ross Girshick , Piotr Dollár , Laurens van der Maaten

Topics

Machine Learning > Learning Types > Weakly Supervised Learning Deep Learning > Techniques > Pretraining Computer Vision > Analysis > Object Detection Machine Learning > Learning Types > Transfer Learning Computer Vision > Core AI > Computer Vision Deep Learning > Learning Types > Transfer Learning Deep Learning > Learning Types > Weakly Supervised Learning

Keywords

image classification transfer learning visual perception self-supervised learning weakly supervised learning model pretraining visual recognition hashtag supervision

Download PDF

Related papers

UniCoRN: A Unified Conditional Image Repainting Network 2022

Why Discard if You Can Recycle?: A Recycling Max Pooling Module for 3D Point Cloud Analysis 2022

All-in-One Image Restoration for Unknown Corruption 2022

Stability-Driven Contact Reconstruction From Monocular Color Images 2022

Forecasting Characteristic 3D Poses of Human Actions 2022