Contrastive learning, multi-view redundancy, and linear models

Christopher Tosh; Akshay Krishnamurthy; Daniel Hsu

ALT 2021

Abstract

Self-supervised learning is an empirically successful approach to unsupervised learning based on creating artificial supervised learning problems. A popular self-supervised approach to representation learning is contrastive learning, which leverages naturally occurring pairs of similar and dissimilar data points, or multiple views of the same data. This work provides a theoretical analysis of contrastive learning in the multi-view setting, where two views of each datum are available. The main result is that linear functions of the learned representations are nearly optimal on downstream prediction tasks whenever the two views provide redundant information about the label.
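To make the "artificial supervised learning problem" concrete, the following is a minimal sketch of how paired views might be turned into a binary classification dataset. It is an illustration under stated assumptions, not the authors' construction: the function name `make_contrastive_pairs`, its parameters, and the cyclic-shift rule for choosing mismatched views are all hypothetical.

```python
import random


def make_contrastive_pairs(views_x, views_z, seed=0):
    """Build (x, z, label) examples from two aligned lists of views.

    views_x[i] and views_z[i] are assumed to be two views of the same datum.
    Label 1 marks a true pair; label 0 marks x paired with the view of a
    different datum. All names here are hypothetical.
    """
    n = len(views_x)
    if n != len(views_z) or n < 2:
        raise ValueError("need two aligned lists with at least two items")

    rng = random.Random(seed)
    # A nonzero cyclic shift guarantees every negative uses a different datum.
    offset = rng.randrange(1, n)

    examples = []
    for i in range(n):
        examples.append((views_x[i], views_z[i], 1))                 # similar pair
        examples.append((views_x[i], views_z[(i + offset) % n], 0))  # dissimilar pair
    return examples
```

A classifier trained to separate the two labels yields a representation of each view; the paper's result concerns how well linear functions of such a representation perform on downstream prediction.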


Topics

Machine Learning > Core Methods > Representation Learning
Machine Learning > Learning Types > Contrastive Learning
Machine Learning > Learning Types > Self-Supervised Learning

Keywords

representation learning, contrastive learning, self-supervised learning, multi-view learning, linear model

