Unsupervised Supervised Learning II: Margin-Based Classification Without Labels

Krishnakumar Balasubramanian; Pinar Donmez; Guy Lebanon

2011 JMLR JMLR 2011

Unsupervised Supervised Learning II: Margin-Based Classification Without Labels

Abstract

Many popular linear classifiers, such as logistic regression, boosting, or SVM, are trained by optimizing a margin-based risk function. Traditionally, these risk functions are computed based on a labeled data set. We develop a novel technique for estimating such risks using only unlabeled data and the marginal label distribution. We prove that the proposed risk estimator is consistent on high-dimensional data sets and demonstrate it on synthetic and real-world data. In particular, we show how the estimate is used for evaluating classifiers in transfer learning, and for training classifiers with no labeled data whatsoever. [abs] [ pdf ][ bib ] © JMLR 2011. (edit, beta)

🧭 Keyword Pioneer — risk estimation

🐣 Hot Topic Early Bird — transfer learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Krishnakumar Balasubramanian , Pinar Donmez , Guy Lebanon

Topics

Machine Learning > Learning Types > Weakly Supervised Learning

Keywords

transfer learning margin-based classification risk estimation marginal distribution high-dimensional datum unlabeled datum

Download PDF

Related papers

MSVMpack: A Multi-Class Support Vector Machine Package 2011

Multitask Sparsity via Maximum Entropy Discrimination 2011

Training SVMs Without Offset 2011

Logistic Stick-Breaking Process 2011

Learning Multi-modal Similarity 2011