Reducing Label Complexity by Learning From Bags

Sivan Sabato; Nathan Srebro; Naftali Tishby

AISTATS 2010

Abstract

We consider a supervised learning setting in which the main cost of learning is the number of training labels, and in which one can obtain a single label for a bag of examples, indicating only whether the bag contains a positive example, as in Multi-Instance Learning. We thus propose to create a training sample of bags and to use the obtained labels to learn to classify individual examples. We provide a theoretical analysis showing how to select the bag size as a function of the problem parameters, and prove that if the original labels are distributed unevenly, the number of required labels drops considerably when learning from bags. We demonstrate that finding a low-error separating hyperplane from bags is feasible in this setting using a simple iterative procedure similar to latent SVM. Experiments on synthetic and real data sets demonstrate the success of the approach.
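The bag-labeling setup described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' code: the function names `make_bags` and `positive_bag_probability`, the random grouping of examples, and the assumption that instance labels are drawn independently with positive rate `p` are all illustrative choices made here. It shows only how a single bag label is derived from instance labels and how the chance of a positive bag grows with bag size; it does not implement the paper's bag-size analysis or its iterative hyperplane procedure.

```python
import random


def make_bags(examples, labels, bag_size, seed=0):
    """Group examples into disjoint bags and label each bag.

    A bag is labeled 1 if at least one of its examples is positive,
    and 0 otherwise (the Multi-Instance Learning labeling rule).
    Leftover examples that do not fill a whole bag are dropped.
    """
    rng = random.Random(seed)
    indices = list(range(len(examples)))
    rng.shuffle(indices)  # hypothetical choice: bags are formed at random

    bags, bag_labels = [], []
    for start in range(0, len(indices) - bag_size + 1, bag_size):
        members = indices[start:start + bag_size]
        bags.append([examples[i] for i in members])
        bag_labels.append(int(any(labels[i] == 1 for i in members)))
    return bags, bag_labels


def positive_bag_probability(p, bag_size):
    """Probability that a bag holds at least one positive example,
    assuming instance labels are independent with positive rate p."""
    return 1.0 - (1.0 - p) ** bag_size


if __name__ == "__main__":
    # Ten examples, only one of which is positive.
    examples = list(range(10))
    labels = [0] * 9 + [1]
    bags, bag_labels = make_bags(examples, labels, bag_size=5)
    print(bags, bag_labels)

    # With rare positives, larger bags are positive far more often
    # than single examples are.
    for k in (1, 10, 50):
        print(k, positive_bag_probability(0.01, k))
```

Under the independence assumption, a rare positive class yields mostly negative labels when examples are labeled one at a time, whereas a bag label is positive with probability 1 - (1 - p)^k for bag size k, which gives an intuition for why uneven label distributions are the case where bags help.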

Topics

Machine Learning > Core Methods > Classification
Machine Learning > Learning Types > Weakly Supervised Learning
Machine Learning > Learning Types > Multi-Instance Learning

Keywords

multi-instance learning, weakly supervised learning, label complexity, latent variable model, bag classification, separating hyperplane, hyperplane learning
