Co-Training with Insufficient Views

Wei Wang; Zhi-Hua Zhou

ACML 2013

Abstract

Co-training is a well-known semi-supervised learning paradigm that exploits unlabeled data using two views. Most previous theoretical analyses of co-training assume that each view is sufficient to correctly predict the label. However, this assumption rarely holds in real applications because of feature corruption or feature noise. In this paper, we present a theoretical analysis of co-training when neither view is sufficient. We define the diversity between the two views with respect to prediction confidence and prove that if the two views have large diversity, co-training can improve learning performance by exploiting unlabeled data even with insufficient views. We also discuss the relationship between view insufficiency and diversity, and give some implications for understanding the difference between co-training and co-regularization.
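
For readers unfamiliar with the paradigm, the following is a minimal sketch of a generic confidence-based co-training loop, not the analysis or algorithm of this paper. It assumes a toy nearest-centroid learner per view and uses the gap between centroid distances as a stand-in for prediction confidence; the names `NearestCentroid`, `co_train`, `rounds`, and `per_round` are hypothetical and chosen for illustration only. The toy data at the end has two clean (sufficient) views, so it does not model the insufficient-view setting the abstract studies.

```python
import random


def centroid(points):
    """Component-wise mean of a non-empty list of equal-length vectors."""
    n = len(points)
    return [sum(p[i] for p in points) / n for i in range(len(points[0]))]


def sq_dist(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))


class NearestCentroid:
    """Toy binary learner (illustrative assumption, not from the paper)."""

    def fit(self, X, y):
        # Requires at least one example of each class (0 and 1).
        self.c = {k: centroid([x for x, t in zip(X, y) if t == k]) for k in (0, 1)}
        return self

    def predict_with_confidence(self, x):
        d0, d1 = sq_dist(x, self.c[0]), sq_dist(x, self.c[1])
        # Confidence proxy: how much closer x is to one centroid than the other.
        return (0 if d0 < d1 else 1), abs(d0 - d1)


def fit_views(labeled):
    """Train one learner per view on the current labeled set."""
    y = [t for _, t in labeled]
    return [NearestCentroid().fit([x[v] for x, _ in labeled], y) for v in (0, 1)]


def co_train(labeled, unlabeled, rounds=5, per_round=2):
    """labeled: list of ((view1, view2), label); unlabeled: list of (view1, view2)."""
    labeled, pool = list(labeled), list(unlabeled)
    for _ in range(rounds):
        if not pool:
            break
        learners = fit_views(labeled)
        picked = {}
        for v in (0, 1):
            # Each view pseudo-labels the pool examples it is most confident about.
            scored = sorted(
                ((learners[v].predict_with_confidence(x[v]), i) for i, x in enumerate(pool)),
                key=lambda s: -s[0][1],
            )
            for (label, _), i in scored[:per_round]:
                picked.setdefault(i, label)
        # Pseudo-labeled examples join the labeled set used by both views.
        labeled += [(pool[i], label) for i, label in picked.items()]
        pool = [x for i, x in enumerate(pool) if i not in picked]
    return fit_views(labeled)


# Toy data: each view is a 1-D feature centred at -2 (class 0) or +2 (class 1).
rng = random.Random(0)


def sample(label):
    centre = -2.0 if label == 0 else 2.0
    return ([centre + rng.uniform(-0.5, 0.5)], [centre + rng.uniform(-0.5, 0.5)])


labeled = [(sample(0), 0), (sample(1), 1)]
truth = [i % 2 for i in range(20)]
unlabeled = [sample(t) for t in truth]
h1, h2 = co_train(labeled, unlabeled)
```

In this sketch the two learners share one growing labeled set, which is one common way to let each view teach the other; with noisy or corrupted views, confident pseudo-labels may be wrong, which is the situation the paper's analysis addresses.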

Topics

Machine Learning > Learning Types > Semi-Supervised Learning

Keywords

semi-supervised learning, prediction confidence, unlabeled data, feature noise, view diversity
