A Potential-based Framework for Online Multi-class Learning with Partial Feedback

Shijun Wang; Rong Jin; Hamed Valizadegan

2010 AISTATS AISTATS 2010

A Potential-based Framework for Online Multi-class Learning with Partial Feedback

Abstract

We study the problem of online multi-class learning with partial feedback: in each trial of online learning, instead of providing the true class label for a given instance, the oracle will only reveal to the learner if the predicted class label is correct. We present a general framework for online multi-class learning with partial feedback that adapts the potential-based gradient descent approaches (Cesa-Bianchi & Lugosi, 2006). The generality of the proposed framework is verified by the fact that Banditron (Kakade et al., 2008) is indeed a special case of our work if the potential function is set to be the squared $L_2$ norm of the weight vector. We propose an exponential gradient algorithm for online multi-class learning with partial feedback. Compared to the Banditron algorithm, the exponential gradient algorithm is advantageous in that its mistake bound is independent from the dimension of data, making it suitable for classifying high dimensional data. Our empirical study with four data sets show that the proposed algorithm for online learning with partial feedback is more effective than the Banditron algorithm.

🚀 Conference Pioneer — AISTATS 2010

🌉 Interdisciplinary Bridge — Machine Learning and Mathematics & Optimization

🧭 Keyword Pioneer — partial feedback

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Speech & Audio

📈 Trend Setter — Multi-Armed Bandits

🐣 Hot Topic Early Bird — multi-class classification

Authors

Shijun Wang , Rong Jin , Hamed Valizadegan

Topics

Machine Learning > Core Methods > Classification Mathematics & Optimization > Optimization > Online Algorithms Machine Learning > Learning Types > Online Learning Machine Learning > Learning Types > Multi-Armed Bandits Machine Learning > Learning Types > Multi-Class Classification

Keywords

multi-class classification partial feedback bandit setting mistake bound online multi-class learning potential-based gradient descent potential-based learning exponential gradient

Download PDF

Related papers

Towards Understanding Situated Natural Language 2010

Mass Fatality Incident Identification based on nuclear DNA evidence 2010

Locally Linear Denoising on Image Manifolds 2010

Negative Results for Active Learning with Convex Losses 2010

Collaborative Filtering on a Budget 2010