A Finite Sample Analysis of the Naive Bayes Classifier

Daniel Berend; Aryeh Kontorovich

2015 JMLR JMLR 2015

A Finite Sample Analysis of the Naive Bayes Classifier

Abstract

We revisit, from a statistical learning perspective, the classical decision-theoretic problem of weighted expert voting. In particular, we examine the consistency (both asymptotic and finitary) of the optimal Naive Bayes weighted majority and related rules. In the case of known expert competence levels, we give sharp error estimates for the optimal rule. We derive optimality results for our estimates and also establish some structural characterizations. When the competence levels are unknown, they must be empirically estimated. We provide frequentist and Bayesian analyses for this situation. Some of our proof techniques are non-standard and may be of independent interest. Several challenging open problems are posed, and experimental results are provided to illustrate the theory. [abs] [ pdf ][ bib ] © JMLR 2015. (edit, beta)

🧭 Keyword Pioneer — weighted voting

🐣 Hot Topic Early Bird — statistical learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Security & Privacy

Authors

Daniel Berend , Aryeh Kontorovich

Topics

Machine Learning > Core Methods > Classification Machine Learning > Optimization & Theory > Statistical Learning

Keywords

statistical learning finite sample analysis naive baye weighted voting naive bayes classifier weighted expert voting

Download PDF

Related papers

The Sample Complexity of Learning Linear Predictors with the Squared Loss 2015

Preface to this Special Issue 2015

Fast Cross-Validation via Sequential Testing 2015

Online Tensor Methods for Learning Latent Variable Models 2015

CEKA: A Tool for Mining the Wisdom of Crowds 2015