A Linear-Time Kernel Goodness-of-Fit Test

Wittawat Jitkrittum; Wenkai Xu; Zoltan Szabo; Kenji Fukumizu; Arthur Gretton

2017 NIPS NeurIPS 2017

A Linear-Time Kernel Goodness-of-Fit Test

Abstract

We propose a novel adaptive test of goodness-of-fit, with computational cost linear in the number of samples. We learn the test features that best indicate the differences between observed samples and a reference model, by minimizing the false negative rate. These features are constructed via Stein's method, meaning that it is not necessary to compute the normalising constant of the model. We analyse the asymptotic Bahadur efficiency of the new test, and prove that under a mean-shift alternative, our test always has greater relative efficiency than a previous linear-time kernel test, regardless of the choice of parameters for that test. In experiments, the performance of our method exceeds that of the earlier linear-time test, and matches or exceeds the power of a quadratic-time kernel test. In high dimensions and where model structure may be exploited, our goodness of fit test performs far better than a quadratic-time two-sample test based on the Maximum Mean Discrepancy, with samples drawn from the model.

🌉 Interdisciplinary Bridge — Data Science & Analytics and Machine Learning and Mathematics & Optimization

📈 Trend Setter — Statistics

🧭 Keyword Pioneer — bahadur efficiency

🐣 Hot Topic Early Bird — maximum mean discrepancy

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Wittawat Jitkrittum , Wenkai Xu , Zoltan Szabo , Kenji Fukumizu , Arthur Gretton

Topics

Machine Learning > Optimization & Theory > Statistical Learning Data Science & Analytics > Methods > Data Mining Mathematics & Optimization > Mathematics > Statistics Machine Learning > Optimization & Theory > Statistics Machine Learning > Core Methods > Kernel Methods Mathematics & Optimization > Statistics > Statistics

Keywords

statistical testing two-sample test maximum mean discrepancy statistical test goodness-of-fit test goodness of fit stein method kernel methods stein's method bahadur efficiency

Download PDF

Related papers

High-Order Attention Models for Visual Question Answering 2017

Breaking the Nonsmooth Barrier: A Scalable Parallel Method for Composite Optimization 2017

Premise Selection for Theorem Proving by Deep Graph Embedding 2017

Neural Program Meta-Induction 2017

Safe and Nested Subgame Solving for Imperfect-Information Games 2017