Convergence Rates for Gaussian Mixtures of Experts

nhật Hồ; Chiao-Yu Yang; Michael I. Jordan

2022 JMLR JMLR 2022

Convergence Rates for Gaussian Mixtures of Experts

Abstract

We provide a theoretical treatment of over-specified Gaussian mixtures of experts with covariate-free gating networks. We establish the convergence rates of the maximum likelihood estimation (MLE) for these models. Our proof technique is based on a novel notion of algebraic independence of the expert functions. Drawing on optimal transport, we establish a connection between the algebraic independence of the expert functions and a certain class of partial differential equations (PDEs) with respect to the parameters. Exploiting this connection allows us to derive convergence rates for parameter estimation. [abs] [ pdf ][ bib ] © JMLR 2022. (edit, beta)

🌉 Interdisciplinary Bridge — Machine Learning and Mathematics & Optimization

🐣 Hot Topic Early Bird — mixture of expert

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

🧭 Keyword Pioneer — gaussian mixture of expert

Authors

nhật Hồ , Chiao-Yu Yang , Michael I. Jordan

Topics

Machine Learning > Core Methods > Regression Machine Learning > Optimization & Theory > Statistical Learning Mathematics & Optimization > Mathematics > Statistics Machine Learning > Core Methods > Probabilistic Modeling Machine Learning > Optimization & Theory > Statistics Mathematics & Optimization > Probability > Stochastic Processes

Keywords

optimal transport maximum likelihood maximum likelihood estimation convergence rate partial differential equation mixture of expert gaussian mixture gaussian mixture of expert

Download PDF

Related papers

Prior Adaptive Semi-supervised Learning with Application to EHR Phenotyping 2022

LinCDE: Conditional Density Estimation via Lindsey's Method 2022

Causal Classification: Treatment Effect Estimation vs. Outcome Prediction 2022

Provable Tensor-Train Format Tensor Completion by Riemannian Optimization 2022

Power Iteration for Tensor PCA 2022