Convergence Rate of Stochastic k-means

Cheng Tang; Claire Monteleoni

2017 AISTATS AISTATS 2017

Convergence Rate of Stochastic k-means

Abstract

We analyze online (Bottou & Bengio, 1994) and mini-batch (Sculley, 2010) k-means variants. Both scale up the widely used Lloyd’s algorithm via stochastic approximation, and have become popular for large-scale clustering and unsupervised feature learning. We show, for the first time, that they have global convergence towards “local optima” at rate $O(1/t)$ under general conditions. In addition, we show that if the dataset is clusterable, stochastic k-means with suitable initialization converges to an optimal k-means solution at rate $O(1/t)$ with high probability. The k-means objective is non-convex and non-differentiable; we exploit ideas from non-convex gradient-based optimization by providing a novel characterization of the trajectory of the k-means algorithm on its solution space, and circumvent its non-differentiability via geometric insights about the k-means update.

🧭 Keyword Pioneer — stochastic k-mean

🐣 Hot Topic Early Bird — non-convex optimization

🐝 Cross-Pollinator — Artificial Intelligence, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Reinforcement Learning

Authors

Cheng Tang , Claire Monteleoni

Topics

Machine Learning > Core Methods > Clustering Machine Learning > Learning Types > Unsupervised Learning Machine Learning > Optimization & Theory > Optimization Machine Learning > Optimization & Theory > Stochastic Processes

Keywords

non-convex optimization online clustering global convergence unsupervised feature learning convergence rate lloyd's algorithm stochastic k-mean mini-batch clustering k-means algorithm

Download PDF

Related papers

Conditions beyond treewidth for tightness of higher-order LP relaxations 2017

Non-square matrix sensing without spurious local minima via the Burer-Monteiro approach 2017

Tensor-Dictionary Learning with Deep Kruskal-Factor Analysis 2017

A Sub-Quadratic Exact Medoid Algorithm 2017

Performance Bounds for Graphical Record Linkage 2017