Clustering Oligarchies

Margareta Ackerman; Shai Ben-David; David Loker; Sivan Sabato

2013 AISTATS AISTATS 2013

Clustering Oligarchies

Abstract

We investigate the extent to which clustering algorithms are robust to the addition of a small, potentially adversarial, set of points. Our analysis reveals radical differences in the robustness of popular clustering methods. k-means and several related techniques are robust when data is clusterable, and we provide a quantitative analysis capturing the precise relationship between clusterability and robustness. In contrast, common linkage-based algorithms and several standard objective-function-based clustering methods can be highly sensitive to the addition of a small set of points even when the data is highly clusterable. We call such sets of points oligarchies. Lastly, we show that the behavior with respect to oligarchies of the popular Lloyd’s method changes radically with the initialization technique.

📈 Trend Setter — Fairness

🧭 Keyword Pioneer — linkage-based algorithm

🐣 Hot Topic Early Bird — adversarial robustness

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Margareta Ackerman , Shai Ben-David , David Loker , Sivan Sabato

Topics

Machine Learning > Core Methods > Clustering Machine Learning > Learning Types > Unsupervised Learning Machine Learning > Optimization & Theory > Theory Machine Learning > Application Areas > Fairness

Keywords

adversarial robustness k-means clustering theoretical analysis clustering algorithm linkage-based algorithm linkage-based clustering

Download PDF

Related papers

Consensus Ranking with Signed Permutations 2013

Ultrahigh Dimensional Feature Screening via RKHS Embeddings 2013

Collapsed Variational Bayesian Inference for Hidden Markov Models 2013

Learning Social Infectivity in Sparse Low-rank Networks Using Multi-dimensional Hawkes Processes 2013

Evidence Estimation for Bayesian Partially Observed MRFs 2013