Communication-efficient Distributed Sparse Linear Discriminant Analysis

Lu Tian; Quanquan Gu

AISTATS 2017

Abstract

We propose a communication-efficient distributed estimation method for sparse linear discriminant analysis (LDA) in the high-dimensional regime. Our method distributes a dataset of size N across m machines and computes a local sparse LDA estimator on each machine from its data subset of size N/m. After this local estimation step, our method aggregates the debiased local estimators from the m machines and sparsifies the aggregated estimator. We show that the aggregated estimator attains the same statistical rate as the centralized estimation method, as long as the number of machines m is chosen appropriately. Moreover, we prove that our method attains model selection consistency under a milder condition than the centralized method. Experiments on both synthetic and real datasets corroborate our theory.
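The pipeline described in the abstract (split, estimate locally, debias, average, sparsify) can be illustrated with a minimal sketch. The abstract does not specify the local estimator, the debiasing correction, or the sparsification rule, so everything below is an assumption made for illustration: the local estimator is an L1-penalized quadratic program solved by proximal gradient, the debiasing step is a one-step correction that uses a ridge-regularized inverse covariance as a stand-in for a sparse precision-matrix estimator, and the final step is hard thresholding. All function names and parameters (`local_sparse_lda`, `debias`, `distributed_sparse_lda`, `lam`, `ridge`, `thresh`) are hypothetical and not taken from the paper.

```python
import numpy as np


def soft_threshold(x, t):
    """Elementwise soft-thresholding, the proximal operator of t * ||x||_1."""
    return np.sign(x) * np.maximum(np.abs(x) - t, 0.0)


def local_sparse_lda(X1, X2, lam, n_iter=500):
    """Assumed local estimator: minimise 0.5 b'Sb - d'b + lam * ||b||_1,
    where S is the pooled sample covariance and d the mean difference."""
    mu_d = X1.mean(axis=0) - X2.mean(axis=0)
    Xc = np.vstack([X1 - X1.mean(axis=0), X2 - X2.mean(axis=0)])
    sigma = Xc.T @ Xc / Xc.shape[0]
    # Step size 1/L, with L the largest eigenvalue of sigma.
    step = 1.0 / max(np.linalg.eigvalsh(sigma)[-1], 1e-12)
    beta = np.zeros(sigma.shape[0])
    for _ in range(n_iter):
        grad = sigma @ beta - mu_d
        beta = soft_threshold(beta - step * grad, step * lam)
    return beta, sigma, mu_d


def debias(beta, sigma, mu_d, ridge=0.1):
    """Assumed one-step bias correction. The ridge inverse is a simple
    stand-in for a sparse precision-matrix estimate."""
    theta = np.linalg.inv(sigma + ridge * np.eye(sigma.shape[0]))
    return beta - theta @ (sigma @ beta - mu_d)


def distributed_sparse_lda(X1, X2, m, lam, thresh):
    """Split each class across m 'machines', estimate and debias locally,
    average the debiased estimators, then hard-threshold the average."""
    parts1 = np.array_split(X1, m)
    parts2 = np.array_split(X2, m)
    debiased = [debias(*local_sparse_lda(a, b, lam))
                for a, b in zip(parts1, parts2)]
    avg = np.mean(debiased, axis=0)
    return np.where(np.abs(avg) > thresh, avg, 0.0)
```

In this sketch each machine sends back a single p-dimensional vector, so only one round of communication is needed. Calling `distributed_sparse_lda(X1, X2, m=4, lam=0.1, thresh=0.3)` on two class samples `X1` and `X2` (rows are observations) returns a sparse estimate of the discriminant direction; the values of `lam` and `thresh` here are arbitrary and would need tuning in practice.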

Authors

Lu Tian, Quanquan Gu

Topics

Machine Learning > Core Methods > Classification
Machine Learning > Optimization & Theory > Distributed Learning

Keywords

distributed learning, sparse estimation, high-dimensional statistics, linear discriminant analysis, model selection consistency
