Do Outliers Ruin Collaboration?

Mingda Qiao

2018 ICML ICML 2018

Do Outliers Ruin Collaboration?

Abstract

We consider the problem of learning a binary classifier from $n$ different data sources, among which at most an $\eta$ fraction are adversarial. The overhead is defined as the ratio between the sample complexity of learning in this setting and that of learning the same hypothesis class on a single data distribution. We present an algorithm that achieves an $O(\eta n + \ln n)$ overhead, which is proved to be worst-case optimal. We also discuss the potential challenges to the design of a computationally efficient learning algorithm with a small overhead.

❓ The Questioner

🌉 Interdisciplinary Bridge — Artificial Intelligence and Machine Learning

📈 Trend Setter — Robust Learning

🐣 Hot Topic Early Bird — federated learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Mingda Qiao

Topics

Artificial Intelligence > Learning Paradigms > Federated Learning Machine Learning > Learning Types > Adversarial Learning Machine Learning > Optimization & Theory > Learning Theory Machine Learning > Learning Types > Robust Learning Machine Learning > Learning Types > Multi-Source Learning

Keywords

federated learning sample complexity distributed learning collaborative learning robust learning binary classifier adversarial noise adversarial outlier

Download PDF

Related papers

Rectify Heterogeneous Models with Semantic Mapping 2018

Bayesian Optimization of Combinatorial Structures 2018

The Well-Tempered Lasso 2018

Approximation Algorithms for Cascading Prediction Models 2018

Classification from Pairwise Similarity and Unlabeled Data 2018