Population Matching Discrepancy and Applications in Deep Learning

Jianfei Chen; Chongxuan Li; Yizhong Ru; Jun Zhu

2017 NIPS NeurIPS 2017

Population Matching Discrepancy and Applications in Deep Learning

Abstract

A differentiable estimation of the distance between two distributions based on samples is important for many deep learning tasks. One such estimation is maximum mean discrepancy (MMD). However, MMD suffers from its sensitive kernel bandwidth hyper-parameter, weak gradients, and large mini-batch size when used as a training objective. In this paper, we propose population matching discrepancy (PMD) for estimating the distribution distance based on samples, as well as an algorithm to learn the parameters of the distributions using PMD as an objective. PMD is defined as the minimum weight matching of sample populations from each distribution, and we prove that PMD is a strongly consistent estimator of the first Wasserstein metric. We apply PMD to two deep learning tasks, domain adaptation and generative modeling. Empirical results demonstrate that PMD overcomes the aforementioned drawbacks of MMD, and outperforms MMD on both tasks in terms of the performance as well as the convergence speed.

🌉 Interdisciplinary Bridge — Deep Learning and Machine Learning

📈 Trend Setter — Generative Models

🧭 Keyword Pioneer — distribution distance

🐣 Hot Topic Early Bird — generative modeling

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Jianfei Chen , Chongxuan Li , Yizhong Ru , Jun Zhu

Topics

Machine Learning > Core Methods > Representation Learning Machine Learning > Optimization & Theory > Optimization Machine Learning > Optimization & Theory > Statistical Learning Machine Learning > Application Areas > Domain Adaptation Deep Learning > Optimization & Theory > Optimization Machine Learning > Learning Types > Generative Models Deep Learning > Learning Types > Domain Adaptation

Keywords

generative modeling domain adaptation optimal transport distribution matching maximum mean discrepancy generative model wasserstein metric distribution distance population matching discrepancy distribution distance estimation

Download PDF

Related papers

High-Order Attention Models for Visual Question Answering 2017

Breaking the Nonsmooth Barrier: A Scalable Parallel Method for Composite Optimization 2017

Premise Selection for Theorem Proving by Deep Graph Embedding 2017

Neural Program Meta-Induction 2017

Safe and Nested Subgame Solving for Imperfect-Information Games 2017