2007
NIPS
NeurIPS 2007
Parallelizing Support Vector Machines on Distributed Computers
Abstract
Support Vector Machines (SVMs) suffer from a widely recognized scalability problem in both memory use and computational time. To improve scalability, we have developed a parallel SVM algorithm (PSVM), which reduces memory use through performing a row-based, approximate matrix factorization, and which loads only essential data to each machine to perform parallel computation. Let $n$ denote the number of training instances, $p$ the reduced matrix dimension after factorization ($p$ is significantly smaller than $n$), and $m$ the number of machines. PSVM reduces the memory requirement from $\MO$($n^2$) to $\MO$($np/m$), and improves computation time to $\MO$($np^2/m$). Empirical studies on up to $500$ computers shows PSVM to be effective.
🌉
Interdisciplinary Bridge
— Machine Learning and Mathematics & Optimization
📈
Trend Setter
— Distributed Learning
🧭
Keyword Pioneer
— scalability
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Speech & Audio
🌱
Topic Pioneer
— Distributed Optimization
🐣
Hot Topic Early Bird
— matrix factorization
Authors
Topics
Machine Learning > Core Methods > Classification
Machine Learning > Optimization & Theory > Distributed Learning
Machine Learning > Optimization & Theory > Optimization
Mathematics & Optimization > Optimization > Continuous Optimization
Computer Science > Systems > Distributed Systems
Mathematics & Optimization > Optimization > Distributed Learning
Machine Learning > Core Methods > Kernel Methods
Machine Learning > Core Methods > Support Vector Machine
Machine Learning > Learning Types > Distributed Learning
Mathematics & Optimization > Optimization > Distributed Optimization