Communication Efficient Distributed Machine Learning with the Parameter Server 论文

2014Neural Information Processing Systems引用 438
Stochastic Gradient Optimization TechniquesNeural Networks and ApplicationsSparse and Compressive Sensing Techniques

摘要

This paper describes a third-generation parameter server framework for distributed machine learning. This framework offers two relaxations to balance system performance and algorithm efficiency. We propose a new algorithm that takes advantage of this framework to solve non-convex non-smooth problems with convergence guarantees. We present an in-depth analysis of two large scale machine learning problems ranging from l1 -regularized logistic regression on CPUs to reconstruction ICA on GPUs, using 636TB of real data with hundreds of billions of samples and dimensions. We demonstrate using these examples that the parameter server framework is an effective and straightforward way to scale machine learning to larger problems and systems than have been previously achieved.