Butterfly Mixing: Accelerating Incremental-Update Algorithms on Clusters

Huasha Zhao; John F Canny

doi:10.1137/1.9781611972832.87

Abstract

1 min read

Incremental model-update strategies are widely used in machine learning and data mining.By "incremental update" we refer to models that are updated many times using small subsets of the training data.Two wellknown examples are stochastic gradient and MCMC.Both provide fast sequential performance and have generated many of the best-performing methods for particular problems (logistic regression, SVM, LDA etc.).But these methods are difficult to adapt to parallel or cluster settings because of the overhead of distributing model updates through the network.Updates can be locally batched to reduce communication overhead, but convergence typically suffers as the batch size increases.In this paper we introduce and analyze butterfly mixing, an approach which interleaves communication with computation.We evaluate butterfly mixing on stochastic gradient algorithms for logistic regression and SVM, on two datasets.Results show that butterfly mix steps are fast and failure-tolerant, and overall we achieved a 3.3x speedup over full mix (AllReduce) on an Amazon EC2 cluster.

Butterfly Mixing: Accelerating Incremental-Update Algorithms on Clusters

Abstract

Discussion(0)

Related publications

Big data analytics with small footprint

Stochastic Approximation with Delayed Updates: Finite-Time Rates under Markovian Sampling

An Incremental Learning Framework for Human-Like Redundancy Optimization of Anthropomorphic Manipulators

KNNENS: A <i>k</i>-Nearest Neighbor Ensemble-Based Method for Incremental Learning Under Data Stream With Emerging New Classes

RFID Security Protocol Based on Synchronous Update of Random Number

Related publications

Article2013
Big data analytics with small footprint
Article2013

Preprint2024
Stochastic Approximation with Delayed Updates: Finite-Time Rates under Markovian Sampling
Preprint2024

Article2020
An Incremental Learning Framework for Human-Like Redundancy Optimization of Anthropomorphic Manipulators
Article2020

Article2022
KNNENS: A <i>k</i>-Nearest Neighbor Ensemble-Based Method for Incremental Learning Under Data Stream With Emerging New Classes
Article2022

Article2013
RFID Security Protocol Based on Synchronous Update of Random Number
Article2013