Anytime Exploitation of Stragglers in Synchronous Stochastic Gradient Descent
Autor: | Stark C. Draper, Benjamin Gharachorloo, Nuwan S. Ferdinand |
---|---|
Rok vydání: | 2017 |
Předmět: |
Stochastic gradient descent
Computer science Robustness (computer science) Linear regression 0202 electrical engineering electronic engineering information engineering 020206 networking & telecommunications 02 engineering and technology 010501 environmental sciences 01 natural sciences Algorithm 0105 earth and related environmental sciences |
Zdroj: | ICMLA |
DOI: | 10.1109/icmla.2017.0-166 |
Popis: | In this paper we propose an approach to parallelizing synchronous stochastic gradient descent (SGD) that we term “Anytime-Gradients”. The Anytime-Gradients is designed to exploit the work completed by slow compute nodes or “stragglers”. In many approaches work completed by these nodes, while only partial, is discarded completely. To maintain synchronization in our approach, each computational epoch is of fixed duration, and at the end of each epoch, workers send updated parameter vectors to a master mode for combination. The master weights each update by the amount of work done. The Anytime-Gradients scheme is robust to both persistent and non-persistent stragglers and requires no prior knowledge about processor abilities. We show that the scheme effectively exploits stragglers and outperforms existing methods. |
Databáze: | OpenAIRE |
Externí odkaz: |