000 01443 a2200217 4500
003 OSt
005 20250910123601.0
008 250910b |||||||| |||| 00| 0 eng d
020 _a9783031190667
082 _a006.31
_bJ837o
100 _aJoshi, Gauri
245 _aOptimization algorithms for distributed machine learning
_cGauri Joshi
260 _bSpringer
_c2023
_aSwitzerland
300 _axiii, 127p
440 _aSynthesis lectures on learning, networks, and algorithms
490 _a/ edited by Lei Ying
520 _aThis book discusses state-of-the-art stochastic optimization algorithms for distributed machine learning and analyzes their convergence speed. The book first introduces stochastic gradient descent (SGD) and its distributed version, synchronous SGD, where the task of computing gradients is divided across several worker nodes. The author discusses several algorithms that improve the scalability and communication efficiency of synchronous SGD, such as asynchronous SGD, local-update SGD, quantized and sparsified SGD, and decentralized SGD. For each of these algorithms, the book analyzes its error versus iterations convergence, and the runtime spent per iteration. The author shows that each of these strategies to reduce communication or synchronization delays encounters a fundamental trade-off between error and runtime.
650 _aComputer algorithms
650 _aMachine learning
942 _cREF
999 _c567635
_d567635