TY - GEN AU - Joshi, Gauri TI - Optimization algorithms for distributed machine learning T2 - / edited by Lei Ying SN - 9783031190667 U1 - 006.31 PY - 2023/// CY - Switzerland PB - Springer KW - Computer algorithms KW - Machine learning N2 - This book discusses state-of-the-art stochastic optimization algorithms for distributed machine learning and analyzes their convergence speed. The book first introduces stochastic gradient descent (SGD) and its distributed version, synchronous SGD, where the task of computing gradients is divided across several worker nodes. The author discusses several algorithms that improve the scalability and communication efficiency of synchronous SGD, such as asynchronous SGD, local-update SGD, quantized and sparsified SGD, and decentralized SGD. For each of these algorithms, the book analyzes its error versus iterations convergence, and the runtime spent per iteration. The author shows that each of these strategies to reduce communication or synchronization delays encounters a fundamental trade-off between error and runtime ER -