Efficient-Adam: Communication-Efficient Distributed Adam

Chen, Congliang; Shen, Li; Liu, Wei; Luo, Zhi-Quan


arXiv:2205.14473 (cs)

[Submitted on 28 May 2022 (v1), last revised 24 Aug 2023 (this version, v2)]



Abstract: Distributed adaptive stochastic gradient methods are widely used for large-scale nonconvex optimization, such as training deep learning models. However, their communication complexity for finding $\varepsilon$-stationary points has rarely been analyzed in the nonconvex setting. In this work, we present Efficient-Adam, a novel communication-efficient distributed Adam method in the parameter-server model for stochastic nonconvex optimization. Specifically, we incorporate a two-way quantization scheme into Efficient-Adam to reduce the communication cost between the workers and the server. At the same time, we adopt a two-way error feedback strategy to reduce the biases that the two-way quantization introduces on both the server and the workers. In addition, we establish the iteration complexity of Efficient-Adam for a class of quantization operators, and further characterize its communication complexity between the server and workers when an $\varepsilon$-stationary point is reached. Finally, we apply Efficient-Adam to a toy stochastic convex optimization problem and to training deep learning models on real-world vision and language tasks. Extensive experiments, together with the theoretical guarantee, justify the merits of Efficient-Adam.
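The two-way quantization with two-way error feedback described in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm: the abstract does not specify the quantizer or the exact update rules, so the scaled-sign quantizer, the class name `ErrorFeedbackCompressor`, and the function `communication_round` are all hypothetical choices made for illustration. The sketch shows only the general mechanism: each sender adds its stored quantization residual to the next message before quantizing, and the same mechanism is applied on the worker-to-server and server-to-worker links.

```python
from typing import List

Vector = List[float]


def scaled_sign(v: Vector) -> Vector:
    """Hypothetical quantizer: keep each sign, scaled by the mean magnitude."""
    scale = sum(abs(x) for x in v) / len(v)
    return [scale if x >= 0 else -scale for x in v]


class ErrorFeedbackCompressor:
    """Quantizes a vector and carries the quantization residual forward."""

    def __init__(self, dim: int) -> None:
        self.residual: Vector = [0.0] * dim

    def compress(self, v: Vector) -> Vector:
        # Add back what earlier quantization steps failed to transmit.
        corrected = [x + e for x, e in zip(v, self.residual)]
        q = scaled_sign(corrected)
        # Store the new residual so it can be re-sent in a later round.
        self.residual = [c - qi for c, qi in zip(corrected, q)]
        return q


def communication_round(
    worker_updates: List[Vector],
    worker_ef: List[ErrorFeedbackCompressor],
    server_ef: ErrorFeedbackCompressor,
) -> Vector:
    """One round of two-way compressed communication (illustrative only)."""
    # Uplink: every worker sends a quantized, error-corrected update.
    uplink = [ef.compress(u) for ef, u in zip(worker_ef, worker_updates)]
    n = len(uplink)
    averaged = [sum(column) / n for column in zip(*uplink)]
    # Downlink: the server quantizes the average with its own error feedback.
    return server_ef.compress(averaged)


if __name__ == "__main__":
    dim = 3
    workers = [ErrorFeedbackCompressor(dim) for _ in range(2)]
    server = ErrorFeedbackCompressor(dim)
    updates = [[1.0, -2.0, 0.5], [0.25, 0.75, -1.0]]
    print(communication_round(updates, workers, server))
```

In this sketch, the sum of all transmitted vectors plus the current residual always equals the sum of all inputs, which is why error feedback limits the accumulated bias of a lossy quantizer. How the compressed messages interact with the Adam moment estimates is left out here because the abstract does not describe it.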

Comments:	IEEE Transactions on Signal Processing
Subjects:	Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC); Optimization and Control (math.OC)
Cite as:	arXiv:2205.14473 [cs.LG]
	(or arXiv:2205.14473v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2205.14473

Submission history

From: Li Shen
[v1] Sat, 28 May 2022 16:17:52 UTC (1,469 KB)
[v2] Thu, 24 Aug 2023 09:44:29 UTC (4,203 KB)
