TY - GEN
T1 - Communication-Efficient Federated Learning via Sparse Training with Regularized Error Correction
AU - Greidi, Ran
AU - Cohen, Kobi
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024/1/1
Y1 - 2024/1/1
N2 - Federated Learning (FL) is an emerging paradigm that allows for decentralized machine learning (ML), where multiple models are collaboratively trained in a privacy-preserving manner. However, since communication and computation resources are limited, training deep neural network (DNN) models in FL systems faces challenges such as elevated computational and communication costs in complex tasks. Sparse training schemes have gained increasing attention as a way to scale down the dimensionality of each client (i.e., node) transmission. Specifically, sparsification with error correction is a promising technique, where only the important updates are sent to the parameter server (PS) and the rest are accumulated locally. While error correction methods have been shown to achieve a significant sparsification level of the client-to-PS messages without harming convergence, pushing sparsity further remains unresolved due to the staleness effect. In this paper, we propose a novel algorithm, dubbed Federated Learning with Accumulated Regularized Embeddings (FLARE), to overcome this challenge. FLARE presents a novel sparse training approach via accumulated pulling of the updated models with regularization on the embeddings in the FL process, providing a powerful solution to the staleness effect and pushing sparsity to an exceptional level. Our theoretical analysis demonstrates that FLARE's regularized error feedback achieves significant improvements in scalability with the sparsity parameter. The performance of FLARE is validated through experiments on diverse and complex models, achieving a remarkable sparsity level (10 times and more beyond the current state of the art) along with significantly improved accuracy.
AB - Federated Learning (FL) is an emerging paradigm that allows for decentralized machine learning (ML), where multiple models are collaboratively trained in a privacy-preserving manner. However, since communication and computation resources are limited, training deep neural network (DNN) models in FL systems faces challenges such as elevated computational and communication costs in complex tasks. Sparse training schemes have gained increasing attention as a way to scale down the dimensionality of each client (i.e., node) transmission. Specifically, sparsification with error correction is a promising technique, where only the important updates are sent to the parameter server (PS) and the rest are accumulated locally. While error correction methods have been shown to achieve a significant sparsification level of the client-to-PS messages without harming convergence, pushing sparsity further remains unresolved due to the staleness effect. In this paper, we propose a novel algorithm, dubbed Federated Learning with Accumulated Regularized Embeddings (FLARE), to overcome this challenge. FLARE presents a novel sparse training approach via accumulated pulling of the updated models with regularization on the embeddings in the FL process, providing a powerful solution to the staleness effect and pushing sparsity to an exceptional level. Our theoretical analysis demonstrates that FLARE's regularized error feedback achieves significant improvements in scalability with the sparsity parameter. The performance of FLARE is validated through experiments on diverse and complex models, achieving a remarkable sparsity level (10 times and more beyond the current state of the art) along with significantly improved accuracy.
KW - communication-efficiency
KW - Deep learning
KW - deep neural network (DNN)
KW - federated learning (FL)
KW - sparse training
UR - http://www.scopus.com/inward/record.url?scp=85211149965&partnerID=8YFLogxK
U2 - 10.1109/Allerton63246.2024.10735294
DO - 10.1109/Allerton63246.2024.10735294
M3 - Conference contribution
AN - SCOPUS:85211149965
T3 - 2024 60th Annual Allerton Conference on Communication, Control, and Computing, Allerton 2024
BT - 2024 60th Annual Allerton Conference on Communication, Control, and Computing, Allerton 2024
PB - Institute of Electrical and Electronics Engineers
T2 - 60th Annual Allerton Conference on Communication, Control, and Computing, Allerton 2024
Y2 - 24 September 2024 through 27 September 2024
ER -
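
N1 - Illustrative note: the abstract describes client-side sparsification with locally accumulated error correction. The following is a minimal, generic Python sketch of top-k sparsification with error feedback, not the authors' FLARE implementation; the regularization shrinkage applied to the carried-over residual (reg_lambda) is a hypothetical stand-in for FLARE's regularized error feedback, whose exact form is defined in the paper.

    # Generic top-k sparsification with local error accumulation (error feedback).
    # Not the FLARE algorithm itself; a sketch of the error-correction idea it builds on.
    import numpy as np

    def sparsify_with_error_feedback(update, residual, k, reg_lambda=0.0):
        """Return a k-sparse client-to-PS message and the updated local residual.

        update:     dense model update computed by the client this round
        residual:   locally accumulated error from previous rounds
        k:          number of coordinates sent to the parameter server (PS)
        reg_lambda: hypothetical shrinkage on the carried-over residual (assumed form)
        """
        corrected = update + residual                       # re-inject stale error
        idx = np.argpartition(np.abs(corrected), -k)[-k:]   # top-k by magnitude
        message = np.zeros_like(corrected)
        message[idx] = corrected[idx]                       # sparse message to the PS
        new_residual = corrected - message                  # accumulate the rest locally
        new_residual *= 1.0 / (1.0 + reg_lambda)            # shrink residual (illustrative)
        return message, new_residual

    # Example: send 1% of a 10,000-dimensional update
    rng = np.random.default_rng(0)
    update = rng.normal(size=10_000)
    residual = np.zeros_like(update)
    msg, residual = sparsify_with_error_feedback(update, residual, k=100, reg_lambda=0.1)
    print(f"nonzeros sent: {np.count_nonzero(msg)} / {msg.size}")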