pith. sign in

arxiv: 2407.02610 · v2 · pith:OFVGI5TXnew · submitted 2024-07-02 · 💻 cs.LG · cs.DC

Towards Federated Learning with On-device Training and Communication in 8-bit Floating Point

classification 💻 cs.LG cs.DC
keywords trainingcommunicationfp32learningcomparedfederatedfloatingmethod
0
0 comments X
read the original abstract

Recent work has shown that 8-bit floating point (FP8) can be used for efficiently training neural networks with reduced computational cost compared to training in FP32/FP16. In this work, we investigate the use of FP8 training in a federated learning context. This approach brings not only the usual benefits of FP8 which are desirable for on-device training at the edge, but also reduces client-server communication costs due to significant weight compression. We present a novel method for combining FP8 client training while maintaining a global FP32 server model and provide convergence analysis. Experiments with various machine learning models and datasets show that our method consistently yields communication reductions of at least 2.9x across a variety of tasks and models compared to an FP32 baseline to achieve the same trained model accuracy.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.