Abstract
Federated learning (FL) is one of the popular distributed machine learning(ML) solutions but incurs significant communication and computation costs atedge devices. Federated split learning (FSL) can train sub-models in paralleland reduce the computational burden of edge devices by splitting the modelarchitecture. However, it still requires a high communication overhead due totransmitting the smashed data and gradients between clients and the server inevery global round. Furthermore, the server must maintain separate partialmodels for every client, leading to a significant storage requirement. Toaddress these challenges, this paper proposes a novel communication and storageefficient federated split learning method, termed CSE-FSL, which utilizes anauxiliary network to locally update the weights of the clients while keeping asingle model at the server, hence avoiding frequent transmissions of gradientsfrom the server and greatly reducing the storage requirement of the server.Additionally, a new model update method of transmitting the smashed data inselected epochs can reduce the amount of smashed data sent from the clients. Weprovide a theoretical analysis of CSE-FSL, rigorously guaranteeing itsconvergence under non-convex loss functions. The extensive experimental resultsfurther indicate that CSE-FSL achieves a significant communication reductionover existing FSL solutions using real-world FL tasks.