Abstract
Federated clustering, an essential extension of centralized clustering for federated scenarios, enables multiple data-holding clients to collaboratively group data while keeping their data local. In centralized scenarios, clustering driven by representation learning has made significant advances in handling high-dimensional complex data. However, the combination of federated clustering and representation learning remains underexplored. To bridge this gap, we first tailor a cluster-contrastive model for learning clustering-friendly representations. Then, we use this model as the foundation for a new federated clustering method, named cluster-contrastive federated clustering (CCFC). Benefiting from representation learning, the clustering performance of CCFC is even double that of the best baseline methods in some cases. Compared with the most closely related baseline, this benefit yields substantial NMI score improvements of up to 0.4155 in the most conspicuous case. Moreover, from a practical viewpoint, CCFC also shows superior performance in handling device failures.