为异构客户提供联合的自我监督学习

论文标题

为异构客户提供联合的自我监督学习

Federated Self-supervised Learning for Heterogeneous Clients

论文作者

Makhija, Disha, Ho, Nhat, Ghosh, Joydeep

论文摘要

由于其隐私和计算益处，联合学习已成为重要的学习范式。随着领域的发展，仍然有两个关键的挑战仍有待解决的问题是：（1）系统异质性 - 每个客户端中存在的计算和/或数据资源的可变性，以及（2）在某些联合设置中缺乏标记的数据。最近的一些发展试图独立克服这些挑战。在这项工作中，我们提出了一个统一的，系统的框架，即\ emph {异质自我监督联盟学习}（Hetero-SSFL），以实现与异质客户联邦的自我监督学习。所提出的框架允许在所有客户端进行协作表示学习，而无需施加架构约束或需要标记的数据。 Hetero-SSFL中的关键思想是让每个客户培训其独特的自我监督模型，并通过将公共数据集中的较低维表示来使客户进行联合学习。整个培训程序都可以视为自我，并被视为当地培训和对齐程序，不需要任何标记的数据。与常规的自我监督学习一样，获得的客户模型是独立的，可用于各种终端任务。我们提供了在异质环境中非凸目标的拟议框架的融合保证，并从经验上证明，我们所提出的方法的表现优于最大的方法。

Federated Learning has become an important learning paradigm due to its privacy and computational benefits. As the field advances, two key challenges that still remain to be addressed are: (1) system heterogeneity - variability in the compute and/or data resources present on each client, and (2) lack of labeled data in certain federated settings. Several recent developments have tried to overcome these challenges independently. In this work, we propose a unified and systematic framework, \emph{Heterogeneous Self-supervised Federated Learning} (Hetero-SSFL) for enabling self-supervised learning with federation on heterogeneous clients. The proposed framework allows collaborative representation learning across all the clients without imposing architectural constraints or requiring presence of labeled data. The key idea in Hetero-SSFL is to let each client train its unique self-supervised model and enable the joint learning across clients by aligning the lower dimensional representations on a common dataset. The entire training procedure could be viewed as self and peer-supervised as both the local training and the alignment procedures do not require presence of any labeled data. As in conventional self-supervised learning, the obtained client models are task independent and can be used for varied end-tasks. We provide a convergence guarantee of the proposed framework for non-convex objectives in heterogeneous settings and also empirically demonstrate that our proposed approach outperforms the state of the art methods by a significant margin.

下载PDF全文

下载文献需遵守相关版权规定

论文标题