Cluster-driven Graph Federated Learning over Multiple Domains
Autor: | Debora Caldarola, Massimiliano Mancini, Fabio Galasso, Barbara Caputo, Emanuele Rodolà, Marco Ciccone |
---|---|
Jazyk: | angličtina |
Rok vydání: | 2021 |
Předmět: |
FOS: Computer and information sciences
Computer Science - Machine Learning Theoretical computer science Computer science business.industry Computer Vision and Pattern Recognition (cs.CV) Knowledge engineering Federated learning Computer Science - Computer Vision and Pattern Recognition Machine learning Clustering Graph Convolutional Networks Data modeling Domain (software engineering) Machine Learning (cs.LG) Machine Learning Study heterogeneity Computer Science - Learning Cluster (physics) Graph (abstract data type) Artificial intelligence business Cluster analysis |
Zdroj: | 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) CVPR Workshops |
Popis: | Federated Learning (FL) deals with learning a central model (i.e. the server) in privacy-constrained scenarios, where data are stored on multiple devices (i.e. the clients). The central model has no direct access to the data, but only to the updates of the parameters computed locally by each client. This raises a problem, known as statistical heterogeneity, because the clients may have different data distributions (i.e. domains). This is only partly alleviated by clustering the clients. Clustering may reduce heterogeneity by identifying the domains, but it deprives each cluster model of the data and supervision of others. Here we propose a novel Cluster-driven Graph Federated Learning (FedCG). In FedCG, clustering serves to address statistical heterogeneity, while Graph Convolutional Networks (GCNs) enable sharing knowledge across them. FedCG: i) identifies the domains via an FL-compliant clustering and instantiates domain-specific modules (residual branches) for each domain; ii) connects the domain-specific modules through a GCN at training to learn the interactions among domains and share knowledge; and iii) learns to cluster unsupervised via teacher-student classifier-training iterations and to address novel unseen test domains via their domain soft-assignment scores. Thanks to the unique interplay of GCN over clusters, FedCG achieves the state-of-the-art on multiple FL benchmarks. Comment: Accepted to CVPR21 Workshop Learning from Limited or Imperfect Data (L^2ID) |
Databáze: | OpenAIRE |
Externí odkaz: |