Haiyan Cui, Junping Du, Yang Jiang, Yue Wang, Runyu Yu
{"title":"基于知识蒸馏和深度梯度压缩的联邦学习方法","authors":"Haiyan Cui, Junping Du, Yang Jiang, Yue Wang, Runyu Yu","doi":"10.1109/CCIS53392.2021.9754651","DOIUrl":null,"url":null,"abstract":"Federated learning is a new type of multi-agency collaborative training model paradigm, which is widely used in many fields, among which communication overhead is a key issue. In order to reduce the amount of data transmitted in the communication process, we propose a federated learning algorithm based on knowledge distillation and deep gradient compression (Fed-KDDGC-SGD). First, we use local data on the client to train the teacher network, and then use the soft labels generated by the teacher network to train the student network and upload the gradient to the central server during the training process. In order to further reduce the communication bandwidth occupied by sending the gradient, the deep gradient compression algorithm is used to compress the gradient vector, and only the gradient value of the top R% of the absolute value is sent. The experimental results show that the improved federated learning algorithm effectively reduces the communication overhead and has certain practical significance.","PeriodicalId":191226,"journal":{"name":"2021 IEEE 7th International Conference on Cloud Computing and Intelligent Systems (CCIS)","volume":"74 1-2","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Federated Learning Method Based on Knowledge Distillation and Deep Gradient Compression\",\"authors\":\"Haiyan Cui, Junping Du, Yang Jiang, Yue Wang, Runyu Yu\",\"doi\":\"10.1109/CCIS53392.2021.9754651\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Federated learning is a new type of multi-agency collaborative training model paradigm, which is widely used in many fields, among which communication overhead is a key issue. In order to reduce the amount of data transmitted in the communication process, we propose a federated learning algorithm based on knowledge distillation and deep gradient compression (Fed-KDDGC-SGD). First, we use local data on the client to train the teacher network, and then use the soft labels generated by the teacher network to train the student network and upload the gradient to the central server during the training process. In order to further reduce the communication bandwidth occupied by sending the gradient, the deep gradient compression algorithm is used to compress the gradient vector, and only the gradient value of the top R% of the absolute value is sent. The experimental results show that the improved federated learning algorithm effectively reduces the communication overhead and has certain practical significance.\",\"PeriodicalId\":191226,\"journal\":{\"name\":\"2021 IEEE 7th International Conference on Cloud Computing and Intelligent Systems (CCIS)\",\"volume\":\"74 1-2\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-11-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE 7th International Conference on Cloud Computing and Intelligent Systems (CCIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CCIS53392.2021.9754651\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 7th International Conference on Cloud Computing and Intelligent Systems (CCIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCIS53392.2021.9754651","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Federated Learning Method Based on Knowledge Distillation and Deep Gradient Compression
Federated learning is a new type of multi-agency collaborative training model paradigm, which is widely used in many fields, among which communication overhead is a key issue. In order to reduce the amount of data transmitted in the communication process, we propose a federated learning algorithm based on knowledge distillation and deep gradient compression (Fed-KDDGC-SGD). First, we use local data on the client to train the teacher network, and then use the soft labels generated by the teacher network to train the student network and upload the gradient to the central server during the training process. In order to further reduce the communication bandwidth occupied by sending the gradient, the deep gradient compression algorithm is used to compress the gradient vector, and only the gradient value of the top R% of the absolute value is sent. The experimental results show that the improved federated learning algorithm effectively reduces the communication overhead and has certain practical significance.