T. D. Luong, Vuong Minh Tien, Hoang Anh, Ngan Van Luyen, Nguyen Chi Vy, Phan The Duy, V. Pham
{"title":"FedChain:使用区块链和联邦学习构建人工智能模型的协作框架","authors":"T. D. Luong, Vuong Minh Tien, Hoang Anh, Ngan Van Luyen, Nguyen Chi Vy, Phan The Duy, V. Pham","doi":"10.1109/NICS54270.2021.9701450","DOIUrl":null,"url":null,"abstract":"Machine learning (ML) has been drawn to attention from both academia and industry thanks to outstanding advances and its potential in many fields. Nevertheless, data collection for training models is a difficult task since there are many concerns on privacy and data breach reported recently. Data owners or holders are usually hesitant to share their private data. Also, the benefits from analyzing user data are not distributed to users. In addition, due to the lack of incentive mechanism for sharing data, ML builders cannot leverage the massive data from many sources. Thus, this paper introduces a collaborative approach for building artificial intelligence (AI) models, named FedChain to encourage many data owners to cooperate in the training phase without sharing their raw data. It helps data holders ensure privacy preservation for the collaborative training right on their premises, while reducing the computation load in case of centralized training. More specifically, we utilize federated learning (FL)and Hyperledger Sawtooth Blockchain to set up a prototype framework that enables many parties to join, contribute and receive rewards transparently from their training task results. Finally, we conduct experiments of our FedChain on cyber threat intelligence context, where AI model is trained within many organizations on each their private datastore, and then it is used for detecting malicious actions in the network. Experimental results with the CICIDS-2017 dataset prove that the FL-based strategy can help create effective privacy-preserving ML models while taking advantage of diverse data sources from the community.","PeriodicalId":296963,"journal":{"name":"2021 8th NAFOSTED Conference on Information and Computer Science (NICS)","volume":"102 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"FedChain: A Collaborative Framework for Building Artificial Intelligence Models using Blockchain and Federated Learning\",\"authors\":\"T. D. Luong, Vuong Minh Tien, Hoang Anh, Ngan Van Luyen, Nguyen Chi Vy, Phan The Duy, V. Pham\",\"doi\":\"10.1109/NICS54270.2021.9701450\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Machine learning (ML) has been drawn to attention from both academia and industry thanks to outstanding advances and its potential in many fields. Nevertheless, data collection for training models is a difficult task since there are many concerns on privacy and data breach reported recently. Data owners or holders are usually hesitant to share their private data. Also, the benefits from analyzing user data are not distributed to users. In addition, due to the lack of incentive mechanism for sharing data, ML builders cannot leverage the massive data from many sources. Thus, this paper introduces a collaborative approach for building artificial intelligence (AI) models, named FedChain to encourage many data owners to cooperate in the training phase without sharing their raw data. It helps data holders ensure privacy preservation for the collaborative training right on their premises, while reducing the computation load in case of centralized training. More specifically, we utilize federated learning (FL)and Hyperledger Sawtooth Blockchain to set up a prototype framework that enables many parties to join, contribute and receive rewards transparently from their training task results. Finally, we conduct experiments of our FedChain on cyber threat intelligence context, where AI model is trained within many organizations on each their private datastore, and then it is used for detecting malicious actions in the network. Experimental results with the CICIDS-2017 dataset prove that the FL-based strategy can help create effective privacy-preserving ML models while taking advantage of diverse data sources from the community.\",\"PeriodicalId\":296963,\"journal\":{\"name\":\"2021 8th NAFOSTED Conference on Information and Computer Science (NICS)\",\"volume\":\"102 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-12-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 8th NAFOSTED Conference on Information and Computer Science (NICS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/NICS54270.2021.9701450\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 8th NAFOSTED Conference on Information and Computer Science (NICS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NICS54270.2021.9701450","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
FedChain: A Collaborative Framework for Building Artificial Intelligence Models using Blockchain and Federated Learning
Machine learning (ML) has been drawn to attention from both academia and industry thanks to outstanding advances and its potential in many fields. Nevertheless, data collection for training models is a difficult task since there are many concerns on privacy and data breach reported recently. Data owners or holders are usually hesitant to share their private data. Also, the benefits from analyzing user data are not distributed to users. In addition, due to the lack of incentive mechanism for sharing data, ML builders cannot leverage the massive data from many sources. Thus, this paper introduces a collaborative approach for building artificial intelligence (AI) models, named FedChain to encourage many data owners to cooperate in the training phase without sharing their raw data. It helps data holders ensure privacy preservation for the collaborative training right on their premises, while reducing the computation load in case of centralized training. More specifically, we utilize federated learning (FL)and Hyperledger Sawtooth Blockchain to set up a prototype framework that enables many parties to join, contribute and receive rewards transparently from their training task results. Finally, we conduct experiments of our FedChain on cyber threat intelligence context, where AI model is trained within many organizations on each their private datastore, and then it is used for detecting malicious actions in the network. Experimental results with the CICIDS-2017 dataset prove that the FL-based strategy can help create effective privacy-preserving ML models while taking advantage of diverse data sources from the community.