AF-DNDF: Asynchronous Federated Learning of Deep Neural Decision Forests

2021 47th Euromicro Conference on Software Engineering and Advanced Applications (SEAA) Pub Date : 2021-09-01 DOI:10.1109/SEAA53835.2021.00047

Hongyi Zhang, J. Bosch, H. H. Olsson, Ashok Chaitanya Koppisetty

{"title":"AF-DNDF: Asynchronous Federated Learning of Deep Neural Decision Forests","authors":"Hongyi Zhang, J. Bosch, H. H. Olsson, Ashok Chaitanya Koppisetty","doi":"10.1109/SEAA53835.2021.00047","DOIUrl":null,"url":null,"abstract":"In recent years, with more edge devices being put into use, the amount of data that is created, transmitted and stored is increasing exponentially. Moreover, due to the development of machine learning algorithms, modern software-intensive systems are able to take advantage of the data to further improve their service quality. However, it is expensive and inefficient to transmit large amounts of data to a central location for the purpose of training and deploying machine learning models. Data transfer from edge devices across the globe to central locations may also raise privacy and concerns related to local data regulations. As a distributed learning approach, Federated Learning has been introduced to tackle those challenges. Since Federated Learning simply exchanges locally trained machine learning models rather than the entire data set throughout the training process, the method not only protects user data privacy but also improves model training efficiency. In this paper, we have investigated an advanced machine learning algorithm, Deep Neural Decision Forests (DNDF), which unites classification trees with the representation learning functionality from deep convolutional neural networks. In this paper, we propose a novel algorithm, AF-DNDF which extends DNDF with an asynchronous federated aggregation protocol. Based on the local quality of each classification tree, our architecture can select and combine the optimal groups of decision trees from multiple local devices. The introduction of the asynchronous protocol enables the algorithm to be deployed in the industrial context with heterogeneous hardware settings. Our AF-DNDF architecture is validated in an automotive industrial use case focusing on road objects recognition and demonstrated by an empirical experiment with two different data sets. The experimental results show that our AF-DNDF algorithm significantly reduces the communication overhead and accelerates model training speed without sacrificing model classification performance. The algorithm can reach the same classification accuracy as the commonly used centralized machine learning methods but also greatly improve local edge model quality.","PeriodicalId":435977,"journal":{"name":"2021 47th Euromicro Conference on Software Engineering and Advanced Applications (SEAA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 47th Euromicro Conference on Software Engineering and Advanced Applications (SEAA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SEAA53835.2021.00047","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

In recent years, with more edge devices being put into use, the amount of data that is created, transmitted and stored is increasing exponentially. Moreover, due to the development of machine learning algorithms, modern software-intensive systems are able to take advantage of the data to further improve their service quality. However, it is expensive and inefficient to transmit large amounts of data to a central location for the purpose of training and deploying machine learning models. Data transfer from edge devices across the globe to central locations may also raise privacy and concerns related to local data regulations. As a distributed learning approach, Federated Learning has been introduced to tackle those challenges. Since Federated Learning simply exchanges locally trained machine learning models rather than the entire data set throughout the training process, the method not only protects user data privacy but also improves model training efficiency. In this paper, we have investigated an advanced machine learning algorithm, Deep Neural Decision Forests (DNDF), which unites classification trees with the representation learning functionality from deep convolutional neural networks. In this paper, we propose a novel algorithm, AF-DNDF which extends DNDF with an asynchronous federated aggregation protocol. Based on the local quality of each classification tree, our architecture can select and combine the optimal groups of decision trees from multiple local devices. The introduction of the asynchronous protocol enables the algorithm to be deployed in the industrial context with heterogeneous hardware settings. Our AF-DNDF architecture is validated in an automotive industrial use case focusing on road objects recognition and demonstrated by an empirical experiment with two different data sets. The experimental results show that our AF-DNDF algorithm significantly reduces the communication overhead and accelerates model training speed without sacrificing model classification performance. The algorithm can reach the same classification accuracy as the commonly used centralized machine learning methods but also greatly improve local edge model quality.

查看原文本刊更多论文

AF-DNDF:深度神经决策森林的异步联邦学习

近年来，随着越来越多的边缘设备投入使用，产生、传输和存储的数据量呈指数级增长。此外，由于机器学习算法的发展，现代软件密集型系统能够利用数据进一步提高其服务质量。然而，为了训练和部署机器学习模型，将大量数据传输到一个中心位置是昂贵和低效的。从全球边缘设备到中心位置的数据传输也可能引起与本地数据法规相关的隐私和担忧。作为一种分布式学习方法，联邦学习已经被引入来解决这些挑战。由于联邦学习在整个训练过程中只是交换本地训练的机器学习模型，而不是交换整个数据集，因此该方法既保护了用户数据隐私，又提高了模型训练效率。在本文中，我们研究了一种先进的机器学习算法，深度神经决策森林(DNDF)，它将分类树与深度卷积神经网络的表示学习功能结合起来。在本文中，我们提出了一种新的算法AF-DNDF，它使用异步联邦聚合协议扩展了DNDF。基于每个分类树的局部质量，我们的架构可以从多个本地设备中选择和组合最优的决策树组。异步协议的引入使该算法能够部署在具有异构硬件设置的工业环境中。我们的AF-DNDF架构在一个专注于道路物体识别的汽车工业用例中得到了验证，并通过两个不同数据集的实证实验进行了验证。实验结果表明，AF-DNDF算法在不牺牲模型分类性能的前提下，显著降低了通信开销，加快了模型训练速度。该算法在达到与常用的集中式机器学习方法相同的分类精度的同时，也大大提高了局部边缘模型的质量。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 47th Euromicro Conference on Software Engineering and Advanced Applications (SEAA)

自引率

0.00%

发文量