Incremental federated learning for traffic flow classification in heterogeneous data scenarios

Neural Computing and Applications Pub Date : 2024-08-12 DOI:10.1007/s00521-024-10281-4

Adrian Pekar, Laszlo Arpad Makara, Gergely Biczok

{"title":"Incremental federated learning for traffic flow classification in heterogeneous data scenarios","authors":"Adrian Pekar, Laszlo Arpad Makara, Gergely Biczok","doi":"10.1007/s00521-024-10281-4","DOIUrl":null,"url":null,"abstract":"<p>This paper explores the comparative analysis of federated learning (FL) and centralized learning (CL) models in the context of multi-class traffic flow classification for network applications, a timely study in the context of increasing privacy preservation concerns. Unlike existing literature that often omits detailed class-wise performance evaluation, and consistent data handling and feature selection approaches, our study rectifies these gaps by implementing a feed-forward neural network and assessing FL performance under both independent and identically distributed (IID) and non-independent and identically distributed (non-IID) conditions, with a particular focus on incremental training. In our cross-silo experimental setup involving five clients per round, FL models exhibit notable adaptability. Under IID conditions, the accuracy of the FL model peaked at 96.65%, demonstrating its robustness. Moreover, despite the challenges presented by non-IID environments, our FL models demonstrated significant resilience, adapting incrementally over rounds to optimize performance; in most scenarios, our FL models performed comparably to the idealistic CL model regarding multiple well-established metrics. Through a comprehensive traffic flow classification use case, this work (i) contributes to a better understanding of the capabilities and limitations of FL, offering valuable insights for the real-world deployment of FL, and (ii) provides a novel, large, carefully curated traffic flow dataset for the research community.</p>","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":"60 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neural Computing and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s00521-024-10281-4","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

This paper explores the comparative analysis of federated learning (FL) and centralized learning (CL) models in the context of multi-class traffic flow classification for network applications, a timely study in the context of increasing privacy preservation concerns. Unlike existing literature that often omits detailed class-wise performance evaluation, and consistent data handling and feature selection approaches, our study rectifies these gaps by implementing a feed-forward neural network and assessing FL performance under both independent and identically distributed (IID) and non-independent and identically distributed (non-IID) conditions, with a particular focus on incremental training. In our cross-silo experimental setup involving five clients per round, FL models exhibit notable adaptability. Under IID conditions, the accuracy of the FL model peaked at 96.65%, demonstrating its robustness. Moreover, despite the challenges presented by non-IID environments, our FL models demonstrated significant resilience, adapting incrementally over rounds to optimize performance; in most scenarios, our FL models performed comparably to the idealistic CL model regarding multiple well-established metrics. Through a comprehensive traffic flow classification use case, this work (i) contributes to a better understanding of the capabilities and limitations of FL, offering valuable insights for the real-world deployment of FL, and (ii) provides a novel, large, carefully curated traffic flow dataset for the research community.

Abstract Image

查看原文本刊更多论文

异构数据场景中交通流分类的增量联合学习

本文探讨了联合学习（FL）和集中学习（CL）模型在网络应用的多类流量分类中的比较分析，在隐私保护日益受到关注的背景下，这是一项适时的研究。现有文献往往忽略了详细的分类性能评估以及一致的数据处理和特征选择方法，与之不同的是，我们的研究通过实施前馈神经网络和评估独立且同分布（IID）和非独立且同分布（非 IID）条件下的 FL 性能来纠正这些缺陷，并特别关注增量训练。在我们的跨ilo 实验设置中，每轮涉及五个客户端，FL 模型表现出显著的适应性。在 IID 条件下，FL 模型的准确率达到了 96.65% 的峰值，证明了它的鲁棒性。此外，尽管非 IID 环境带来了挑战，但我们的 FL 模型仍表现出了很强的适应能力，可在各轮中逐步调整以优化性能；在大多数情况下，我们的 FL 模型在多个成熟指标方面的表现与理想化的 CL 模型相当。通过一个全面的交通流分类使用案例，这项工作（i）有助于更好地理解 FL 的能力和局限性，为 FL 在现实世界中的部署提供了宝贵的见解，（ii）为研究界提供了一个新颖、大型、精心策划的交通流数据集。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Neural Computing and Applications

自引率

0.00%

发文量