Incremental federated learning for traffic flow classification in heterogeneous data scenarios

Adrian Pekar, Laszlo Arpad Makara, Gergely Biczok
{"title":"Incremental federated learning for traffic flow classification in heterogeneous data scenarios","authors":"Adrian Pekar, Laszlo Arpad Makara, Gergely Biczok","doi":"10.1007/s00521-024-10281-4","DOIUrl":null,"url":null,"abstract":"<p>This paper explores the comparative analysis of federated learning (FL) and centralized learning (CL) models in the context of multi-class traffic flow classification for network applications, a timely study in the context of increasing privacy preservation concerns. Unlike existing literature that often omits detailed class-wise performance evaluation, and consistent data handling and feature selection approaches, our study rectifies these gaps by implementing a feed-forward neural network and assessing FL performance under both independent and identically distributed (IID) and non-independent and identically distributed (non-IID) conditions, with a particular focus on incremental training. In our cross-silo experimental setup involving five clients per round, FL models exhibit notable adaptability. Under IID conditions, the accuracy of the FL model peaked at 96.65%, demonstrating its robustness. Moreover, despite the challenges presented by non-IID environments, our FL models demonstrated significant resilience, adapting incrementally over rounds to optimize performance; in most scenarios, our FL models performed comparably to the idealistic CL model regarding multiple well-established metrics. Through a comprehensive traffic flow classification use case, this work (i) contributes to a better understanding of the capabilities and limitations of FL, offering valuable insights for the real-world deployment of FL, and (ii) provides a novel, large, carefully curated traffic flow dataset for the research community.</p>","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":"60 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neural Computing and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s00521-024-10281-4","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

This paper explores the comparative analysis of federated learning (FL) and centralized learning (CL) models in the context of multi-class traffic flow classification for network applications, a timely study in the context of increasing privacy preservation concerns. Unlike existing literature that often omits detailed class-wise performance evaluation, and consistent data handling and feature selection approaches, our study rectifies these gaps by implementing a feed-forward neural network and assessing FL performance under both independent and identically distributed (IID) and non-independent and identically distributed (non-IID) conditions, with a particular focus on incremental training. In our cross-silo experimental setup involving five clients per round, FL models exhibit notable adaptability. Under IID conditions, the accuracy of the FL model peaked at 96.65%, demonstrating its robustness. Moreover, despite the challenges presented by non-IID environments, our FL models demonstrated significant resilience, adapting incrementally over rounds to optimize performance; in most scenarios, our FL models performed comparably to the idealistic CL model regarding multiple well-established metrics. Through a comprehensive traffic flow classification use case, this work (i) contributes to a better understanding of the capabilities and limitations of FL, offering valuable insights for the real-world deployment of FL, and (ii) provides a novel, large, carefully curated traffic flow dataset for the research community.

Abstract Image

异构数据场景中交通流分类的增量联合学习
本文探讨了联合学习(FL)和集中学习(CL)模型在网络应用的多类流量分类中的比较分析,在隐私保护日益受到关注的背景下,这是一项适时的研究。现有文献往往忽略了详细的分类性能评估以及一致的数据处理和特征选择方法,与之不同的是,我们的研究通过实施前馈神经网络和评估独立且同分布(IID)和非独立且同分布(非 IID)条件下的 FL 性能来纠正这些缺陷,并特别关注增量训练。在我们的跨ilo 实验设置中,每轮涉及五个客户端,FL 模型表现出显著的适应性。在 IID 条件下,FL 模型的准确率达到了 96.65% 的峰值,证明了它的鲁棒性。此外,尽管非 IID 环境带来了挑战,但我们的 FL 模型仍表现出了很强的适应能力,可在各轮中逐步调整以优化性能;在大多数情况下,我们的 FL 模型在多个成熟指标方面的表现与理想化的 CL 模型相当。通过一个全面的交通流分类使用案例,这项工作(i)有助于更好地理解 FL 的能力和局限性,为 FL 在现实世界中的部署提供了宝贵的见解,(ii)为研究界提供了一个新颖、大型、精心策划的交通流数据集。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信