{"title":"雾环境下分布式机器学习框架的比较:概念和性能分析","authors":"Anusri Sanyadanam, Satish Narayana Srirama","doi":"10.1016/j.iot.2025.101774","DOIUrl":null,"url":null,"abstract":"<div><div>The growing demand for real-time, latency-sensitive, and privacy-preserving analytics in IoT has brought fog computing as an alternative to cloud-based processing. However, training machine learning and deep learning (ML/DL) models in fog environments remains challenging due to limited computational resources. Despite the availability of numerous distributed ML frameworks, there is a lack of a comprehensive evaluation tailored to fog devices. This study conducts a comparative analysis of distributed ML frameworks for neural network training on resource-constrained fog nodes, using Raspberry Pi (RPi) devices. We started with Actor programming model-based frameworks and the study extended to general purpose distributed frameworks suitable for fog computing devices. We evaluate four actor-model-based frameworks (Akkordeon, DistBelief with Akka, Aktorain, and CANTO) along with general-purpose distributed frameworks (KubeRay, TensorFlow MultiWorkerMirroredStrategy (MWMS), Dask Distributed and Spark with Elephas). The frameworks are compared across key metrics including training time, accuracy, and resource utilization on diverse datasets. Our results highlight performance trade-offs: KubeRay offers a balance between efficiency and performance, Dask and MWMS achieve higher accuracy with increased latency, while Spark with Elephas excels in speed but struggles with accuracy. Although CANTO is optimized for fog-based training, it faces challenges with complex datasets. Overall, KubeRay emerges as the most practical choice for fog-based ML training because of its additional support for scalability and fault tolerance. This work bridges a critical research gap by providing experimental insights into the feasibility and performance of distributed ML frameworks in fog computing environments.</div></div>","PeriodicalId":29968,"journal":{"name":"Internet of Things","volume":"34 ","pages":"Article 101774"},"PeriodicalIF":7.6000,"publicationDate":"2025-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Comparison of distributed Machine Learning frameworks in a fog environment: Conceptual and Performance analysis\",\"authors\":\"Anusri Sanyadanam, Satish Narayana Srirama\",\"doi\":\"10.1016/j.iot.2025.101774\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The growing demand for real-time, latency-sensitive, and privacy-preserving analytics in IoT has brought fog computing as an alternative to cloud-based processing. However, training machine learning and deep learning (ML/DL) models in fog environments remains challenging due to limited computational resources. Despite the availability of numerous distributed ML frameworks, there is a lack of a comprehensive evaluation tailored to fog devices. This study conducts a comparative analysis of distributed ML frameworks for neural network training on resource-constrained fog nodes, using Raspberry Pi (RPi) devices. We started with Actor programming model-based frameworks and the study extended to general purpose distributed frameworks suitable for fog computing devices. We evaluate four actor-model-based frameworks (Akkordeon, DistBelief with Akka, Aktorain, and CANTO) along with general-purpose distributed frameworks (KubeRay, TensorFlow MultiWorkerMirroredStrategy (MWMS), Dask Distributed and Spark with Elephas). The frameworks are compared across key metrics including training time, accuracy, and resource utilization on diverse datasets. Our results highlight performance trade-offs: KubeRay offers a balance between efficiency and performance, Dask and MWMS achieve higher accuracy with increased latency, while Spark with Elephas excels in speed but struggles with accuracy. Although CANTO is optimized for fog-based training, it faces challenges with complex datasets. Overall, KubeRay emerges as the most practical choice for fog-based ML training because of its additional support for scalability and fault tolerance. This work bridges a critical research gap by providing experimental insights into the feasibility and performance of distributed ML frameworks in fog computing environments.</div></div>\",\"PeriodicalId\":29968,\"journal\":{\"name\":\"Internet of Things\",\"volume\":\"34 \",\"pages\":\"Article 101774\"},\"PeriodicalIF\":7.6000,\"publicationDate\":\"2025-09-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Internet of Things\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2542660525002884\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Internet of Things","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2542660525002884","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
Comparison of distributed Machine Learning frameworks in a fog environment: Conceptual and Performance analysis
The growing demand for real-time, latency-sensitive, and privacy-preserving analytics in IoT has brought fog computing as an alternative to cloud-based processing. However, training machine learning and deep learning (ML/DL) models in fog environments remains challenging due to limited computational resources. Despite the availability of numerous distributed ML frameworks, there is a lack of a comprehensive evaluation tailored to fog devices. This study conducts a comparative analysis of distributed ML frameworks for neural network training on resource-constrained fog nodes, using Raspberry Pi (RPi) devices. We started with Actor programming model-based frameworks and the study extended to general purpose distributed frameworks suitable for fog computing devices. We evaluate four actor-model-based frameworks (Akkordeon, DistBelief with Akka, Aktorain, and CANTO) along with general-purpose distributed frameworks (KubeRay, TensorFlow MultiWorkerMirroredStrategy (MWMS), Dask Distributed and Spark with Elephas). The frameworks are compared across key metrics including training time, accuracy, and resource utilization on diverse datasets. Our results highlight performance trade-offs: KubeRay offers a balance between efficiency and performance, Dask and MWMS achieve higher accuracy with increased latency, while Spark with Elephas excels in speed but struggles with accuracy. Although CANTO is optimized for fog-based training, it faces challenges with complex datasets. Overall, KubeRay emerges as the most practical choice for fog-based ML training because of its additional support for scalability and fault tolerance. This work bridges a critical research gap by providing experimental insights into the feasibility and performance of distributed ML frameworks in fog computing environments.
期刊介绍:
Internet of Things; Engineering Cyber Physical Human Systems is a comprehensive journal encouraging cross collaboration between researchers, engineers and practitioners in the field of IoT & Cyber Physical Human Systems. The journal offers a unique platform to exchange scientific information on the entire breadth of technology, science, and societal applications of the IoT.
The journal will place a high priority on timely publication, and provide a home for high quality.
Furthermore, IOT is interested in publishing topical Special Issues on any aspect of IOT.