{"title":"更快更强:通过硬件异构释放数据处理潜力","authors":"Cong Wang;Yang Luo;Wenzhuo Du;Ke Wang;Naijie Gu;Jun Yu","doi":"10.1109/JIOT.2025.3526662","DOIUrl":null,"url":null,"abstract":"With the rapid advancement of AI technology, there has been a substantial surge in the need for computational resources. Particularly in deep learning, machine learning, and large-scale data analysis, the processing of extensive datasets necessitates exceptionally high levels of computational efficacy and speed. Conventional homogeneous computing platforms, predominantly reliant on central processing units (CPUs), have encountered challenges in meeting the escalating demands for high-performance computing. Consequently, this study advocates for heterogeneous hardware acceleration technology, strategically migrating data operations from CPU to varied hardware components [e.g., graphics processing unit (GPU), neural processing unit (NPU)] to enhance processing efficiency and computational performance during the data preprocessing phase. We conducted experiments to evaluate the impact of utilizing hardware heterogeneous acceleration technologies on data processing speed under various workloads and system hardware configurations. By adjusting parameters like batch size and CPU utilization rates, we compared the performance of frameworks that support hardware heterogeneity with popular deep learning frameworks (e.g., PyTorch and TensorFlow) across various hardware configurations and neural network models. Empirical findings demonstrate that the system framework optimized through heterogeneous hardware acceleration technology (the preprocessing speed is improved in all the given experimental environment tests) exhibits commendable universality and superiority in performance. Codes are available at <uri>https://github.com/mindspore-ai/mindspore</uri>.","PeriodicalId":54347,"journal":{"name":"IEEE Internet of Things Journal","volume":"12 10","pages":"14559-14576"},"PeriodicalIF":8.9000,"publicationDate":"2025-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Faster and Stronger: Unleashing Data Processing Potential Through Hardware Heterogeneity\",\"authors\":\"Cong Wang;Yang Luo;Wenzhuo Du;Ke Wang;Naijie Gu;Jun Yu\",\"doi\":\"10.1109/JIOT.2025.3526662\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the rapid advancement of AI technology, there has been a substantial surge in the need for computational resources. Particularly in deep learning, machine learning, and large-scale data analysis, the processing of extensive datasets necessitates exceptionally high levels of computational efficacy and speed. Conventional homogeneous computing platforms, predominantly reliant on central processing units (CPUs), have encountered challenges in meeting the escalating demands for high-performance computing. Consequently, this study advocates for heterogeneous hardware acceleration technology, strategically migrating data operations from CPU to varied hardware components [e.g., graphics processing unit (GPU), neural processing unit (NPU)] to enhance processing efficiency and computational performance during the data preprocessing phase. We conducted experiments to evaluate the impact of utilizing hardware heterogeneous acceleration technologies on data processing speed under various workloads and system hardware configurations. By adjusting parameters like batch size and CPU utilization rates, we compared the performance of frameworks that support hardware heterogeneity with popular deep learning frameworks (e.g., PyTorch and TensorFlow) across various hardware configurations and neural network models. Empirical findings demonstrate that the system framework optimized through heterogeneous hardware acceleration technology (the preprocessing speed is improved in all the given experimental environment tests) exhibits commendable universality and superiority in performance. Codes are available at <uri>https://github.com/mindspore-ai/mindspore</uri>.\",\"PeriodicalId\":54347,\"journal\":{\"name\":\"IEEE Internet of Things Journal\",\"volume\":\"12 10\",\"pages\":\"14559-14576\"},\"PeriodicalIF\":8.9000,\"publicationDate\":\"2025-01-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Internet of Things Journal\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10829854/\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Internet of Things Journal","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10829854/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
Faster and Stronger: Unleashing Data Processing Potential Through Hardware Heterogeneity
With the rapid advancement of AI technology, there has been a substantial surge in the need for computational resources. Particularly in deep learning, machine learning, and large-scale data analysis, the processing of extensive datasets necessitates exceptionally high levels of computational efficacy and speed. Conventional homogeneous computing platforms, predominantly reliant on central processing units (CPUs), have encountered challenges in meeting the escalating demands for high-performance computing. Consequently, this study advocates for heterogeneous hardware acceleration technology, strategically migrating data operations from CPU to varied hardware components [e.g., graphics processing unit (GPU), neural processing unit (NPU)] to enhance processing efficiency and computational performance during the data preprocessing phase. We conducted experiments to evaluate the impact of utilizing hardware heterogeneous acceleration technologies on data processing speed under various workloads and system hardware configurations. By adjusting parameters like batch size and CPU utilization rates, we compared the performance of frameworks that support hardware heterogeneity with popular deep learning frameworks (e.g., PyTorch and TensorFlow) across various hardware configurations and neural network models. Empirical findings demonstrate that the system framework optimized through heterogeneous hardware acceleration technology (the preprocessing speed is improved in all the given experimental environment tests) exhibits commendable universality and superiority in performance. Codes are available at https://github.com/mindspore-ai/mindspore.
期刊介绍:
The EEE Internet of Things (IoT) Journal publishes articles and review articles covering various aspects of IoT, including IoT system architecture, IoT enabling technologies, IoT communication and networking protocols such as network coding, and IoT services and applications. Topics encompass IoT's impacts on sensor technologies, big data management, and future internet design for applications like smart cities and smart homes. Fields of interest include IoT architecture such as things-centric, data-centric, service-oriented IoT architecture; IoT enabling technologies and systematic integration such as sensor technologies, big sensor data management, and future Internet design for IoT; IoT services, applications, and test-beds such as IoT service middleware, IoT application programming interface (API), IoT application design, and IoT trials/experiments; IoT standardization activities and technology development in different standard development organizations (SDO) such as IEEE, IETF, ITU, 3GPP, ETSI, etc.