异构边缘和云系统的竞争感知性能建模

Proceedings of the 3rd Workshop on Flexible Resource and Application Management on the Edge Pub Date : 2023-08-14 DOI:10.1145/3589010.3594889

Ismet Dagli, Andrew Depke, Andrew Mueller, M. S. Hassan, A. Akoglu, M. Belviranli

{"title":"异构边缘和云系统的竞争感知性能建模","authors":"Ismet Dagli, Andrew Depke, Andrew Mueller, M. S. Hassan, A. Akoglu, M. Belviranli","doi":"10.1145/3589010.3594889","DOIUrl":null,"url":null,"abstract":"Diversely Heterogeneous System-on-Chips (DH-SoC) are increasingly popular computing platforms in many fields, such as autonomous driving and AR/VR applications, due to their ability to effectively balance performance and energy efficiency. Having multiple target accelerators for multiple concurrent workloads requires a careful runtime analysis of scheduling. In this study, we examine a scenario that mandates several concerns to be carefully addressed: 1) exploring the mapping of various workloads to heterogeneous accelerators to optimize the system for better performance, 2) analyzing data from the physical world in runtime to minimize the response time of the system 3) accurately estimating the resource contention by workloads during runtime since there will be con- current operations running under the same die, and 4) deferring the operation to the cloud for computationally more demanding operations such as continuous learning or real-time rendering, de- pending on the complexity of the computation. We demonstrate our analysis and approach on a VR project as a case study by using NVIDIA Xavier NX Edge DH-SoC and a server equipped with NVIDIA GeForce RTX 3080 GPU and AMD EPYC 7402 CPU.","PeriodicalId":325857,"journal":{"name":"Proceedings of the 3rd Workshop on Flexible Resource and Application Management on the Edge","volume":"50 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Contention-aware Performance Modeling for Heterogeneous Edge and Cloud Systems\",\"authors\":\"Ismet Dagli, Andrew Depke, Andrew Mueller, M. S. Hassan, A. Akoglu, M. Belviranli\",\"doi\":\"10.1145/3589010.3594889\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Diversely Heterogeneous System-on-Chips (DH-SoC) are increasingly popular computing platforms in many fields, such as autonomous driving and AR/VR applications, due to their ability to effectively balance performance and energy efficiency. Having multiple target accelerators for multiple concurrent workloads requires a careful runtime analysis of scheduling. In this study, we examine a scenario that mandates several concerns to be carefully addressed: 1) exploring the mapping of various workloads to heterogeneous accelerators to optimize the system for better performance, 2) analyzing data from the physical world in runtime to minimize the response time of the system 3) accurately estimating the resource contention by workloads during runtime since there will be con- current operations running under the same die, and 4) deferring the operation to the cloud for computationally more demanding operations such as continuous learning or real-time rendering, de- pending on the complexity of the computation. We demonstrate our analysis and approach on a VR project as a case study by using NVIDIA Xavier NX Edge DH-SoC and a server equipped with NVIDIA GeForce RTX 3080 GPU and AMD EPYC 7402 CPU.\",\"PeriodicalId\":325857,\"journal\":{\"name\":\"Proceedings of the 3rd Workshop on Flexible Resource and Application Management on the Edge\",\"volume\":\"50 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-08-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 3rd Workshop on Flexible Resource and Application Management on the Edge\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3589010.3594889\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 3rd Workshop on Flexible Resource and Application Management on the Edge","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3589010.3594889","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

由于能够有效地平衡性能和能源效率，异构系统芯片(DH-SoC)在自动驾驶和AR/VR应用等许多领域越来越受欢迎。为多个并发工作负载提供多个目标加速器需要对调度进行仔细的运行时分析。在本研究中，我们研究了一个场景，该场景要求仔细解决几个问题:1)探索各种工作负载到异构加速器的映射，以优化系统以获得更好的性能;2)在运行时分析来自物理世界的数据，以最小化系统的响应时间;3)在运行时准确估计工作负载的资源争用，因为将有并发操作运行在同一个die下。4)根据计算的复杂性，将计算要求更高的操作(如连续学习或实时渲染)推迟到云端。我们通过使用NVIDIA Xavier NX Edge hd - soc和配备NVIDIA GeForce RTX 3080 GPU和AMD EPYC 7402 CPU的服务器，展示了我们对VR项目的分析和方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Contention-aware Performance Modeling for Heterogeneous Edge and Cloud Systems

Diversely Heterogeneous System-on-Chips (DH-SoC) are increasingly popular computing platforms in many fields, such as autonomous driving and AR/VR applications, due to their ability to effectively balance performance and energy efficiency. Having multiple target accelerators for multiple concurrent workloads requires a careful runtime analysis of scheduling. In this study, we examine a scenario that mandates several concerns to be carefully addressed: 1) exploring the mapping of various workloads to heterogeneous accelerators to optimize the system for better performance, 2) analyzing data from the physical world in runtime to minimize the response time of the system 3) accurately estimating the resource contention by workloads during runtime since there will be con- current operations running under the same die, and 4) deferring the operation to the cloud for computationally more demanding operations such as continuous learning or real-time rendering, de- pending on the complexity of the computation. We demonstrate our analysis and approach on a VR project as a case study by using NVIDIA Xavier NX Edge DH-SoC and a server equipped with NVIDIA GeForce RTX 3080 GPU and AMD EPYC 7402 CPU.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the 3rd Workshop on Flexible Resource and Application Management on the Edge

自引率

0.00%

发文量