Ali Aalsaud, A. Rafiev, Fei Xia, R. Shafik, A. Yakovlev
{"title":"高能效多核异构系统并发工作负载的无模型运行时管理","authors":"Ali Aalsaud, A. Rafiev, Fei Xia, R. Shafik, A. Yakovlev","doi":"10.1109/PATMOS.2018.8464142","DOIUrl":null,"url":null,"abstract":"Modern embedded systems execute multiple applications, both sequentially and concurrently, on heterogeneous platforms. Determining the most energy-efficient system configuration (i.e. the number of parallel threads, their core allocations and operating frequencies) tailored for each kind of workload is extremely challenging. In this paper, we propose a novel runtime optimization approach with the aim of maximizing power-normalized performance considering dynamic workload variations. To reduce overhead and complexity, we adopt a model-free approach to runtime adaptation based on workload classification. This classification is supported by analysis of data collected from a comprehensive study investigating the tradeoffs between inter-application concurrency with performance and power under different system configurations. We conduct extensive experiments on an Odroid XU3 heterogeneous platform with synthetic and standard benchmark applications to develop the control policies and validate our approach. These experiments show that workload classification into CPU-intensive and memory-intensive types provides the foundation for scalable energy minimization with low complexity. Implementing this approach as a Linux runtime governor, we demonstrate that IPS/Watt can be improved by over 120% compared to existing approaches.","PeriodicalId":234100,"journal":{"name":"2018 28th International Symposium on Power and Timing Modeling, Optimization and Simulation (PATMOS)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":"{\"title\":\"Model-Free Runtime Management of Concurrent Workloads for Energy-Efficient Many-Core Heterogeneous Systems\",\"authors\":\"Ali Aalsaud, A. Rafiev, Fei Xia, R. Shafik, A. Yakovlev\",\"doi\":\"10.1109/PATMOS.2018.8464142\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Modern embedded systems execute multiple applications, both sequentially and concurrently, on heterogeneous platforms. Determining the most energy-efficient system configuration (i.e. the number of parallel threads, their core allocations and operating frequencies) tailored for each kind of workload is extremely challenging. In this paper, we propose a novel runtime optimization approach with the aim of maximizing power-normalized performance considering dynamic workload variations. To reduce overhead and complexity, we adopt a model-free approach to runtime adaptation based on workload classification. This classification is supported by analysis of data collected from a comprehensive study investigating the tradeoffs between inter-application concurrency with performance and power under different system configurations. We conduct extensive experiments on an Odroid XU3 heterogeneous platform with synthetic and standard benchmark applications to develop the control policies and validate our approach. These experiments show that workload classification into CPU-intensive and memory-intensive types provides the foundation for scalable energy minimization with low complexity. Implementing this approach as a Linux runtime governor, we demonstrate that IPS/Watt can be improved by over 120% compared to existing approaches.\",\"PeriodicalId\":234100,\"journal\":{\"name\":\"2018 28th International Symposium on Power and Timing Modeling, Optimization and Simulation (PATMOS)\",\"volume\":\"16 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"9\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 28th International Symposium on Power and Timing Modeling, Optimization and Simulation (PATMOS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/PATMOS.2018.8464142\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 28th International Symposium on Power and Timing Modeling, Optimization and Simulation (PATMOS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PATMOS.2018.8464142","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Model-Free Runtime Management of Concurrent Workloads for Energy-Efficient Many-Core Heterogeneous Systems
Modern embedded systems execute multiple applications, both sequentially and concurrently, on heterogeneous platforms. Determining the most energy-efficient system configuration (i.e. the number of parallel threads, their core allocations and operating frequencies) tailored for each kind of workload is extremely challenging. In this paper, we propose a novel runtime optimization approach with the aim of maximizing power-normalized performance considering dynamic workload variations. To reduce overhead and complexity, we adopt a model-free approach to runtime adaptation based on workload classification. This classification is supported by analysis of data collected from a comprehensive study investigating the tradeoffs between inter-application concurrency with performance and power under different system configurations. We conduct extensive experiments on an Odroid XU3 heterogeneous platform with synthetic and standard benchmark applications to develop the control policies and validate our approach. These experiments show that workload classification into CPU-intensive and memory-intensive types provides the foundation for scalable energy minimization with low complexity. Implementing this approach as a Linux runtime governor, we demonstrate that IPS/Watt can be improved by over 120% compared to existing approaches.