{"title":"功率受限非对称多核处理器的缓存竞争感知运行时调度","authors":"Jian-He Liao, He-Ru Chen, Ya-Shu Chen","doi":"10.1145/3400286.3418230","DOIUrl":null,"url":null,"abstract":"Asymmetric multicore architecture is widely applied to the embedded systems to better trade-off performance and energy consumption. With an increased number of applications concurrently executed in the system, the power consumption and the associated last-level cache latency are increased. To maximize the system performance under the power constraint, we proposed a cache contention-aware run-time scheduling for asymmetric multicore systems. To deal with the dynamic workload and cache contention effect, the CPI model learning is presented to adjust the relation between system performance, executing frequency, and executing clusters. Based on the CPI model prediction, the run-time dispatcher is then presented to determine the executing frequency and cores to maximize system throughput under power constraint. The proposed algorithm was implemented on the commercial Odroid XU4 board. The performance was evaluated using benchmarks and impressive results were obtained.","PeriodicalId":326100,"journal":{"name":"Proceedings of the International Conference on Research in Adaptive and Convergent Systems","volume":"40 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A Cache Contention-aware Run-time Scheduling for Power-constrained Asymmetric Multicore Processors\",\"authors\":\"Jian-He Liao, He-Ru Chen, Ya-Shu Chen\",\"doi\":\"10.1145/3400286.3418230\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Asymmetric multicore architecture is widely applied to the embedded systems to better trade-off performance and energy consumption. With an increased number of applications concurrently executed in the system, the power consumption and the associated last-level cache latency are increased. To maximize the system performance under the power constraint, we proposed a cache contention-aware run-time scheduling for asymmetric multicore systems. To deal with the dynamic workload and cache contention effect, the CPI model learning is presented to adjust the relation between system performance, executing frequency, and executing clusters. Based on the CPI model prediction, the run-time dispatcher is then presented to determine the executing frequency and cores to maximize system throughput under power constraint. The proposed algorithm was implemented on the commercial Odroid XU4 board. The performance was evaluated using benchmarks and impressive results were obtained.\",\"PeriodicalId\":326100,\"journal\":{\"name\":\"Proceedings of the International Conference on Research in Adaptive and Convergent Systems\",\"volume\":\"40 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-10-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the International Conference on Research in Adaptive and Convergent Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3400286.3418230\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the International Conference on Research in Adaptive and Convergent Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3400286.3418230","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Cache Contention-aware Run-time Scheduling for Power-constrained Asymmetric Multicore Processors
Asymmetric multicore architecture is widely applied to the embedded systems to better trade-off performance and energy consumption. With an increased number of applications concurrently executed in the system, the power consumption and the associated last-level cache latency are increased. To maximize the system performance under the power constraint, we proposed a cache contention-aware run-time scheduling for asymmetric multicore systems. To deal with the dynamic workload and cache contention effect, the CPI model learning is presented to adjust the relation between system performance, executing frequency, and executing clusters. Based on the CPI model prediction, the run-time dispatcher is then presented to determine the executing frequency and cores to maximize system throughput under power constraint. The proposed algorithm was implemented on the commercial Odroid XU4 board. The performance was evaluated using benchmarks and impressive results were obtained.