主动热感知调度

Shehenaz Shaik, Sanjeev Baskiyar
{"title":"主动热感知调度","authors":"Shehenaz Shaik, Sanjeev Baskiyar","doi":"10.1109/IGCC.2017.8323604","DOIUrl":null,"url":null,"abstract":"High temperatures and fluctuating temperatures decrease component reliability and lifespan. This work proposes a proactive software-based thermal aware scheduler to lower core temperature and its temperature fluctuations. It proposes a Simple Time Derivative (STD) scheduler, which uses the time derivative of the core temperature as a predictor. Major heat dissipating processes can be identified by their usage of integer arithmetic, float operations and other CPU performance counters. The “hot” processes are put to sleep for a short duration, if the time derivative goes above an empirically defined threshold. This work evaluates STD using FFT, SOR, LU, and Sparse benchmarks of the SciMark benchmark suite running on a desktop computer. We found upto 5° C decrease in average/peak temperatures as compared to the baseline approach (without any thermal scheduling). The execution penalties only apply to the hot processes and not the whole system. For LU/Sparse the core stayed at 35° C or below for 100%/82% of time with STD vs. only 28%/19% of increase in run-time for the baseline. Furthermore, for the baseline the temperature went over 40° C for 16% of run-time vs. 0% for the STD. Holding the temperature lower has advantages in cooling energy reduction particularly when several systems are running together in a room or in a server system. We also compared our results against Simple Threshold approach. STD provided lower run-time penalties and energy consumption than the Simple Threshold strategy and marginally outperformed in terms of temperature reduction. This research provides insight into the temperature reductions possible using a user-defined software approach and the corresponding penalties on the hot processes. The approach can be combined with air conditioning management techniques in server production systems to reduce energy consumption for any job mix where execution time is not paramount. The reduction in temperature and its variations also increases reliability and lifespan of the CPU chip.","PeriodicalId":133239,"journal":{"name":"2017 Eighth International Green and Sustainable Computing Conference (IGSC)","volume":"75 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Proactive thermal aware scheduling\",\"authors\":\"Shehenaz Shaik, Sanjeev Baskiyar\",\"doi\":\"10.1109/IGCC.2017.8323604\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"High temperatures and fluctuating temperatures decrease component reliability and lifespan. This work proposes a proactive software-based thermal aware scheduler to lower core temperature and its temperature fluctuations. It proposes a Simple Time Derivative (STD) scheduler, which uses the time derivative of the core temperature as a predictor. Major heat dissipating processes can be identified by their usage of integer arithmetic, float operations and other CPU performance counters. The “hot” processes are put to sleep for a short duration, if the time derivative goes above an empirically defined threshold. This work evaluates STD using FFT, SOR, LU, and Sparse benchmarks of the SciMark benchmark suite running on a desktop computer. We found upto 5° C decrease in average/peak temperatures as compared to the baseline approach (without any thermal scheduling). The execution penalties only apply to the hot processes and not the whole system. For LU/Sparse the core stayed at 35° C or below for 100%/82% of time with STD vs. only 28%/19% of increase in run-time for the baseline. Furthermore, for the baseline the temperature went over 40° C for 16% of run-time vs. 0% for the STD. Holding the temperature lower has advantages in cooling energy reduction particularly when several systems are running together in a room or in a server system. We also compared our results against Simple Threshold approach. STD provided lower run-time penalties and energy consumption than the Simple Threshold strategy and marginally outperformed in terms of temperature reduction. This research provides insight into the temperature reductions possible using a user-defined software approach and the corresponding penalties on the hot processes. The approach can be combined with air conditioning management techniques in server production systems to reduce energy consumption for any job mix where execution time is not paramount. The reduction in temperature and its variations also increases reliability and lifespan of the CPU chip.\",\"PeriodicalId\":133239,\"journal\":{\"name\":\"2017 Eighth International Green and Sustainable Computing Conference (IGSC)\",\"volume\":\"75 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 Eighth International Green and Sustainable Computing Conference (IGSC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IGCC.2017.8323604\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 Eighth International Green and Sustainable Computing Conference (IGSC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IGCC.2017.8323604","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6

摘要

高温和波动温度降低了组件的可靠性和寿命。本文提出了一种基于软件的主动热感知调度器,以降低堆芯温度及其温度波动。提出了一种简单时间导数(STD)调度程序,该调度程序使用核心温度的时间导数作为预测器。主要的散热过程可以通过它们使用整数运算、浮点运算和其他CPU性能计数器来识别。如果时间导数超过经验定义的阈值,“热”过程将在短时间内处于休眠状态。这项工作使用在台式计算机上运行的SciMark基准测试套件的FFT、SOR、LU和Sparse基准测试来评估STD。我们发现,与基线方法(没有任何热调度)相比,平均/峰值温度降低了5°C。执行惩罚只适用于热流程,而不是整个系统。对于LU/Sparse,核心在STD中保持在35°C或更低的时间为100%/82%,而基线的运行时间仅增加28%/19%。此外,在基准测试中,16%的运行时间温度超过40°C,而STD测试的运行时间为0%。保持较低的温度在降低冷却能耗方面具有优势,特别是当多个系统在一个房间或服务器系统中一起运行时。我们还将我们的结果与简单阈值方法进行了比较。STD提供了比Simple Threshold策略更低的运行时惩罚和能耗,并且在降低温度方面略优于Simple Threshold策略。这项研究为使用用户定义的软件方法和对热过程的相应处罚提供了可能的温度降低的见解。该方法可以与服务器生产系统中的空调管理技术相结合,以减少执行时间不重要的任何作业组合的能耗。温度的降低及其变化也增加了CPU芯片的可靠性和寿命。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Proactive thermal aware scheduling
High temperatures and fluctuating temperatures decrease component reliability and lifespan. This work proposes a proactive software-based thermal aware scheduler to lower core temperature and its temperature fluctuations. It proposes a Simple Time Derivative (STD) scheduler, which uses the time derivative of the core temperature as a predictor. Major heat dissipating processes can be identified by their usage of integer arithmetic, float operations and other CPU performance counters. The “hot” processes are put to sleep for a short duration, if the time derivative goes above an empirically defined threshold. This work evaluates STD using FFT, SOR, LU, and Sparse benchmarks of the SciMark benchmark suite running on a desktop computer. We found upto 5° C decrease in average/peak temperatures as compared to the baseline approach (without any thermal scheduling). The execution penalties only apply to the hot processes and not the whole system. For LU/Sparse the core stayed at 35° C or below for 100%/82% of time with STD vs. only 28%/19% of increase in run-time for the baseline. Furthermore, for the baseline the temperature went over 40° C for 16% of run-time vs. 0% for the STD. Holding the temperature lower has advantages in cooling energy reduction particularly when several systems are running together in a room or in a server system. We also compared our results against Simple Threshold approach. STD provided lower run-time penalties and energy consumption than the Simple Threshold strategy and marginally outperformed in terms of temperature reduction. This research provides insight into the temperature reductions possible using a user-defined software approach and the corresponding penalties on the hot processes. The approach can be combined with air conditioning management techniques in server production systems to reduce energy consumption for any job mix where execution time is not paramount. The reduction in temperature and its variations also increases reliability and lifespan of the CPU chip.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信