Shervin Hajiamini, B. Shirazi, Chris Cain, Hongbo Dong
{"title":"An energy-constrained makespan optimization framework in fine-to coarse-grain partitioned multicore systems","authors":"Shervin Hajiamini, B. Shirazi, Chris Cain, Hongbo Dong","doi":"10.1109/IGCC.2017.8323582","DOIUrl":"https://doi.org/10.1109/IGCC.2017.8323582","url":null,"abstract":"In today's multicore systems, depending on an application's computational demand, cores are either operated individually at different Voltage/Frequency (V/F) levels or grouped into multiple Voltage-Frequency Islands (VFIs) to reduce system energy consumption. This paper formulates a task scheduling and VFI partitioning problem whose optimization goal is to minimize the task set (application) execution time (makespan) for a given energy budget. First, the combinatorial optimization problem is formulated with Integer Linear Programming (ILP) to obtain per-core, per-task dynamic V/F levels in a fine-grain VFI-based system with single-core islands. Next, static task scheduling on coarse-grain VFI-based systems, where an island can contain several cores operated at the same V/F level, is formulated with Mixed Integer Linear Programming (MILP), considering the energy budget and task set's precedence constraints. The experimental results show that under different energy budget constraints, fine-grain, dynamic task allocations provide on average 1.35x speedup over static coarse grain scheduling and partitioning methods.","PeriodicalId":133239,"journal":{"name":"2017 Eighth International Green and Sustainable Computing Conference (IGSC)","volume":"514 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123248565","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Ryan E. Grant, J. Laros, M. Levenhagen, Stephen L. Olivier, K. Pedretti, L. Ward, A. Younge
{"title":"Evaluating energy and power profiling techniques for HPC workloads","authors":"Ryan E. Grant, J. Laros, M. Levenhagen, Stephen L. Olivier, K. Pedretti, L. Ward, A. Younge","doi":"10.1109/IGCC.2017.8323587","DOIUrl":"https://doi.org/10.1109/IGCC.2017.8323587","url":null,"abstract":"Advanced power measurement capabilities are becoming available on large scale High Performance Computing (HPC) deployments. There exist several approaches to providing power measurements today, primarily through in-band (e.g. RAPL) and out-of-band measurements (e.g. power meters). Both types of measurement can be augmented with application-level profiling, however it can be difficult to assess the type and detail of measurement needed to obtain insight from the application power profile. This paper presents a taxonomy for classifying power profiling techniques on modern HPC platforms. Three HPC mini-applications are analyzed across three production HPC systems to examine the level of detail, scope, and complexity of these power profiles. We demonstrate that a combination of out-of-band measurement with in-band application region profiling can provide an accurate, detailed view of power usage without introducing overhead. This work also provides a set of recommendations for how to best profile HPC workloads.","PeriodicalId":133239,"journal":{"name":"2017 Eighth International Green and Sustainable Computing Conference (IGSC)","volume":"384 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116641349","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Energy consumption in Java: An early experience","authors":"Mohit Kumar, Youhuizi Li, Weisong Shi","doi":"10.1109/IGCC.2017.8323579","DOIUrl":"https://doi.org/10.1109/IGCC.2017.8323579","url":null,"abstract":"There has been a 10,000-fold increase in performance of supercomputers since 1992 but only 300-fold improvement in performance per watt. Dynamic adaptation of hardware techniques such as fine-grain clock gating, power gating and dynamic voltage/frequency scaling, are used for many years to improve the computer's energy efficiency. However, recent demands of exascale computation, as well as the increasing carbon footprint, require new breakthrough to make ICT systems more energy efficient. Energy efficient software has not been well studied in the last decade. In this paper, we take an early step to investigate the energy efficiency of Java which is one of the most common languages used in ICT systems. We evaluate energy consumption of data types, operators, control statements, exception, and object in Java at a granular level. Intel Running Average Power Limit (RAPL) technology is applied to measure the relative power consumption of small code snippets. Several observations are found, and these results will help in standardizing the energy consumption traits of Java which can be leveraged by software developers to generate energy efficient code in future.","PeriodicalId":133239,"journal":{"name":"2017 Eighth International Green and Sustainable Computing Conference (IGSC)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133261235","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Investigation of LSTM based prediction for dynamic energy management in chip multiprocessors","authors":"M. Moghaddam, Wenkai Guan, Cristinel Ababei","doi":"10.1109/IGCC.2017.8323597","DOIUrl":"https://doi.org/10.1109/IGCC.2017.8323597","url":null,"abstract":"In this paper, we investigate the effectiveness of using long short-term memory (LSTM) instead of Kalman filtering to do prediction for the purpose of constructing dynamic energy management (DEM) algorithms in chip multi-processors (CMPs). Either of the two prediction methods is employed to estimate the workload in the next control period for each of the processor cores. These estimates are then used to select voltage-frequency (VF) pairs for each core of the CMP during the next control period as part of a dynamic voltage and frequency scaling (DVFS) technique. The objective of the DVFS technique is to reduce energy consumption under performance constraints that are set by the user. We conduct our investigation using a custom Sniper system simulation framework. Simulation results for 16 and 64 core network-on-chip based CMP architectures and using several benchmarks demonstrate that the LSTM is slightly better than Kalman filtering.","PeriodicalId":133239,"journal":{"name":"2017 Eighth International Green and Sustainable Computing Conference (IGSC)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124579940","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Proactive thermal aware scheduling","authors":"Shehenaz Shaik, Sanjeev Baskiyar","doi":"10.1109/IGCC.2017.8323604","DOIUrl":"https://doi.org/10.1109/IGCC.2017.8323604","url":null,"abstract":"High temperatures and fluctuating temperatures decrease component reliability and lifespan. This work proposes a proactive software-based thermal aware scheduler to lower core temperature and its temperature fluctuations. It proposes a Simple Time Derivative (STD) scheduler, which uses the time derivative of the core temperature as a predictor. Major heat dissipating processes can be identified by their usage of integer arithmetic, float operations and other CPU performance counters. The “hot” processes are put to sleep for a short duration, if the time derivative goes above an empirically defined threshold. This work evaluates STD using FFT, SOR, LU, and Sparse benchmarks of the SciMark benchmark suite running on a desktop computer. We found upto 5° C decrease in average/peak temperatures as compared to the baseline approach (without any thermal scheduling). The execution penalties only apply to the hot processes and not the whole system. For LU/Sparse the core stayed at 35° C or below for 100%/82% of time with STD vs. only 28%/19% of increase in run-time for the baseline. Furthermore, for the baseline the temperature went over 40° C for 16% of run-time vs. 0% for the STD. Holding the temperature lower has advantages in cooling energy reduction particularly when several systems are running together in a room or in a server system. We also compared our results against Simple Threshold approach. STD provided lower run-time penalties and energy consumption than the Simple Threshold strategy and marginally outperformed in terms of temperature reduction. This research provides insight into the temperature reductions possible using a user-defined software approach and the corresponding penalties on the hot processes. The approach can be combined with air conditioning management techniques in server production systems to reduce energy consumption for any job mix where execution time is not paramount. The reduction in temperature and its variations also increases reliability and lifespan of the CPU chip.","PeriodicalId":133239,"journal":{"name":"2017 Eighth International Green and Sustainable Computing Conference (IGSC)","volume":"75 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129353217","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Evaluating the impact of data layout and placement on the energy efficiency of heterogeneous applications","authors":"Apan Qasem, Samuel Teich","doi":"10.1109/IGCC.2017.8323574","DOIUrl":"https://doi.org/10.1109/IGCC.2017.8323574","url":null,"abstract":"Heterogeneous compute nodes have become an integral component of today's HPC systems. Recent research has established the importance of data layout and placement on such systems. This paper explores the power and energy aspects of data layout and placement on heterogeneous systems. We present results of an experimental study that evaluates the impact of data layout and placement on candidate HPC node architectures for kernels that exhibit a wide variety of performance characteristics. The results of the study show that data layout and placement can have a significant impact on the energy efficiency of heterogeneous applications. On some platforms, selecting the appropriate layout can yield up to an order-of-magnitude improvement in energy efficiency. The study shows that the conventional approach of using a structure-of-arrays for device-mapped data structures is not always profitable and that in addition to memory divergence, data layout choices are impacted by a variety of factors including arithmetic intensity and task granularity. The results of the study are used to establish a set of energy imperatives to guide data layout and placement across different architectures.","PeriodicalId":133239,"journal":{"name":"2017 Eighth International Green and Sustainable Computing Conference (IGSC)","volume":"55 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132403193","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"POWSER: A novel user-experience based power management metric","authors":"Maithili P. Gandhe, L. John, A. Gerstlauer","doi":"10.1109/IGCC.2017.8323606","DOIUrl":"https://doi.org/10.1109/IGCC.2017.8323606","url":null,"abstract":"With the rapid increase in deployment of a variety of applications on mobile devices, and an increasing amount of activity and time spent on the devices, battery life continues to be a major source of concern. At the same time, hardware and software advances in mobile computing have enabled multiple usage functions to be integrated into one device, making energy consumption of the device more critical. Some approximations in computing can be made without noticeable degradation in quality to achieve savings in power. We explain the concept of user-experience based power management (UEPM) and propose a novel simple to use user-perceived-quality-energy metric called POWSER to build a UEPM-based system with a holistic objective. POWSER can also provide optimal operating points for the applications to perform fast dynamic online optimizations of applications such as video. Quality-energy tradeoffs have been carried out at various levels before. However, existing approaches do not take the final user-experience into account. POWSER is an easy to compute and easy to use weighted metric based on the nature of quality-energy tradeoffs. POWSER models QoS parameters in software that can be propagated down to hardware to implement further power savings across the system stack. Keeping acceptable user experience, POWSER provides an optimal set of operating points for each application by exploiting power savings of up to 34% in image applications.","PeriodicalId":133239,"journal":{"name":"2017 Eighth International Green and Sustainable Computing Conference (IGSC)","volume":"63 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124426722","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Context-driven power management in cache-enabled base stations using a Bayesian neural network","authors":"Luhao Wang, Shuang Chen, Massoud Pedram","doi":"10.1109/IGCC.2017.8323565","DOIUrl":"https://doi.org/10.1109/IGCC.2017.8323565","url":null,"abstract":"Aggressive network densification in next generation cellular networks is accompanied by an increase of the system energy consumption and calls for more advanced power management techniques in base stations. In this paper, we present a novel proactive and decentralized power management method for small cell base stations in a cache-enabled multitier heterogeneous cellular network. User contexts are utilized to drive the decision of dynamically switching a small cell base station between the active mode and the sleep mode to minimize the total energy consumption. The online control problem is formulated as a contextual multi-armed bandit problem. A variational inference based Bayesian neural network is proposed as the solution method, which implicitly finds a proper balance between exploration and exploitation. Experimental results show that the proposed solution can achieve up to 46.9% total energy reduction compared to baseline algorithms in the high density deployment scenario and has comparable performance to an offline optimal solution.","PeriodicalId":133239,"journal":{"name":"2017 Eighth International Green and Sustainable Computing Conference (IGSC)","volume":"201 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115578433","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}