{"title":"A measurement study of GPU DVFS on energy conservation","authors":"Xinxin Mei, L. Yung, Kaiyong Zhao, Xiaowen Chu","doi":"10.1145/2525526.2525852","DOIUrl":"https://doi.org/10.1145/2525526.2525852","url":null,"abstract":"Nowadays, GPUs are widely used to accelerate many high performance computing applications. Energy conservation of such computing systems has become an important research topic. Dynamic voltage/frequency scaling (DVFS) is proved to be an appealing method for saving energy for traditional computing centers. However, there is still a lack of firsthand study on the effectiveness of GPU DVFS. This paper presents a thorough measurement study that aims to explore how GPU DVFS affects the system energy consumption. We conduct experiments on a real GPU platform with 37 benchmark applications. Our results show that GPU voltage/frequency scaling is an effective approach to conserving energy. For example, by scaling down the GPU core voltage and frequency, we have achieved an average of 19.28% energy reduction compared with the default setting, while giving up no more than 4% of performance. For all tested GPU applications, core voltage scaling is significantly effective to reduce system energy consumption. Meanwhile the effects of scaling core frequency and memory frequency depend on the characteristics of GPU applications.","PeriodicalId":112226,"journal":{"name":"Power-Aware Computer Systems","volume":"146 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115097333","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Evaluating integrated graphics processors for data center workloads","authors":"Sangman Kim, Indrajit Roy, V. Talwar","doi":"10.1145/2525526.2525847","DOIUrl":"https://doi.org/10.1145/2525526.2525847","url":null,"abstract":"More than 90% of consumer computers use integrated graphics processors. In these processors, the CPU and the GPU share the same physical memory. Due to high density, good power efficiency, and low cost, integrated graphics processors are promising candidates for next-generation micro-servers and, hence, data-center workloads.\u0000 While discrete graphics processors have been extensively studied, there is very little work on characterizing integrated GPUs. This paper is a step towards understanding the power and performance of integrated GPUs. Our results reveal many architectural caveats that programmers need to be aware of to exploit integrated GPUs: memory contention between the CPU and GPU, workload dependent energy efficiency, and data transfer tradeoffs.","PeriodicalId":112226,"journal":{"name":"Power-Aware Computer Systems","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121069030","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Greening the compute cloud's pricing plans","authors":"Rini T. Kaushik, P. Sarkar, Abdullah Gharaibeh","doi":"10.1145/2525526.2525855","DOIUrl":"https://doi.org/10.1145/2525526.2525855","url":null,"abstract":"Customers are levied a charge by the cloud service providers for using their services and the attractiveness of the offered pricing plan is one of the key determinants in the cloud adoption. The paper presents a synergistic cloud where the pricing plan, scheduler, and the charge-back model work in tandem with each other to provide green and cost-efficient computing options and incentives to the environmentally-friendly and cost-conscious end-users. The fine-grained, wholesale electricity price aware scheduler works in synergy with end-users' choices to reduce operating energy costs. The fine-grained, resource-utilization based, and spot price aware charge-back model is used to provide incentives in form of reduced usage costs. Strong evaluation results with real-world traces from Google show the feasibility of the proposed techniques.","PeriodicalId":112226,"journal":{"name":"Power-Aware Computer Systems","volume":"73 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114475964","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Towards better CPU power management on multicore smartphones","authors":"Yifan Zhang, Xudong Wang, Xuanzhe Liu, Yunxin Liu, Linda Zhuang, Feng Zhao","doi":"10.1145/2525526.2525849","DOIUrl":"https://doi.org/10.1145/2525526.2525849","url":null,"abstract":"Although multicore smartphones have become increasingly mainstream, it is unclear whether and how smartphone applications can utilize multicore CPUs to improve performance. In this paper we study the performance of mobile applications using multicore CPUs, in terms of power and computation cost. Using Web browsing as an example, our preliminary measurement results show that even large applications like Web browsers with multi-threading acceleration cannot fully utilize the multicore CPUs. Furthermore, we find that the existing CPU power models on smartphones are ill-suited for modern multicore CPUs. We develop a new CPU power model with a high accuracy, 95.6% on average. Our work helps to better understand the performance of multicore smartphones and paves the way towards better CPU power management on multicore smartphones.","PeriodicalId":112226,"journal":{"name":"Power-Aware Computer Systems","volume":"145 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127310969","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A. C. Riekstin, Sean James, A. Kansal, Jie Liu, Eric Peterson
{"title":"No more electrical infrastructure: towards fuel cell powered data centers","authors":"A. C. Riekstin, Sean James, A. Kansal, Jie Liu, Eric Peterson","doi":"10.1145/2525526.2525853","DOIUrl":"https://doi.org/10.1145/2525526.2525853","url":null,"abstract":"We consider the use of fuel cells for powering data centers, based on benefits in reliability, capital and operational costs, and reduced environmental emissions. Using fuel cells effectively in data centers introduces several challenges and we highlight key research questions for designing a fuel cell based data center power distribution system. We analyze a specific configuration in the design space to quantify the cost benefits for a large scale data center, for the most mature and commonly deployed fuel cell technology, achieving over 20% reduction in costs using conservative projections.","PeriodicalId":112226,"journal":{"name":"Power-Aware Computer Systems","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115192214","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Racing and pacing to idle: an evaluation of heuristics for energy-aware resource allocation","authors":"H. Hoffmann","doi":"10.1145/2525526.2525854","DOIUrl":"https://doi.org/10.1145/2525526.2525854","url":null,"abstract":"We examine the problem of assigning computing resources to an application to meet a performance goal while minimizing energy consumption. We present a general formulation of this problem as a linear program, discuss several potential heuristic solutions, and evaluate these heuristics on two real systems (one purchased in 2010, the other in 2013). We find that the well-known race-to-idle heuristic is close to the optimal solution on the older machine. On the newer machine, however, the optimal solution outperforms race-to-idle by over 35%. A generalization of race-to-idle, called pace-to-idle, is found to provide better results in a wider range of scenarios.","PeriodicalId":112226,"journal":{"name":"Power-Aware Computer Systems","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124644412","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Why application errors drain battery easily?: a study of memory leaks in smartphone apps","authors":"Mingyuan Xia, Wenbo He, Xue Liu, Jie Liu","doi":"10.1145/2525526.2525846","DOIUrl":"https://doi.org/10.1145/2525526.2525846","url":null,"abstract":"Mobile operating systems embrace new mechanisms that reduce energy consumption for common usage scenarios. The background app design is a representative implemented in all major mobile OSes. The OS keeps apps that are not currently interacting with the user in memory to avoid repeated app loading. This mechanism improves responsiveness and reduces the energy consumption when the user switches apps. However, we demonstrate that application errors, in particular memory leaks that cause system memory pressure, can easily cripple this mechanism. In this paper, we conduct experiments on real Android smartphones to 1) evaluate how the background app design improves responsiveness and saves energy; 2) characterize memory leaks in Android apps and outline its energy impact; 3) propose design improvements to retrofit the mechanism against memory leaks.","PeriodicalId":112226,"journal":{"name":"Power-Aware Computer Systems","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127942171","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Chenguang Shen, Supriyo Chakraborty, K. Raghavan, Haksoo Choi, M. Srivastava
{"title":"Exploiting processor heterogeneity for energy efficient context inference on mobile phones","authors":"Chenguang Shen, Supriyo Chakraborty, K. Raghavan, Haksoo Choi, M. Srivastava","doi":"10.1145/2525526.2525856","DOIUrl":"https://doi.org/10.1145/2525526.2525856","url":null,"abstract":"In recent years we have seen the emergence of context-aware mobile sensing apps which employ machine learning algorithms on real-time sensor data to infer user behaviors and contexts. These apps are typically optimized for power and performance on the app processors of mobile platforms. However, modern mobile platforms are sophisticated system on chips (SoCs) where the main app processors are complemented by multiple co-processors. Recently chip vendors have undertaken nascent efforts to make these previously hidden co-processors such as the digital signal processors (DSPs) programmable. In this paper, we explore the energy and performance implications of off-loading the computation associated with machine learning algorithms in context-aware apps to DSPs embedded in mobile SoCs. Our results show a 17% reduction in a TI OMAP4 based mobile platform's energy usage from off-loading context classification computation to the DSP core with indiscernible latency overhead. We also describe the design of a run-time system service for energy efficient context inference on Android devices, which takes parameters from the app to instantiate the classification model and schedules the execution on the DSP or app processor as specified by the app.","PeriodicalId":112226,"journal":{"name":"Power-Aware Computer Systems","volume":"56 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116572519","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Cooperative control architecture of fan-less servers and fresh-air cooling in container servers for low power operation","authors":"Hiroshi Endo, H. Kodama, Hiroyuki Fukuda, Toshio Sugimoto, Takashi Horie, Masao Kondo","doi":"10.1145/2525526.2525844","DOIUrl":"https://doi.org/10.1145/2525526.2525844","url":null,"abstract":"In order to minimize the container server power consumption, a new cooling system that incorporates fan-less servers and fresh-air cooling is proposed.\u0000 In a conventional container data center, the required air flow for sever cooling is supplied by both server built-in fans and container facility fans. Therefore, this work has been carried out on fan-less servers to reduce power consumption. Although fan-less servers are expected to reduce power consumption, facility fans have to provide excessive air to secure a safe operation of servers. In order to achieve optimized air-flow from facility fans to cool fan-less servers, a power saving control system incorporating the IT system and cooling facilities is proposed. Here, facility fans are controlled based on server information such as CPU temperature, rack position and so on. Through this study, we suggest that the minimum point in total power consumption of the container server with no performance penalty existed by the trade-off relationship between the power consumption changes of servers and of facility fans with CPU temperature. This enables us to operate the server system with minimized power consumption depending on the air temperature. To verify the energy-saving effect of this technology, a prototype container server with the proposed system was constructed. As a result, 22.8% energy saving was achieved with this new system, compared with the conventional container servers with built-in fans.","PeriodicalId":112226,"journal":{"name":"Power-Aware Computer Systems","volume":"118 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128167868","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
John L. Byrne, Jichuan Chang, Kevin T. Lim, Laura L. Ramirez, Parthasarathy Ranganathan
{"title":"Power-efficient networking for balanced system designs: early experiences with PCIe","authors":"John L. Byrne, Jichuan Chang, Kevin T. Lim, Laura L. Ramirez, Parthasarathy Ranganathan","doi":"10.1145/2039252.2039255","DOIUrl":"https://doi.org/10.1145/2039252.2039255","url":null,"abstract":"Recent proposals using low-power processors and Flash-based storage can dramatically improve the energy-efficiency of compute and storage subsystems in data-centric computing. However, in a balanced system design, these changes call for matching improvement in the network subsystem as well. Conventional Ethernet-based networks are a potential energy-efficiency bottleneck due to the limited performance of gigabit Ethernet and the high power overhead of 10-gigbit Ethernet. In this paper, we evaluate the benefits of using an alternative, high-bandwidth, low-power, interconnect---PCIe---for power-efficient networking. Our experiments using PCIe's Non-Transparent Bridging for data transfer demonstrate significant performance gains at lower power, leading to 60--124% better energy efficiency. Early experiences with PCIe clustering also point to several challenges of PCIe-based networks and new opportunities for low-latency power-efficient datacenter networking.","PeriodicalId":112226,"journal":{"name":"Power-Aware Computer Systems","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116912304","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}