{"title":"Coupling scheduler for MapReduce/Hadoop","authors":"Jian Tan, Xiaoqiao Meng, Li Zhang","doi":"10.1145/2287076.2287097","DOIUrl":"https://doi.org/10.1145/2287076.2287097","url":null,"abstract":"Current schedulers of MapReduce/Hadoop are quite successful in providing good performance. However improving spaces still exist: map and reduce tasks are not jointly optimized for scheduling, albeit there is a strong dependence between them. This can cause job starvation and bad data locality. We design a resource-aware scheduler for Hadoop, which couples the progresses of mappers and reducers, and jointly optimize the placements for both of them. This mitigates the starvation problem and improves the overall data locality. Our experiments demonstrate improvements to job response times by up to an order of magnitude.","PeriodicalId":330072,"journal":{"name":"IEEE International Symposium on High-Performance Parallel Distributed Computing","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128762711","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"QBox: guaranteeing I/O performance on black box storage systems","authors":"Dimitris Skourtis, S. Kato, S. Brandt","doi":"10.1145/2287076.2287087","DOIUrl":"https://doi.org/10.1145/2287076.2287087","url":null,"abstract":"Many storage systems are shared by multiple clients with different types of workloads and performance targets. To achieve performance targets without over-provisioning, a system must provide isolation between clients. Throughput-based reservations are challenging due to the mix of workloads and the stateful nature of disk drives, leading to low reservable throughput, while existing utilization-based solutions require specialized I/O scheduling for each device in the storage system.\u0000 Qbox is a new utilization-based approach for generic black box storage systems that enforces utilization (and, indirectly, throughput) requirements and provides isolation between clients, without specializedlow-level I/O scheduling. Our experimental results show that Qbox provides good isolation and achieves the target utilizations of its clients.","PeriodicalId":330072,"journal":{"name":"IEEE International Symposium on High-Performance Parallel Distributed Computing","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131299637","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Work stealing and persistence-based load balancers for iterative overdecomposed applications","authors":"J. Lifflander, S. Krishnamoorthy, L. Kalé","doi":"10.1145/2287076.2287103","DOIUrl":"https://doi.org/10.1145/2287076.2287103","url":null,"abstract":"Applications often involve iterative execution of identical or slowly evolving calculations. Such applications require incremental rebalancing to improve load balance across iterations. In this paper, we consider the design and evaluation of two distinct approaches to addressing this challenge: persistence-based load balancing and work stealing. The work to be performed is overdecomposed into tasks, enabling automatic rebalancing by the middleware. We present a hierarchical persistence-based rebalancing algorithm that performs localized incremental rebalancing. We also present an active-message-based retentive work stealing algorithm optimized for iterative applications on distributed memory machines. We demonstrate low overheads and high efficiencies on the full NERSC Hopper (146,400 cores) and ALCF Intrepid systems (163,840 cores), and on up to 128,000 cores on OLCF Titan.","PeriodicalId":330072,"journal":{"name":"IEEE International Symposium on High-Performance Parallel Distributed Computing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128912631","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A system-aware optimized data organization for efficient scientific analytics","authors":"Yuan Tian, S. Klasky, Weikuan Yu, H. Abbasi, Bin Wang, N. Podhorszki, R. Grout, M. Wolf","doi":"10.1145/2287076.2287095","DOIUrl":"https://doi.org/10.1145/2287076.2287095","url":null,"abstract":"Large-scale scientific applications on High End Computing systems produce a large volume of highly complex datasets. Such data imposes a grand challenge to conventional storage systems for the need of efficient I/O solutions during both the simulation runtime and data post-processing phases. With the mounting needs of scientific discovery, the read performance of large-scale simulations has becomes a critical issue for the HPC community. In this study, we propose a system-aware optimized data organization strategy that can organize data blocks of multidimensional scientific data efficiently based on simulation output and the underlying storage systems, thereby enabling efficient scientific analytics. Our experimental results demonstrate a performance speedup up to 72 times for the combustion simulation S3D, compared to the logically contiguous data layout.","PeriodicalId":330072,"journal":{"name":"IEEE International Symposium on High-Performance Parallel Distributed Computing","volume":"53 6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124857063","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Enabling event tracing at leadership-class scale through I/O forwarding middleware","authors":"T. Ilsche, Joseph Schuchart, Jason Cope, D. Kimpe, T. Jones, A. Knüpfer, K. Iskra, R. Ross, W. Nagel, S. Poole","doi":"10.1145/2287076.2287085","DOIUrl":"https://doi.org/10.1145/2287076.2287085","url":null,"abstract":"Event tracing is an important tool for understanding the performance of parallel applications. As concurrency increases in leadership-class computing systems, the quantity of performance log data can overload the parallel file system, perturbing the application being observed. In this work we present a solution for event tracing at leadership scales. We enhance the I/O forwarding system software to aggregate and reorganize log data prior to writing to the storage system, significantly reducing the burden on the underlying file system for this type of traffic. Furthermore, we augment the I/O forwarding system with a write buffering capability to limit the impact of artificial perturbations from log data accesses on traced applications. To validate the approach, we modify the Vampir tracing toolset to take advantage of this new capability and show that the approach increases the maximum traced application size by a factor of 5x to more than 200,000 processes.","PeriodicalId":330072,"journal":{"name":"IEEE International Symposium on High-Performance Parallel Distributed Computing","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116066187","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Understanding the effects and implications of compute node related failures in hadoop","authors":"Florin Dinu, T. Ng","doi":"10.1145/2287076.2287108","DOIUrl":"https://doi.org/10.1145/2287076.2287108","url":null,"abstract":"Hadoop has become a critical component in today's cloud environment. Ensuring good performance for Hadoop is paramount for the wide-range of applications built on top of it. In this paper we analyze Hadoop's behavior under failures involving compute nodes. We find that even a single failure can result in inflated, variable and unpredictable job running times, all undesirable properties in a distributed system. We systematically track the causes underlying this distressing behavior. First, we find that Hadoop makes unrealistic assumptions about task progress rates. These assumptions can be easily invalidated by the cloud environment and, more surprisingly, by Hadoop's own design decisions. The result are significant inefficiencies in Hadoop's speculative execution algorithm. Second, failures are re-discovered individually by each task at the cost of great degradation in job running time. The reason is that Hadoop focuses on extreme scalability and thus trades off possible improvements resulting from sharing failure information between tasks. Third, Hadoop does not consider the causes of connection failures between its tasks. We show that the resulting overloading of connection failure semantics unnecessarily causes an otherwise localized failure to propagate to healthy tasks. We also discuss the implications of our findings and draw attention to new ways of improving Hadoop-like frameworks.","PeriodicalId":330072,"journal":{"name":"IEEE International Symposium on High-Performance Parallel Distributed Computing","volume":"23 4","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121004220","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Putting a \"big-data\" platform to good use: training kinect","authors":"M. Budiu","doi":"10.1145/2287076.2287078","DOIUrl":"https://doi.org/10.1145/2287076.2287078","url":null,"abstract":"In the last 7 years at Microsoft Research in Silicon Valley we have constructed the DryadLINQ software stack for large-scale data-parallel cluster computations. The architecture of the ensemble is depicted in Figure 1. The goal of the DryadLINQ project is to make writing parallel programs manipulating large amounts of data (terabytes to petabytes) as easy as programming a single machine. DryadLINQ is a batch computation model, optimized for throughput; since it is targets large clusters of commodity computers faulttolerance is a primary concern. A primary tenet is that moving computation close to the data is much cheaper than moving the data itself. Here we discuss briefly the current architecture of the system (but more research is ongoing). Our software runs on relatively inexpensive computer clusters, using unmodified Windows Server. Our software makes minimal assumptions about the underlying cluster, and has","PeriodicalId":330072,"journal":{"name":"IEEE International Symposium on High-Performance Parallel Distributed Computing","volume":"97 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116876787","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Dynamic adaptive virtual core mapping to improve power, energy, and performance in multi-socket multicores","authors":"C. Bae, Lei Xia, P. Dinda, J. Lange","doi":"10.1145/2287076.2287114","DOIUrl":"https://doi.org/10.1145/2287076.2287114","url":null,"abstract":"Consider a multithreaded parallel application running inside a multicore virtual machine context that is itself hosted on a multi-socket multicore physical machine. How should the VMM map virtual cores to physical cores? We compare a local mapping, which compacts virtual cores to processor sockets, and an interleaved mapping, which spreads them over the sockets. Simply choosing between these two mappings exposes clear tradeoffs between performance, energy, and power. We then describe the design, implementation, and evaluation of a system that automatically and dynamically chooses between the two mappings. The system consists of a set of efficient online VMM-based mechanisms and policies that (a) capture the relevant characteristics of memory reference behavior, (b) provide a policy and mechanism for configuring the mapping of virtual machine cores to physical cores that optimizes for power, energy, or performance, and (c) drive dynamic migrations of virtual cores among local physical cores based on the workload and the currently specified objective. Using these techniques we demonstrate that the performance of SPEC and PARSEC benchmarks can be increased by as much as 66%, energy reduced by as much as 31%, and power reduced by as much as 17%, depending on the optimization objective.","PeriodicalId":330072,"journal":{"name":"IEEE International Symposium on High-Performance Parallel Distributed Computing","volume":"96 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123125082","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Exploring the performance and mapping of HPC applications to platforms in the cloud","authors":"Abhishek K. Gupta, L. Kalé, D. Milojicic, P. Faraboschi, R. Kaufmann, Verdi March, F. Gioachin, Chun Hui Suen, Bu-Sung Lee","doi":"10.1145/2287076.2287093","DOIUrl":"https://doi.org/10.1145/2287076.2287093","url":null,"abstract":"This paper presents a scheme to optimize the mapping of HPC applications to a set of hybrid dedicated and cloud resources. First, we characterize application performance on dedicated clusters and cloud to obtain application signatures. Then, we propose an algorithm to match these signatures to resources such that performance is maximized and cost is minimized. Finally, we show simulation results revealing that in a concrete scenario our proposed scheme reduces the cost by 60% at only 10-15% performance penalty vs. a non optimized configuration. We also find that the execution overhead in cloud can be minimized to a negligible level using thin hypervisors or OS-level containers.","PeriodicalId":330072,"journal":{"name":"IEEE International Symposium on High-Performance Parallel Distributed Computing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130968082","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"vSlicer: latency-aware virtual machine scheduling via differentiated-frequency CPU slicing","authors":"Cong Xu, S. Gamage, P. N. Rao, Ardalan Kangarlou, R. Kompella, Dongyan Xu","doi":"10.1145/2287076.2287080","DOIUrl":"https://doi.org/10.1145/2287076.2287080","url":null,"abstract":"Recent advances in virtualization technologies have made it feasible to host multiple virtual machines (VMs) in the same physical host and even the same CPU core, with fair share of the physical resources among the VMs. However, as more VMs share the same core/CPU, the CPU access latency experienced by each VM increases substantially, which translates into longer I/O processing latency perceived by I/O-bound applications. To mitigate such impact while retaining the benefit of CPU sharing, we introduce a new class of VMs called latency-sensitive VMs (LSVMs), which achieve better performance for I/O-bound applications while maintaining the same resource share (and thus cost) as other CPU-sharing VMs. LSVMs are enabled by vSlicer, a hypervisor-level technique that schedules each LSVM more frequently but with a smaller micro time slice. vSlicer enables more timely processing of I/O events by LSVMs, without violating the CPU share fairness among all sharing VMs. Our evaluation of a vSlicer prototype in Xen shows that vSlicer substantially reduces network packet round-trip times and jitter and improves application-level performance. For example, vSlicer doubles both the connection rate and request processing throughput of an Apache web server; reduces a VoIP server's upstream jitter by 62%; and shortens the execution times of Intel MPI benchmark programs by half or more.","PeriodicalId":330072,"journal":{"name":"IEEE International Symposium on High-Performance Parallel Distributed Computing","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121715295","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}