RACES '12最新文献

筛选
英文 中文
How FIFO is your concurrent FIFO queue? 如何FIFO是你的并发FIFO队列?
RACES '12 Pub Date : 2012-10-21 DOI: 10.1145/2414729.2414731
Andreas Haas, C. Kirsch, Michael Lippautz, H. Payer
{"title":"How FIFO is your concurrent FIFO queue?","authors":"Andreas Haas, C. Kirsch, Michael Lippautz, H. Payer","doi":"10.1145/2414729.2414731","DOIUrl":"https://doi.org/10.1145/2414729.2414731","url":null,"abstract":"Designing and implementing high-performance concurrent data structures whose access performance scales on multicore hardware is difficult. Concurrent implementations of FIFO queues, for example, seem to require algorithms that efficiently increase the potential for parallel access by implementing semantically relaxed rather than strict FIFO queues where elements may be returned in some out-of-order fashion. However, we show experimentally that the on average shorter execution time of enqueue and dequeue operations of fast but relaxed implementations may offset the effect of semantical relaxations making them appear as behaving more FIFO than strict but slow implementations. Our key assumption is that ideal concurrent data structure operations should execute in zero time. We define two metrics, element-fairness and operation-fairness, to measure the degree of element and operation reordering, respectively, assuming operations take zero time. Element-fairness quantifies the deviation from FIFO queue semantics had all operations executed in zero time. With this metric even strict implementations of FIFO queues are not FIFO. Operation-fairness helps explaining element-fairness by quantifying operation reordering when considering the actual time operations took effect relative to their invocation time. In our experiments, the effect of poor operation-fairness of strict but slow implementations on element-fairness may outweigh the effect of semantical relaxation of fast but relaxed implementations.","PeriodicalId":137547,"journal":{"name":"RACES '12","volume":"139 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127476271","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
Approximate parallel sorting on a spatial computer 空间计算机上的近似并行排序
RACES '12 Pub Date : 2012-10-21 DOI: 10.1145/2414729.2414739
Max OrHai, A. Black
{"title":"Approximate parallel sorting on a spatial computer","authors":"Max OrHai, A. Black","doi":"10.1145/2414729.2414739","DOIUrl":"https://doi.org/10.1145/2414729.2414739","url":null,"abstract":"We describe collision sort, a simple distributed sorting algorithm for a spatial computer on a regular lattice, which represents data by mobile particles in an abstract space. Although collision sort produces only approximate results, it tolerates faults, minimizes communication, and adapts easily to simultaneous multi-axis sorting. We show simulations of collision sort in 1 and 2 dimensions, and discuss its scalability properties.","PeriodicalId":137547,"journal":{"name":"RACES '12","volume":"398 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125504101","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Does better throughput require worse latency? 更好的吞吐量是否需要更差的延迟?
RACES '12 Pub Date : 2012-10-21 DOI: 10.1145/2414729.2414736
D. Ungar, D. Kimelman, Sam S. Adams, M. Wegman
{"title":"Does better throughput require worse latency?","authors":"D. Ungar, D. Kimelman, Sam S. Adams, M. Wegman","doi":"10.1145/2414729.2414736","DOIUrl":"https://doi.org/10.1145/2414729.2414736","url":null,"abstract":"Let throughput denote the amount of application-level work performed in unit time, normalized to the amount of work that would be accomplished with perfect linear scaling. Let latency denote the mean time required for a thread on one core to observe a change effected by a thread on another core, normalized to the best latency possible for the given platform. Might it be true that algorithms that improve application-level throughput worsen inter-core application-level latency? As techniques for improving performance have evolved from mutex-and-locks to race-and-repair, each seems to have offered more throughput at the expense of increased latency.","PeriodicalId":137547,"journal":{"name":"RACES '12","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129643277","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dancing with uncertainty 与不确定性共舞
RACES '12 Pub Date : 2012-10-21 DOI: 10.1145/2414729.2414738
Sasa Misailovic, Stelios Sidiroglou, M. Rinard
{"title":"Dancing with uncertainty","authors":"Sasa Misailovic, Stelios Sidiroglou, M. Rinard","doi":"10.1145/2414729.2414738","DOIUrl":"https://doi.org/10.1145/2414729.2414738","url":null,"abstract":"We present Dubstep, a novel system that uses the find-transform-navigate paradigm to automatically explore new parallelization opportunities in already parallelized (fully-synchronized) programs by opportunistically relaxing synchronization primitives. This set of transformations generates a space of alternative, possibly non-deterministic, parallel programs with varying performance and accuracy characteristics. The freedom to generate parallel programs whose output may differ (within statistical accuracy bounds) from the output of the original program enables a significantly larger optimization space. Dubstep then searches this space to find a parallel program that will, with high likelihood, produce outputs that are acceptably close to the outputs that the original, fully synchronized parallel program would have produced.\u0000 Initial results from our benchmarked application show that Dubstep can generate acceptably accurate and efficient versions of a parallel program that occupy different positions in performance/accuracy trade off space.","PeriodicalId":137547,"journal":{"name":"RACES '12","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128497064","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 36
Edge chasing delayed consistency: pushing the limits of weak memory models 边缘追逐延迟一致性:推动弱内存模型的极限
RACES '12 Pub Date : 2012-10-21 DOI: 10.1145/2414729.2414733
Harold W. Cain, Mikko H. Lipasti
{"title":"Edge chasing delayed consistency: pushing the limits of weak memory models","authors":"Harold W. Cain, Mikko H. Lipasti","doi":"10.1145/2414729.2414733","DOIUrl":"https://doi.org/10.1145/2414729.2414733","url":null,"abstract":"In shared memory multiprocessors utilizing invalidation-based coherence protocols, cache misses caused by inter-processor communication are a dominant source of processor stall cycles for many applications. We explore a novel coherence protocol implementation called edge-chasing delayed consistency (ECDC) that mitigates some of the performance degradation caused by this class of misses. Edge-chasing delayed consistency allows a processor to non-speculatively continue reading a cache line after receiving an invalidation from another core, without changing the consistency model offered to programmers. While the idea of using stale data for as long as possible is enticing, our study shows that the benefits of such delay are small, and that the majority of these delayed invalidation benefits come from mitigating the false sharing problem, rather than any tolerance of races or an application's ability to consume stale data in a productive manner.","PeriodicalId":137547,"journal":{"name":"RACES '12","volume":"231 1-2","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120923817","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Programming with relaxed synchronization 轻松同步编程
RACES '12 Pub Date : 2012-10-21 DOI: 10.1145/2414729.2414737
Lakshminarayanan Renganarayana, V. Srinivasan, R. Nair, D. Prener
{"title":"Programming with relaxed synchronization","authors":"Lakshminarayanan Renganarayana, V. Srinivasan, R. Nair, D. Prener","doi":"10.1145/2414729.2414737","DOIUrl":"https://doi.org/10.1145/2414729.2414737","url":null,"abstract":"Synchronization overhead is a major bottleneck in scaling parallel applications to a large number of cores. This continues to be true in spite of various synchronization-reduction techniques that have been proposed. Previously studied synchronization-reduction techniques tacitly assume that all synchronizations specified in a source program are essential to guarantee quality of the results produced by the program. Recently there have been proposals to relax the synchronizations in a parallel program and compute approximate results. A fundamental challenge in using relaxed synchronization is guaranteeing that the relaxed program always produces results with a specified quality. We propose a methodology that addresses this challenge in programming with relaxed synchronization. Using our methodology programmers can systematically relax synchronization while always producing results that are of same quality as the original (un-relaxed) program. We demonstrate significant speedups using our methodology on a variety of benchmarks (e.g., up to 15x on KMeans benchmark, and up to 3x on a already highly tuned kernel from Graph500 benchmark).","PeriodicalId":137547,"journal":{"name":"RACES '12","volume":"2015 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121365773","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 84
Position paper: nondeterminism is unavoidable, but data races are pure evil 立场文件:不确定性是不可避免的,但数据竞赛是纯粹的邪恶
RACES '12 Pub Date : 2012-10-21 DOI: 10.1145/2414729.2414732
H. Boehm
{"title":"Position paper: nondeterminism is unavoidable, but data races are pure evil","authors":"H. Boehm","doi":"10.1145/2414729.2414732","DOIUrl":"https://doi.org/10.1145/2414729.2414732","url":null,"abstract":"Modern mainstream programming languages distinguish between \"atomic\" (or sometimes \"volatile\") variables, and ordinary data. Atomic accesses are treated as synchronization constructs, and support concurrent access with well-defined semantics. In contrast, concurrent accesses to ordinary data, if at least one access is an update, constitute a data race. Code with data races does not have well-defined semantics. Such code may fail completely when recompiled or run on a different operating system version. In C and C++ data races are equivalent to assignments to out-of-bounds array elements; any data race can result in arbitrary failures, including application crashes, hangs, and inexplicably and completely wrong answers.\u0000 These language specifications, combined with implementation realities, make it unsafe to exploit \"benign\" data races to obtain performance, even if we are willing to tolerate approximate answers. Furthermore, even if we happen to get lucky, and code with data races happens to execute correctly with our current compiler, data races provide at best inconsequential performance advantages over atomics. In fact, there are interesting, and probably common, cases in which data races provide only a minor performance advantage, even over pervasive locking to avoid them, it at sufficiently large core counts. We demonstrate such a case.","PeriodicalId":137547,"journal":{"name":"RACES '12","volume":"53 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127614677","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 48
A case for relativistic programming 相对论规划的一个例子
RACES '12 Pub Date : 2012-10-21 DOI: 10.1145/2414729.2414735
Philip W. Howard, J. Walpole
{"title":"A case for relativistic programming","authors":"Philip W. Howard, J. Walpole","doi":"10.1145/2414729.2414735","DOIUrl":"https://doi.org/10.1145/2414729.2414735","url":null,"abstract":"Programmers traditionally assume we live and program in a sequentially consistent world. Hardware developers have long since abandoned sequential consistency in order to develop higher performance computers. In this paper, we argue that the natural world maintains causal consistency not sequential consistency. From that observation, we argue that concurrent programs only need to maintain causal consistency. Since causal consistency does not require a total order on operations, it can be implemented with less impact on scalability than more strict ordering guarantees.","PeriodicalId":137547,"journal":{"name":"RACES '12","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121896004","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Beyond expert-only parallel programming? 超越仅限专家的并行编程?
RACES '12 Pub Date : 2012-10-21 DOI: 10.1145/2414729.2414734
P. McKenney
{"title":"Beyond expert-only parallel programming?","authors":"P. McKenney","doi":"10.1145/2414729.2414734","DOIUrl":"https://doi.org/10.1145/2414729.2414734","url":null,"abstract":"My parallel-programming education began in earnest when I joined Sequent Computer Systems in late 1990. This education was both brief and effective: within a few short years, my co-workers and I were breaking new ground [MG92, MS93, MS98].1 Nor was I alone: Sequent habitually hired new-to-parallelism engineers and had them producing competent parallel code within a few months. Nevertheless, more than two decades later, parallel programming is perceived to be difficult to teach and learn. Is parallel programming an exception to the typical transitioning of technnology from impossible to expert-only to routine to unworthy of conscious thought?","PeriodicalId":137547,"journal":{"name":"RACES '12","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115449966","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信