迈向一个时间和空间高效功能实现的蒙特卡罗光子传输代码

Proceedings 1997 International Conference on Parallel Architectures and Compilation Techniques Pub Date : 1997-11-11 DOI:10.1109/PACT.1997.644024

J. Hammes, A. Böhm

{"title":"迈向一个时间和空间高效功能实现的蒙特卡罗光子传输代码","authors":"J. Hammes, A. Böhm","doi":"10.1109/PACT.1997.644024","DOIUrl":null,"url":null,"abstract":"In this paper we present three Sisal versions of a large Monte Carlo radiation transport code: a straightforward version, a stream version, and a stripmined loop version. We compare these versions with respect to their time and space efficiency and their parallelism. We discuss the compiler used in this project, which generates multithreaded shared memory code. We discuss the effect of strictness on program behavior. Sisal provides the fastest, purely functional, sequential code we have seen for this benchmark:, using a constant amount of space. The stream version suffers from the fact that streams have a strict implementation in the Sisal compiler, so programs using long streams are both space inefficient and can show limited parallel speedup. The stripmined version of our code uses relatively small amounts of space, and shows a speedup of only around two for four processors, as it exhibits significant reference count lock contention.","PeriodicalId":177411,"journal":{"name":"Proceedings 1997 International Conference on Parallel Architectures and Compilation Techniques","volume":"133 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1997-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Towards a time and space efficient functional implementation of a Monte Carlo photon transport code\",\"authors\":\"J. Hammes, A. Böhm\",\"doi\":\"10.1109/PACT.1997.644024\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper we present three Sisal versions of a large Monte Carlo radiation transport code: a straightforward version, a stream version, and a stripmined loop version. We compare these versions with respect to their time and space efficiency and their parallelism. We discuss the compiler used in this project, which generates multithreaded shared memory code. We discuss the effect of strictness on program behavior. Sisal provides the fastest, purely functional, sequential code we have seen for this benchmark:, using a constant amount of space. The stream version suffers from the fact that streams have a strict implementation in the Sisal compiler, so programs using long streams are both space inefficient and can show limited parallel speedup. The stripmined version of our code uses relatively small amounts of space, and shows a speedup of only around two for four processors, as it exhibits significant reference count lock contention.\",\"PeriodicalId\":177411,\"journal\":{\"name\":\"Proceedings 1997 International Conference on Parallel Architectures and Compilation Techniques\",\"volume\":\"133 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1997-11-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings 1997 International Conference on Parallel Architectures and Compilation Techniques\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/PACT.1997.644024\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings 1997 International Conference on Parallel Architectures and Compilation Techniques","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PACT.1997.644024","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

在本文中，我们提出了一个大型蒙特卡罗辐射传输码的三个Sisal版本:一个直接版本，一个流版本和一个条带循环版本。我们比较了这些版本的时间和空间效率以及并行性。我们将讨论这个项目中使用的编译器，它生成多线程共享内存代码。我们讨论了严格性对程序行为的影响。Sisal提供了我们在这个基准测试中见过的最快的、纯功能的、顺序的代码，使用了恒定的空间。流版本的缺点是，流在Sisal编译器中有严格的实现，因此使用长流的程序不仅空间效率低，而且只能显示有限的并行加速。我们的代码的精简版本使用了相对较少的空间，并且对于4个处理器只显示了大约2的加速，因为它显示了显著的引用计数锁争用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Towards a time and space efficient functional implementation of a Monte Carlo photon transport code

In this paper we present three Sisal versions of a large Monte Carlo radiation transport code: a straightforward version, a stream version, and a stripmined loop version. We compare these versions with respect to their time and space efficiency and their parallelism. We discuss the compiler used in this project, which generates multithreaded shared memory code. We discuss the effect of strictness on program behavior. Sisal provides the fastest, purely functional, sequential code we have seen for this benchmark:, using a constant amount of space. The stream version suffers from the fact that streams have a strict implementation in the Sisal compiler, so programs using long streams are both space inefficient and can show limited parallel speedup. The stripmined version of our code uses relatively small amounts of space, and shows a speedup of only around two for four processors, as it exhibits significant reference count lock contention.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings 1997 International Conference on Parallel Architectures and Compilation Techniques

自引率

0.00%

发文量