Optimization of heterogeneous parallel algorithm for Monte Carlo neutron transport simulation aiming at thread divergence Issues

IF 1.9 3区 工程技术 Q1 NUCLEAR SCIENCE & TECHNOLOGY
Xian Zhang , Shu Li , Xin Wang , Danhua ShangGuan , Shichang Liu
{"title":"Optimization of heterogeneous parallel algorithm for Monte Carlo neutron transport simulation aiming at thread divergence Issues","authors":"Xian Zhang ,&nbsp;Shu Li ,&nbsp;Xin Wang ,&nbsp;Danhua ShangGuan ,&nbsp;Shichang Liu","doi":"10.1016/j.anucene.2025.111508","DOIUrl":null,"url":null,"abstract":"<div><div>The Monte Carlo simulation of large-scale neutron transport problems has always faced the problem of slow computation. In order to fully exploit the acceleration advantage of heterogeneous parallelism on the Monte Carlo neutron transport simulation, this paper carries out research around the history-based neutron tracking algorithm, deeply explores the adaptation of the Monte Carlo algorithm and heterogeneous parallelism. Aiming at the thread divergence problem, optimization strategies for particle tracking algorithm are proposed to ensure load balancing among parallel threads. In addition, to mitigate the impact of global memory access latency, the memory layout of the particle state data is reasonably arranged by comprehensively considering the random memory access of Monte Carlo algorithm and the hardware characteristics of GPU. The reliability and efficiency of heterogeneous parallel algorithm are validated in calculations of benchmarks, the computing performance on an NVIDIA A800 GPU is equivalent to the performance of 62–87 CPU cores.</div></div>","PeriodicalId":8006,"journal":{"name":"Annals of Nuclear Energy","volume":"219 ","pages":"Article 111508"},"PeriodicalIF":1.9000,"publicationDate":"2025-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Annals of Nuclear Energy","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0306454925003251","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"NUCLEAR SCIENCE & TECHNOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

The Monte Carlo simulation of large-scale neutron transport problems has always faced the problem of slow computation. In order to fully exploit the acceleration advantage of heterogeneous parallelism on the Monte Carlo neutron transport simulation, this paper carries out research around the history-based neutron tracking algorithm, deeply explores the adaptation of the Monte Carlo algorithm and heterogeneous parallelism. Aiming at the thread divergence problem, optimization strategies for particle tracking algorithm are proposed to ensure load balancing among parallel threads. In addition, to mitigate the impact of global memory access latency, the memory layout of the particle state data is reasonably arranged by comprehensively considering the random memory access of Monte Carlo algorithm and the hardware characteristics of GPU. The reliability and efficiency of heterogeneous parallel algorithm are validated in calculations of benchmarks, the computing performance on an NVIDIA A800 GPU is equivalent to the performance of 62–87 CPU cores.
针对线程发散问题的蒙特卡罗中子输运模拟异构并行算法优化
大规模中子输运问题的蒙特卡罗模拟一直面临着计算速度慢的问题。为了充分发挥异构并行在蒙特卡罗中子输运模拟中的加速优势,本文围绕基于历史的中子跟踪算法展开研究,深入探讨了蒙特卡罗算法与异构并行的自适应。针对线程发散问题,提出了粒子跟踪算法的优化策略,以保证并行线程间的负载均衡。此外,为了减轻全局存储器访问延迟的影响,综合考虑蒙特卡罗算法的随机存储器访问和GPU的硬件特性,合理安排粒子状态数据的存储器布局。在基准计算中验证了异构并行算法的可靠性和效率,在NVIDIA A800 GPU上的计算性能相当于62-87个CPU核的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Annals of Nuclear Energy
Annals of Nuclear Energy 工程技术-核科学技术
CiteScore
4.30
自引率
21.10%
发文量
632
审稿时长
7.3 months
期刊介绍: Annals of Nuclear Energy provides an international medium for the communication of original research, ideas and developments in all areas of the field of nuclear energy science and technology. Its scope embraces nuclear fuel reserves, fuel cycles and cost, materials, processing, system and component technology (fission only), design and optimization, direct conversion of nuclear energy sources, environmental control, reactor physics, heat transfer and fluid dynamics, structural analysis, fuel management, future developments, nuclear fuel and safety, nuclear aerosol, neutron physics, computer technology (both software and hardware), risk assessment, radioactive waste disposal and reactor thermal hydraulics. Papers submitted to Annals need to demonstrate a clear link to nuclear power generation/nuclear engineering. Papers which deal with pure nuclear physics, pure health physics, imaging, or attenuation and shielding properties of concretes and various geological materials are not within the scope of the journal. Also, papers that deal with policy or economics are not within the scope of the journal.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信