分布式 GPU 平台上量子计算模拟的性能分析和建模

IF 2.2 3区 物理与天体物理 Q1 PHYSICS, MATHEMATICAL
Armin Ahmadzadeh, Hamid Sarbazi-Azad
{"title":"分布式 GPU 平台上量子计算模拟的性能分析和建模","authors":"Armin Ahmadzadeh,&nbsp;Hamid Sarbazi-Azad","doi":"10.1007/s11128-024-04580-x","DOIUrl":null,"url":null,"abstract":"<div><p>Quantum computing holds great promise for accelerating computational tasks, but they are still not accessible. To fill this gap, quantum computing simulators have been widely used for the developing of quantum circuits and algorithms. Simulating quantum algorithms on classical computers also poses challenges due to the need for exponential memory and computational requirements. Many researchers attempted to address such challenges on different single-core, multi-core, and many-core systems, especially graphics processing units (GPUs). The diversity of CPU and GPU simulation of quantum circuits, including various CPU–GPU combinations and multiple parameters, including qubit size, memory capacity, circuit depth, GPU performance, resource heterogeneity, and load imbalance, makes it even more challenging. Finding the best configuration requires an exhaustive search in the design space, which is not possible in an acceptable time frame. Therefore, given the multitude of parameters and the analysis of influential factors, having an analytical model for selecting the proper configuration is desirable and even essential for large systems. This paper proposes a novel analytical performance model for quantum circuit simulation on a hybrid CPU–GPU platform of various sizes and parameters such as number of CPUs/GPUs, qubit size, memory capacity, quantum circuit depth, CPU/GPU performance, resource heterogeneity, and processing load. To do so, we focus on evaluating a scalable and adaptive hybrid quantum simulator in a hybrid platform with some CPUs and GPUs across multiple hosts. The model analyzes the execution time of individual GPU kernels and the impact of major micro-architecture features on performance. By employing dynamic load partitioning (DLP) and the heterogeneous multi-GPU kernel, performance bottlenecks are accurately identified, and execution time is estimated. The proposed model shows high accuracy, with a 94% accuracy compared to the experimental results on a hybrid multi-node cluster. Therefore, the proposed model provides insights into scalability, efficiency, and load balancing in hybrid parallel systems, hence supporting code optimization and development of efficient quantum algorithms and advanced quantum circuit simulation on hybrid parallel architectures.\n</p></div>","PeriodicalId":746,"journal":{"name":"Quantum Information Processing","volume":null,"pages":null},"PeriodicalIF":2.2000,"publicationDate":"2024-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Performance analysis and modeling for quantum computing simulation on distributed GPU platforms\",\"authors\":\"Armin Ahmadzadeh,&nbsp;Hamid Sarbazi-Azad\",\"doi\":\"10.1007/s11128-024-04580-x\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Quantum computing holds great promise for accelerating computational tasks, but they are still not accessible. To fill this gap, quantum computing simulators have been widely used for the developing of quantum circuits and algorithms. Simulating quantum algorithms on classical computers also poses challenges due to the need for exponential memory and computational requirements. Many researchers attempted to address such challenges on different single-core, multi-core, and many-core systems, especially graphics processing units (GPUs). The diversity of CPU and GPU simulation of quantum circuits, including various CPU–GPU combinations and multiple parameters, including qubit size, memory capacity, circuit depth, GPU performance, resource heterogeneity, and load imbalance, makes it even more challenging. Finding the best configuration requires an exhaustive search in the design space, which is not possible in an acceptable time frame. Therefore, given the multitude of parameters and the analysis of influential factors, having an analytical model for selecting the proper configuration is desirable and even essential for large systems. This paper proposes a novel analytical performance model for quantum circuit simulation on a hybrid CPU–GPU platform of various sizes and parameters such as number of CPUs/GPUs, qubit size, memory capacity, quantum circuit depth, CPU/GPU performance, resource heterogeneity, and processing load. To do so, we focus on evaluating a scalable and adaptive hybrid quantum simulator in a hybrid platform with some CPUs and GPUs across multiple hosts. The model analyzes the execution time of individual GPU kernels and the impact of major micro-architecture features on performance. By employing dynamic load partitioning (DLP) and the heterogeneous multi-GPU kernel, performance bottlenecks are accurately identified, and execution time is estimated. The proposed model shows high accuracy, with a 94% accuracy compared to the experimental results on a hybrid multi-node cluster. Therefore, the proposed model provides insights into scalability, efficiency, and load balancing in hybrid parallel systems, hence supporting code optimization and development of efficient quantum algorithms and advanced quantum circuit simulation on hybrid parallel architectures.\\n</p></div>\",\"PeriodicalId\":746,\"journal\":{\"name\":\"Quantum Information Processing\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":2.2000,\"publicationDate\":\"2024-11-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Quantum Information Processing\",\"FirstCategoryId\":\"101\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s11128-024-04580-x\",\"RegionNum\":3,\"RegionCategory\":\"物理与天体物理\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"PHYSICS, MATHEMATICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Quantum Information Processing","FirstCategoryId":"101","ListUrlMain":"https://link.springer.com/article/10.1007/s11128-024-04580-x","RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PHYSICS, MATHEMATICAL","Score":null,"Total":0}
引用次数: 0

摘要

量子计算在加速计算任务方面大有可为,但目前仍无法实现。为了填补这一空白,量子计算模拟器已被广泛用于开发量子电路和算法。由于需要指数级的内存和计算要求,在经典计算机上模拟量子算法也带来了挑战。许多研究人员试图在不同的单核、多核和多核系统,尤其是图形处理器(GPU)上应对这些挑战。量子电路的CPU和GPU仿真具有多样性,包括各种CPU-GPU组合和多种参数,包括量子比特大小、内存容量、电路深度、GPU性能、资源异构性和负载不平衡性,这使得仿真更具挑战性。要找到最佳配置,需要在设计空间中进行穷举式搜索,而这在可接受的时间范围内是不可能实现的。因此,考虑到众多参数和影响因素的分析,建立一个分析模型来选择适当的配置是可取的,甚至对大型系统来说是必不可少的。本文提出了一种新的分析性能模型,用于在 CPU/GPU 混合平台上进行量子电路仿真,该平台具有不同的规模和参数,如 CPU/GPU 数量、量子比特大小、内存容量、量子电路深度、CPU/GPU 性能、资源异构性和处理负载。为此,我们重点评估了一个可扩展和自适应的混合量子模拟器,该模拟器在一个混合平台中使用一些 CPU 和 GPU,跨越多个主机。该模型分析了单个 GPU 内核的执行时间以及主要微架构特性对性能的影响。通过采用动态负载分区(DLP)和异构多 GPU 内核,可以准确识别性能瓶颈并估算执行时间。与混合多节点集群上的实验结果相比,所提模型的准确率高达 94%。因此,所提出的模型有助于深入了解混合并行系统的可扩展性、效率和负载平衡,从而支持混合并行架构上高效量子算法和先进量子电路仿真的代码优化和开发。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Performance analysis and modeling for quantum computing simulation on distributed GPU platforms

Performance analysis and modeling for quantum computing simulation on distributed GPU platforms

Quantum computing holds great promise for accelerating computational tasks, but they are still not accessible. To fill this gap, quantum computing simulators have been widely used for the developing of quantum circuits and algorithms. Simulating quantum algorithms on classical computers also poses challenges due to the need for exponential memory and computational requirements. Many researchers attempted to address such challenges on different single-core, multi-core, and many-core systems, especially graphics processing units (GPUs). The diversity of CPU and GPU simulation of quantum circuits, including various CPU–GPU combinations and multiple parameters, including qubit size, memory capacity, circuit depth, GPU performance, resource heterogeneity, and load imbalance, makes it even more challenging. Finding the best configuration requires an exhaustive search in the design space, which is not possible in an acceptable time frame. Therefore, given the multitude of parameters and the analysis of influential factors, having an analytical model for selecting the proper configuration is desirable and even essential for large systems. This paper proposes a novel analytical performance model for quantum circuit simulation on a hybrid CPU–GPU platform of various sizes and parameters such as number of CPUs/GPUs, qubit size, memory capacity, quantum circuit depth, CPU/GPU performance, resource heterogeneity, and processing load. To do so, we focus on evaluating a scalable and adaptive hybrid quantum simulator in a hybrid platform with some CPUs and GPUs across multiple hosts. The model analyzes the execution time of individual GPU kernels and the impact of major micro-architecture features on performance. By employing dynamic load partitioning (DLP) and the heterogeneous multi-GPU kernel, performance bottlenecks are accurately identified, and execution time is estimated. The proposed model shows high accuracy, with a 94% accuracy compared to the experimental results on a hybrid multi-node cluster. Therefore, the proposed model provides insights into scalability, efficiency, and load balancing in hybrid parallel systems, hence supporting code optimization and development of efficient quantum algorithms and advanced quantum circuit simulation on hybrid parallel architectures.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Quantum Information Processing
Quantum Information Processing 物理-物理:数学物理
CiteScore
4.10
自引率
20.00%
发文量
337
审稿时长
4.5 months
期刊介绍: Quantum Information Processing is a high-impact, international journal publishing cutting-edge experimental and theoretical research in all areas of Quantum Information Science. Topics of interest include quantum cryptography and communications, entanglement and discord, quantum algorithms, quantum error correction and fault tolerance, quantum computer science, quantum imaging and sensing, and experimental platforms for quantum information. Quantum Information Processing supports and inspires research by providing a comprehensive peer review process, and broadcasting high quality results in a range of formats. These include original papers, letters, broadly focused perspectives, comprehensive review articles, book reviews, and special topical issues. The journal is particularly interested in papers detailing and demonstrating quantum information protocols for cryptography, communications, computation, and sensing.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信