CCF Transactions on High Performance Computing最新文献

筛选
英文 中文
Extending OP2 framework to support portable parallel programming of complex applications 扩展 OP2 框架,支持复杂应用程序的可移植并行编程
IF 0.9
CCF Transactions on High Performance Computing Pub Date : 2023-12-07 DOI: 10.1007/s42514-023-00174-8
Zongjing Chen, Kangjin Huang, Yonggang Che, Chuanfu Xu, Jian Zhang, Z. Dai, Ming Li
{"title":"Extending OP2 framework to support portable parallel programming of complex applications","authors":"Zongjing Chen, Kangjin Huang, Yonggang Che, Chuanfu Xu, Jian Zhang, Z. Dai, Ming Li","doi":"10.1007/s42514-023-00174-8","DOIUrl":"https://doi.org/10.1007/s42514-023-00174-8","url":null,"abstract":"","PeriodicalId":29895,"journal":{"name":"CCF Transactions on High Performance Computing","volume":null,"pages":null},"PeriodicalIF":0.9,"publicationDate":"2023-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138591803","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Leveraging simulation of high performance computing systems with node simulation using architecture simulator 利用架构模拟器对高性能计算系统进行节点仿真
CCF Transactions on High Performance Computing Pub Date : 2023-11-13 DOI: 10.1007/s42514-023-00173-9
Fang Lin, Yi Liu, Xin Wang, Xueyan Gai
{"title":"Leveraging simulation of high performance computing systems with node simulation using architecture simulator","authors":"Fang Lin, Yi Liu, Xin Wang, Xueyan Gai","doi":"10.1007/s42514-023-00173-9","DOIUrl":"https://doi.org/10.1007/s42514-023-00173-9","url":null,"abstract":"","PeriodicalId":29895,"journal":{"name":"CCF Transactions on High Performance Computing","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136281968","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
OneGraph: a cross-architecture framework for large-scale graph computing on GPUs based on oneAPI OneGraph:基于oneAPI的gpu大规模图计算跨架构框架
CCF Transactions on High Performance Computing Pub Date : 2023-11-09 DOI: 10.1007/s42514-023-00172-w
Shiyang Li, Jingyu Zhu, Jiaxun Han, Yuting Peng, Zhuoran Wang, Xiaoli Gong, Gang Wang, Jin Zhang, Xuqiang Wang
{"title":"OneGraph: a cross-architecture framework for large-scale graph computing on GPUs based on oneAPI","authors":"Shiyang Li, Jingyu Zhu, Jiaxun Han, Yuting Peng, Zhuoran Wang, Xiaoli Gong, Gang Wang, Jin Zhang, Xuqiang Wang","doi":"10.1007/s42514-023-00172-w","DOIUrl":"https://doi.org/10.1007/s42514-023-00172-w","url":null,"abstract":"","PeriodicalId":29895,"journal":{"name":"CCF Transactions on High Performance Computing","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-11-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135241910","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
BSPADMM: block splitting proximal ADMM for sparse representation with strong scalability BSPADMM:块分割近端ADMM,具有较强的可扩展性
CCF Transactions on High Performance Computing Pub Date : 2023-10-07 DOI: 10.1007/s42514-023-00164-w
Yidong Chen, Jingshan Pan, Zidong Han, Yonghong Hu, Meng Guo, Zhonghua Lu
{"title":"BSPADMM: block splitting proximal ADMM for sparse representation with strong scalability","authors":"Yidong Chen, Jingshan Pan, Zidong Han, Yonghong Hu, Meng Guo, Zhonghua Lu","doi":"10.1007/s42514-023-00164-w","DOIUrl":"https://doi.org/10.1007/s42514-023-00164-w","url":null,"abstract":"","PeriodicalId":29895,"journal":{"name":"CCF Transactions on High Performance Computing","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135252112","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Conflict-aware workload co-execution on SX-aurora TSUBASA SX-aurora TSUBASA上的冲突感知工作负载协同执行
CCF Transactions on High Performance Computing Pub Date : 2023-10-05 DOI: 10.1007/s42514-023-00171-x
Riku Nunokawa, Yoichi Shimomura, Mulya Agung, Ryusuke Egawa, Hiroyuki Takizawa
{"title":"Conflict-aware workload co-execution on SX-aurora TSUBASA","authors":"Riku Nunokawa, Yoichi Shimomura, Mulya Agung, Ryusuke Egawa, Hiroyuki Takizawa","doi":"10.1007/s42514-023-00171-x","DOIUrl":"https://doi.org/10.1007/s42514-023-00171-x","url":null,"abstract":"Abstract NEC SX-Aurora TSUBASA (SX-AT) is the latest vector supercomputer, consisting of host processors called Vector Hosts (VHs) and vector processors called Vector Engines (VEs). The goal of this work is to simultaneously use both VHs and VEs to increase the resource utilization and improve the system throughput by co-executing more workloads. One difficulty is that performance interferences among VH and VE workloads could occur because they share some computing resources and potentially compete to use the same resource at the same time, so-called resource conflicts. To achieve efficient workload co-execution, first, this paper experimentally investigates the performance interference between a VH and a VE, when each of the two processors executes a different workload. It is empirically shown that the frequency of system calls from the VE workload could be a good indicator to predict if the co-execution could cause severe performance interference, even though monitoring system calls requires a huge runtime overhead and it is impractical to simply use it for decision making of co-execution. Then, this paper proposes a workload co-execution strategy based on a practical approach to identifying a pair of VE and VH workloads that could cause severe performance interferences. Our evaluation results clearly demonstrate that the system call frequency can be used to predict if the workload can affect the performance of another co-executing workload, and VH’s CPU load can be a good approximation of the system call frequency. The proposed approach based on the CPU loads could accurately identify a pair of workloads causing frequent resource conflicts, and thus reduce the risk of severe performance interferences between co-executing workloads on an SX-AT system, resulting in shorter makespan without significantly increasing the turn-around time.","PeriodicalId":29895,"journal":{"name":"CCF Transactions on High Performance Computing","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-10-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135480691","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
FILL: a heterogeneous resource scheduling system addressing the low throughput problem in GROMACS FILL:一个异构资源调度系统,解决GROMACS中的低吞吐量问题
CCF Transactions on High Performance Computing Pub Date : 2023-09-23 DOI: 10.1007/s42514-023-00169-5
Yueyuan Zhou, ZiYi Ren, En Shao, Lixian Ma, Qiang Hu, Leping Wang, Guangming Tan
{"title":"FILL: a heterogeneous resource scheduling system addressing the low throughput problem in GROMACS","authors":"Yueyuan Zhou, ZiYi Ren, En Shao, Lixian Ma, Qiang Hu, Leping Wang, Guangming Tan","doi":"10.1007/s42514-023-00169-5","DOIUrl":"https://doi.org/10.1007/s42514-023-00169-5","url":null,"abstract":"","PeriodicalId":29895,"journal":{"name":"CCF Transactions on High Performance Computing","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135959455","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ConvDarts: a fast and exact convolutional algorithm selector for deep learning frameworks convdart:一个快速、精确的深度学习框架卷积算法选择器
CCF Transactions on High Performance Computing Pub Date : 2023-09-20 DOI: 10.1007/s42514-023-00167-7
Lu Bai, Weixing Ji, Qinyuan Li, Xilai Yao, Wei Xin, Wanyi Zhu
{"title":"ConvDarts: a fast and exact convolutional algorithm selector for deep learning frameworks","authors":"Lu Bai, Weixing Ji, Qinyuan Li, Xilai Yao, Wei Xin, Wanyi Zhu","doi":"10.1007/s42514-023-00167-7","DOIUrl":"https://doi.org/10.1007/s42514-023-00167-7","url":null,"abstract":"","PeriodicalId":29895,"journal":{"name":"CCF Transactions on High Performance Computing","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136308147","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Uncovering the performance bottleneck of modern HPC processor with static code analyzer: a case study on Kunpeng 920 用静态代码分析器揭示现代高性能计算处理器的性能瓶颈——以鲲鹏920为例
CCF Transactions on High Performance Computing Pub Date : 2023-09-15 DOI: 10.1007/s42514-023-00160-0
Shaojie Tan, Qingcai Jiang, Zhenwei Cao, Xiaoyu Hao, Junshi Chen, Hong An
{"title":"Uncovering the performance bottleneck of modern HPC processor with static code analyzer: a case study on Kunpeng 920","authors":"Shaojie Tan, Qingcai Jiang, Zhenwei Cao, Xiaoyu Hao, Junshi Chen, Hong An","doi":"10.1007/s42514-023-00160-0","DOIUrl":"https://doi.org/10.1007/s42514-023-00160-0","url":null,"abstract":"","PeriodicalId":29895,"journal":{"name":"CCF Transactions on High Performance Computing","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135395212","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An efficient cloud-based elastic RDMA protocol for HPC applications 一个高效的基于云的弹性RDMA协议,用于高性能计算应用
CCF Transactions on High Performance Computing Pub Date : 2023-09-15 DOI: 10.1007/s42514-023-00170-y
Hang Cao, Cheng Xu, Yunqi Han, Muhui Lin, Kai Shen, Geng Wang, Jinhu Li, Xiangzheng Sun, Ronghui He, Liang You, Hang Yang, Xiantao Zhang
{"title":"An efficient cloud-based elastic RDMA protocol for HPC applications","authors":"Hang Cao, Cheng Xu, Yunqi Han, Muhui Lin, Kai Shen, Geng Wang, Jinhu Li, Xiangzheng Sun, Ronghui He, Liang You, Hang Yang, Xiantao Zhang","doi":"10.1007/s42514-023-00170-y","DOIUrl":"https://doi.org/10.1007/s42514-023-00170-y","url":null,"abstract":"","PeriodicalId":29895,"journal":{"name":"CCF Transactions on High Performance Computing","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135436699","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Mixed-precision block incomplete sparse approximate preconditioner on Tensor core 张量核上的混合精度块不完全稀疏近似预调节器
CCF Transactions on High Performance Computing Pub Date : 2023-09-13 DOI: 10.1007/s42514-023-00165-9
Haoyuan Zhang, Wenpeng Ma, Wu Yuan, Jian Zhang, Zhonghua Lu
{"title":"Mixed-precision block incomplete sparse approximate preconditioner on Tensor core","authors":"Haoyuan Zhang, Wenpeng Ma, Wu Yuan, Jian Zhang, Zhonghua Lu","doi":"10.1007/s42514-023-00165-9","DOIUrl":"https://doi.org/10.1007/s42514-023-00165-9","url":null,"abstract":"","PeriodicalId":29895,"journal":{"name":"CCF Transactions on High Performance Computing","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135740789","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信