Identifying pitfalls in automatic parallelization of NAS parallel benchmarks

S. Prema, R. Jehadeesan, B. K. Panigrahi
Published in: 2017 National Conference on Parallel Computing Technologies (PARCOMPTECH)
Publication date: 2017-02-01
DOI: 10.1109/PARCOMPTECH.2017.8068329 (https://doi.org/10.1109/PARCOMPTECH.2017.8068329)
Citations: 11

Abstract

This paper examines OpenMP-based auto-parallelizers and the limitations they encounter when parallelizing the NAS Parallel Benchmarks. It describes the issues the parallelizers face during parallelization and the resolutions that overcome them. Compute-intensive loops are pinpointed using Gprof, and the problematic loops within the hotspot regions are identified. The work concentrates on identifying the pitfalls within the located hotspots and providing solutions for those cases. The measured speedup is analyzed and the reasons behind it are explained. The paper underlines the need for a user-interactive environment that highlights the problems arising during parallelization. It also underscores the need to keep manual code changes to a minimum when resolving problematic code sections and making them amenable to parallelization.