极端规模网络中mpi集体通信建模的大规模并行仿真

M. Mubarak, C. Carothers, R. Ross, P. Carns
{"title":"极端规模网络中mpi集体通信建模的大规模并行仿真","authors":"M. Mubarak, C. Carothers, R. Ross, P. Carns","doi":"10.1109/WSC.2014.7020148","DOIUrl":null,"url":null,"abstract":"MPI collective operations are a critical and frequently used part of most MPI-based large-scale scientific applications. In previous work, we have enabled the Rensselaer Optimistic Simulation System (ROSS) to predict the performance of MPI point-to-point messaging on high-fidelity million-node network simulations of torus and dragonfly interconnects. The main contribution of this work is an extension of these torus and dragonfly network models to support MPI collective communication operations using the optimistic event scheduling capability of ROSS. We demonstrate that both small- and large-scale ROSS collective communication models can execute efficiency on massively parallel architectures. We validate the results of our collective communication model against the measurements from IBM Blue Gene/Q and Cray XC30 platforms using a data-driven approach on our network simulations. We also perform experiments to explore the impact of tree degree on the performance of collective communication operations in large-scale network models.","PeriodicalId":446873,"journal":{"name":"Proceedings of the Winter Simulation Conference 2014","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"15","resultStr":"{\"title\":\"Using massively parallel simulation for mpi collective communication modeling in extreme-scale networks\",\"authors\":\"M. Mubarak, C. Carothers, R. Ross, P. Carns\",\"doi\":\"10.1109/WSC.2014.7020148\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"MPI collective operations are a critical and frequently used part of most MPI-based large-scale scientific applications. In previous work, we have enabled the Rensselaer Optimistic Simulation System (ROSS) to predict the performance of MPI point-to-point messaging on high-fidelity million-node network simulations of torus and dragonfly interconnects. The main contribution of this work is an extension of these torus and dragonfly network models to support MPI collective communication operations using the optimistic event scheduling capability of ROSS. We demonstrate that both small- and large-scale ROSS collective communication models can execute efficiency on massively parallel architectures. We validate the results of our collective communication model against the measurements from IBM Blue Gene/Q and Cray XC30 platforms using a data-driven approach on our network simulations. We also perform experiments to explore the impact of tree degree on the performance of collective communication operations in large-scale network models.\",\"PeriodicalId\":446873,\"journal\":{\"name\":\"Proceedings of the Winter Simulation Conference 2014\",\"volume\":\"21 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-12-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"15\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the Winter Simulation Conference 2014\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/WSC.2014.7020148\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Winter Simulation Conference 2014","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WSC.2014.7020148","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 15

摘要

MPI集体操作是大多数基于MPI的大规模科学应用的关键和经常使用的部分。在之前的工作中,我们已经使Rensselaer乐观仿真系统(ROSS)能够在环面和蜻蜓互连的高保真百万节点网络仿真中预测MPI点对点消息传递的性能。这项工作的主要贡献是扩展了这些环面和蜻蜓网络模型,以支持使用ROSS的乐观事件调度能力的MPI集体通信操作。我们证明了小型和大型ROSS集体通信模型都可以在大规模并行架构上执行效率。我们使用数据驱动的方法在我们的网络模拟中验证了我们的集体通信模型的结果,对照IBM Blue Gene/Q和Cray XC30平台的测量结果。我们还进行了实验来探索树度对大规模网络模型中集体通信操作性能的影响。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Using massively parallel simulation for mpi collective communication modeling in extreme-scale networks
MPI collective operations are a critical and frequently used part of most MPI-based large-scale scientific applications. In previous work, we have enabled the Rensselaer Optimistic Simulation System (ROSS) to predict the performance of MPI point-to-point messaging on high-fidelity million-node network simulations of torus and dragonfly interconnects. The main contribution of this work is an extension of these torus and dragonfly network models to support MPI collective communication operations using the optimistic event scheduling capability of ROSS. We demonstrate that both small- and large-scale ROSS collective communication models can execute efficiency on massively parallel architectures. We validate the results of our collective communication model against the measurements from IBM Blue Gene/Q and Cray XC30 platforms using a data-driven approach on our network simulations. We also perform experiments to explore the impact of tree degree on the performance of collective communication operations in large-scale network models.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信