Ana Luisa Veroneze Solórzano, Philippe Olivier Alexandre Navaux, Lucas Mello Schnorr
{"title":"不同多核架构 Ondes3D 地震模拟器上的时间负载失衡问题","authors":"Ana Luisa Veroneze Solórzano, Philippe Olivier Alexandre Navaux, Lucas Mello Schnorr","doi":"arxiv-2409.11392","DOIUrl":null,"url":null,"abstract":"The variety of today's multicore architectures motivates researchers to\nexplore parallel scientific applications on different platforms. Load imbalance\nis one performance issue that can prejudice parallel applications from\nexploiting the computational power of these platforms. Ondes3D is a scientific\napplication for seismic wave simulation used to assess the geological impact of\nearthquakes. Its parallelism relies on applying a regular domain decomposition\nin the geological domain provided and distributing each sub-domain to MPI\nranks. Previous works investigate the significant spatial and temporal\nimbalance in Ondes3D and suggest new parallelization and load balancing\ntechniques to minimize them. However, none explored its execution on different\narchitectures. Our paper evaluates the performance of Ondes3D for two\nearthquake scenarios on eight different multicore architectures, including\nIntel, AMD, and ARM processors. We measure the load distribution per MPI rank,\nevaluate the temporal load imbalance, and compare the execution of the\napplication's kernels. Our results show that the temporal load imbalance in\nOndes3D depends on the architecture chosen, with some platforms minimizing such\nimbalance more effectively.","PeriodicalId":501291,"journal":{"name":"arXiv - CS - Performance","volume":"10 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Temporal Load Imbalance on Ondes3D Seismic Simulator for Different Multicore Architectures\",\"authors\":\"Ana Luisa Veroneze Solórzano, Philippe Olivier Alexandre Navaux, Lucas Mello Schnorr\",\"doi\":\"arxiv-2409.11392\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The variety of today's multicore architectures motivates researchers to\\nexplore parallel scientific applications on different platforms. Load imbalance\\nis one performance issue that can prejudice parallel applications from\\nexploiting the computational power of these platforms. Ondes3D is a scientific\\napplication for seismic wave simulation used to assess the geological impact of\\nearthquakes. Its parallelism relies on applying a regular domain decomposition\\nin the geological domain provided and distributing each sub-domain to MPI\\nranks. Previous works investigate the significant spatial and temporal\\nimbalance in Ondes3D and suggest new parallelization and load balancing\\ntechniques to minimize them. However, none explored its execution on different\\narchitectures. Our paper evaluates the performance of Ondes3D for two\\nearthquake scenarios on eight different multicore architectures, including\\nIntel, AMD, and ARM processors. We measure the load distribution per MPI rank,\\nevaluate the temporal load imbalance, and compare the execution of the\\napplication's kernels. Our results show that the temporal load imbalance in\\nOndes3D depends on the architecture chosen, with some platforms minimizing such\\nimbalance more effectively.\",\"PeriodicalId\":501291,\"journal\":{\"name\":\"arXiv - CS - Performance\",\"volume\":\"10 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Performance\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.11392\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Performance","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.11392","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Temporal Load Imbalance on Ondes3D Seismic Simulator for Different Multicore Architectures
The variety of today's multicore architectures motivates researchers to
explore parallel scientific applications on different platforms. Load imbalance
is one performance issue that can prejudice parallel applications from
exploiting the computational power of these platforms. Ondes3D is a scientific
application for seismic wave simulation used to assess the geological impact of
earthquakes. Its parallelism relies on applying a regular domain decomposition
in the geological domain provided and distributing each sub-domain to MPI
ranks. Previous works investigate the significant spatial and temporal
imbalance in Ondes3D and suggest new parallelization and load balancing
techniques to minimize them. However, none explored its execution on different
architectures. Our paper evaluates the performance of Ondes3D for two
earthquake scenarios on eight different multicore architectures, including
Intel, AMD, and ARM processors. We measure the load distribution per MPI rank,
evaluate the temporal load imbalance, and compare the execution of the
application's kernels. Our results show that the temporal load imbalance in
Ondes3D depends on the architecture chosen, with some platforms minimizing such
imbalance more effectively.