2022 IEEE/ACM Sixth International Workshop on Software Correctness for HPC Applications (Correctness)最新文献

Leveraging the Dynamic Program Structure Tree to Detect Data Races in OpenMP Programs 利用动态程序结构树来检测OpenMP程序中的数据竞争

2022 IEEE/ACM Sixth International Workshop on Software Correctness for HPC Applications (Correctness) Pub Date : 2022-11-01 DOI: 10.1109/Correctness56720.2022.00012

Lechen Yu, Feiyang Jin, Joachim Protze, Vivek Sarkar

{"title":"Leveraging the Dynamic Program Structure Tree to Detect Data Races in OpenMP Programs","authors":"Lechen Yu, Feiyang Jin, Joachim Protze, Vivek Sarkar","doi":"10.1109/Correctness56720.2022.00012","DOIUrl":"https://doi.org/10.1109/Correctness56720.2022.00012","url":null,"abstract":"OpenMP provides a rich set of constructs to support multiple paradigms of parallelization, e.g., single program multiple data (SPMD) and task parallelism. Integrating these disparate paradigms in a single execution model increases the complexity of OpenMP, making OpenMP programs prone to concurrency bugs such as data races. Inspired by the task-oriented execution model in the OpenMP spec, we extended SPD3, a data race detection algorithm designed for async-finish task parallelism to support OpenMP programs. We found that by extending SPD3's key data structure, the Dynamic Program Structure Tree (DPST), SPD3 can support the majority of OpenMP constructs. We have implemented a prototype, TSan-spd3, on top of Google's ThreadSanitizer (TSan). To conduct an apples-to-apples comparison with Archer, the state-of-the-art dynamic race detector for OpenMP programs, we compared TSan-spd3 with an Archer implementation that executes on the same version of TSan. In addition, we evaluated Archer in two modes, the default mode using the original TSan and the accelerated mode enabling the use of SIMD instructions in TSan. The evaluation was conducted on nine benchmarks from the BOTS and SPEC OMP2012 benchmark suites. The evaluation results show that in eight out of nine benchmarks TSan-spd3 achieved similar overhead with Archer, while TSan-spd3 can identify more potential races than Archer.","PeriodicalId":211482,"journal":{"name":"2022 IEEE/ACM Sixth International Workshop on Software Correctness for HPC Applications (Correctness)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130256976","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Static Local Concurrency Errors Detection in MPI-RMA Programs MPI-RMA程序中的静态本地并发错误检测

2022 IEEE/ACM Sixth International Workshop on Software Correctness for HPC Applications (Correctness) Pub Date : 2022-11-01 DOI: 10.1109/Correctness56720.2022.00008

Emmanuelle Saillard, Marc Sergent, Célia Tassadit Ait Kaci, Denis Barthou

{"title":"Static Local Concurrency Errors Detection in MPI-RMA Programs","authors":"Emmanuelle Saillard, Marc Sergent, Célia Tassadit Ait Kaci, Denis Barthou","doi":"10.1109/Correctness56720.2022.00008","DOIUrl":"https://doi.org/10.1109/Correctness56720.2022.00008","url":null,"abstract":"Communications are a critical part of HPC simulations, and one of the main focuses of application developers when scaling on supercomputers. While classical message passing (also called two-sided communications) is the dominant communication paradigm, one-sided communications are often praised to be efficient to overlap communications with computations, but challenging to program. Their usage is then generally abstracted through languages and memory abstractions to ease programming (e.g. PGAS). Therefore, little work has been done to help programmers use intermediate runtime layers, such as MPI-RMA, that is often reserved to expert programmers. Indeed, programming with MPI - RMA presents several challenges that require handling the asynchronous nature of one-sided communications to ensure the proper semantics of the program while ensuring its memory consistency. To help programmers detect memory errors such as race conditions as early as possible, this paper proposes a new static analysis of MPI - RMA codes that shows to the programmer the errors that can be detected at compile time. The detection is based on a novel local concurrency errors detection algorithm that tracks accesses through BFS searches on the Control Flow Graphs of a program. We show on several tests and an MPI-RMA variant of the GUPS benchmark that the static analysis allows to detect such errors on user codes. The error codes are integrated in the MPI Bugs Initiative open-source test suite.","PeriodicalId":211482,"journal":{"name":"2022 IEEE/ACM Sixth International Workshop on Software Correctness for HPC Applications (Correctness)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125702705","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 4

MiniKokkos: A Calculus of Portable Parallelism MiniKokkos:可移植并行性的演算

2022 IEEE/ACM Sixth International Workshop on Software Correctness for HPC Applications (Correctness) Pub Date : 2022-11-01 DOI: 10.1109/Correctness56720.2022.00010

Feiyang Jin, J. Jacobson, Samuel D. Pollard, Vivek Sarkar

引用次数: 1

Towards Verified Rounding Error Analysis for Stationary Iterative Methods 平稳迭代法舍入误差验证分析

2022 IEEE/ACM Sixth International Workshop on Software Correctness for HPC Applications (Correctness) Pub Date : 2022-11-01 DOI: 10.1109/Correctness56720.2022.00007

Ariel E. Kellison, Mohit Tekriwal, Jean-Baptiste Jeannin, G. Hulette

引用次数: 1

Message from the Correctness 2022 Workshop Chairs 来自2022正确性研讨会主席的信息

2022 IEEE/ACM Sixth International Workshop on Software Correctness for HPC Applications (Correctness) Pub Date : 2022-11-01 DOI: 10.1109/correctness56720.2022.00004

引用次数: 0

On-the-Fly Data Race Detection for MPI RMA Programs with MUST 带MUST的MPI RMA程序的动态数据竞争检测

2022 IEEE/ACM Sixth International Workshop on Software Correctness for HPC Applications (Correctness) Pub Date : 2022-11-01 DOI: 10.1109/Correctness56720.2022.00009

Simon Schwitanski, Joachim Jenke, Felix Tomski, C. Terboven, Matthias S. Müller

引用次数: 1

Correctness 2022 Workshop Organization 2022车间组织

2022 IEEE/ACM Sixth International Workshop on Software Correctness for HPC Applications (Correctness) Pub Date : 2022-11-01 DOI: 10.1109/correctness56720.2022.00005

引用次数: 0

Early Experience with Transformer-Based Similarity Analysis for DataRaceBench DataRaceBench中基于变压器的相似性分析的早期经验

2022 IEEE/ACM Sixth International Workshop on Software Correctness for HPC Applications (Correctness) Pub Date : 2022-11-01 DOI: 10.1109/Correctness56720.2022.00011

Winson X. Chen, T. Vanderbruggen, Pei-Hung Lin, C. Liao, M. Emani

{"title":"Early Experience with Transformer-Based Similarity Analysis for DataRaceBench","authors":"Winson X. Chen, T. Vanderbruggen, Pei-Hung Lin, C. Liao, M. Emani","doi":"10.1109/Correctness56720.2022.00011","DOIUrl":"https://doi.org/10.1109/Correctness56720.2022.00011","url":null,"abstract":"DataRaceBench (DRB) is a dedicated benchmark suite to evaluate tools aimed to find data race bugs in OpenMP programs. Using microbenchmarks with or without data races, DRB is able to generate standard quality metrics and provide systematic and quantitative assessments of data race detection tools. However, as the number of microbenchmarks grows, it is challenging to manually identify similar code patterns for DRB, within the context of identifying duplicated kernels or guiding the additions of new kernels. In this paper, we experiment with a transformer-based, deep learning approach to similarity analysis. A state-of-the-art transformer model, CodeBERT, has been adapted to find similar OpenMP code regions. We explore the challenges and the solutions when applying transformer-based similarity analysis to new source codes which are unseen by pre-trained transformers. Using comparative experiments of different variants of similarity analysis, we comment on the strengths and limitations of the transformer-based approach and point out future research directions.","PeriodicalId":211482,"journal":{"name":"2022 IEEE/ACM Sixth International Workshop on Software Correctness for HPC Applications (Correctness)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115427911","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Proposed Consistent Exception Handling for the BLAS and LAPACK 建议一致的异常处理BLAS和LAPACK

2022 IEEE/ACM Sixth International Workshop on Software Correctness for HPC Applications (Correctness) Pub Date : 2022-07-19 DOI: 10.1109/Correctness56720.2022.00006

J. Demmel, J. Dongarra, M. Gates, G. Henry, J. Langou, Xiaoye Li, P. Luszczek, W. Pereira, Jason Riedy, Cindy Rubio-Gonz'alez

{"title":"Proposed Consistent Exception Handling for the BLAS and LAPACK","authors":"J. Demmel, J. Dongarra, M. Gates, G. Henry, J. Langou, Xiaoye Li, P. Luszczek, W. Pereira, Jason Riedy, Cindy Rubio-Gonz'alez","doi":"10.1109/Correctness56720.2022.00006","DOIUrl":"https://doi.org/10.1109/Correctness56720.2022.00006","url":null,"abstract":"Numerical exceptions, which may be caused by overflow, operations like division by 0 or sqrt(−1), or conver-gence failures, are unavoidable in many cases, in particular when software is used on unforeseen and difficult inputs. As more aspects of society become automated e.g., self-driving cars, health monitors, and cyber-physical systems more generally, it is becoming increasingly important to design software that is resilient to exceptions, and that responds to them in a consistent way. Consistency is needed to allow users to build higher-level software that is also resilient and consistent (and so on recursively). In this paper we explore the design space of consistent exception handling for the widely used BLAS and LAPACK linear algebra libraries, pointing out a variety of instances of inconsistent exception handling in the current versions, and propose a new design that balances consistency, complexity, ease of use, and performance. Some compromises are needed, because there are preexisting inconsistencies that are outside our control, including in or between existing vendor BLAS implementations, different programming languages, and even compilers for the same programming language. And user requests from our surveys are quite diverse. We also propose our design as a possible model for other numerical software, and welcome comments on our design choices.","PeriodicalId":211482,"journal":{"name":"2022 IEEE/ACM Sixth International Workshop on Software Correctness for HPC Applications (Correctness)","volume":"164 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131887508","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3