增加对百亿亿次模拟的信任

Dorra Ben Khalifa, Xinyi Li, I. Laguna, M. Martel, G. Gopalakrishnan
{"title":"增加对百亿亿次模拟的信任","authors":"Dorra Ben Khalifa, Xinyi Li, I. Laguna, M. Martel, G. Gopalakrishnan","doi":"10.1109/XLOOP56614.2022.00010","DOIUrl":null,"url":null,"abstract":"In recent decades, High Performance Computing (HPC) and simulations have become determinant in many areas of engineering and science. Since many HPC applications rely extensively on floating-point arithmetic operations to solve computational problems, many kinds of numerical errors can be introduced during the program execution, leading to instability or reproducibility problems. One kind of these error sources is the loss of significant digits or cancellation which produces inaccurate results when two nearby numbers are subtracted. In this article, we present Candy, a new dynamic library based on code instrumentation that detects cancellations in numerical software. The originality of our method is to compute the number of significant bits of floating-point numbers in a generalized framework by attaching a shadow value in higher precision to each number. This helps to detect in an accurate way if a program suffers from cancellation problems and thus to increase the trust in large-scale HPC applications and exascale simulations. We evaluate Candy over a set of complex and real-world numerical applications. In addition, we compare our method against the state-of-art tool FPChecker in terms of efficiency, mixed precision results and speed of the analysis.","PeriodicalId":401106,"journal":{"name":"2022 4th Annual Workshop on Extreme-scale Experiment-in-the-Loop Computing (XLOOP)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Toward Increasing Trust in Exascale Simulations\",\"authors\":\"Dorra Ben Khalifa, Xinyi Li, I. Laguna, M. Martel, G. Gopalakrishnan\",\"doi\":\"10.1109/XLOOP56614.2022.00010\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In recent decades, High Performance Computing (HPC) and simulations have become determinant in many areas of engineering and science. Since many HPC applications rely extensively on floating-point arithmetic operations to solve computational problems, many kinds of numerical errors can be introduced during the program execution, leading to instability or reproducibility problems. One kind of these error sources is the loss of significant digits or cancellation which produces inaccurate results when two nearby numbers are subtracted. In this article, we present Candy, a new dynamic library based on code instrumentation that detects cancellations in numerical software. The originality of our method is to compute the number of significant bits of floating-point numbers in a generalized framework by attaching a shadow value in higher precision to each number. This helps to detect in an accurate way if a program suffers from cancellation problems and thus to increase the trust in large-scale HPC applications and exascale simulations. We evaluate Candy over a set of complex and real-world numerical applications. In addition, we compare our method against the state-of-art tool FPChecker in terms of efficiency, mixed precision results and speed of the analysis.\",\"PeriodicalId\":401106,\"journal\":{\"name\":\"2022 4th Annual Workshop on Extreme-scale Experiment-in-the-Loop Computing (XLOOP)\",\"volume\":\"25 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 4th Annual Workshop on Extreme-scale Experiment-in-the-Loop Computing (XLOOP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/XLOOP56614.2022.00010\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 4th Annual Workshop on Extreme-scale Experiment-in-the-Loop Computing (XLOOP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/XLOOP56614.2022.00010","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

近几十年来,高性能计算(HPC)和模拟在许多工程和科学领域已经成为决定性的因素。由于许多HPC应用程序广泛依赖于浮点算术运算来解决计算问题,因此在程序执行过程中可能引入许多类型的数值误差,从而导致不稳定或再现性问题。其中一种误差来源是有效数字的丢失或抵消,当两个相邻的数字相减时,会产生不准确的结果。在本文中,我们介绍了Candy,一个基于代码检测的动态库,用于检测数值软件中的取消。该方法的创新之处在于,通过在浮点数上附加更高精度的阴影值,在广义框架中计算浮点数的有效位数。这有助于以准确的方式检测程序是否存在取消问题,从而增加对大规模HPC应用程序和百亿亿次模拟的信任。我们通过一组复杂的和现实世界的数值应用程序来评估Candy。此外,我们将我们的方法与最先进的工具FPChecker在效率、混合精度结果和分析速度方面进行了比较。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Toward Increasing Trust in Exascale Simulations
In recent decades, High Performance Computing (HPC) and simulations have become determinant in many areas of engineering and science. Since many HPC applications rely extensively on floating-point arithmetic operations to solve computational problems, many kinds of numerical errors can be introduced during the program execution, leading to instability or reproducibility problems. One kind of these error sources is the loss of significant digits or cancellation which produces inaccurate results when two nearby numbers are subtracted. In this article, we present Candy, a new dynamic library based on code instrumentation that detects cancellations in numerical software. The originality of our method is to compute the number of significant bits of floating-point numbers in a generalized framework by attaching a shadow value in higher precision to each number. This helps to detect in an accurate way if a program suffers from cancellation problems and thus to increase the trust in large-scale HPC applications and exascale simulations. We evaluate Candy over a set of complex and real-world numerical applications. In addition, we compare our method against the state-of-art tool FPChecker in terms of efficiency, mixed precision results and speed of the analysis.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信