Exploratory data science on supercomputers for quantum mechanical calculations

IF 2.9 Q3 CHEMISTRY, PHYSICAL
William Dawson, Louis Beal, Laura E Ratcliff, Martina Stella, Takahito Nakajima, Luigi Genovese
{"title":"Exploratory data science on supercomputers for quantum mechanical calculations","authors":"William Dawson, Louis Beal, Laura E Ratcliff, Martina Stella, Takahito Nakajima, Luigi Genovese","doi":"10.1088/2516-1075/ad4b80","DOIUrl":null,"url":null,"abstract":"Literate programming—the bringing together of program code and natural language narratives—has become a ubiquitous approach in the realm of data science. This methodology is appealing as well for the domain of Density Functional Theory (DFT) calculations, particularly for interactively developing new methodologies and workflows. However, effective use of literate programming is hampered by old programming paradigms and the difficulties associated with using high performance computing (HPC) resources. Here we present two Python libraries that aim to remove these hurdles. First, we describe the PyBigDFT library, which can be used to setup materials or molecular systems and provides high-level access to the wavelet based BigDFT code. We then present the related <monospace>remotemanager</monospace> library, which is able to serialize and execute arbitrary Python functions on remote supercomputers. We show how together these libraries enable transparent access to HPC based DFT calculations and can serve as building blocks for rapid prototyping and data exploration.","PeriodicalId":42419,"journal":{"name":"Electronic Structure","volume":null,"pages":null},"PeriodicalIF":2.9000,"publicationDate":"2024-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Electronic Structure","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1088/2516-1075/ad4b80","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"CHEMISTRY, PHYSICAL","Score":null,"Total":0}
引用次数: 0

Abstract

Literate programming—the bringing together of program code and natural language narratives—has become a ubiquitous approach in the realm of data science. This methodology is appealing as well for the domain of Density Functional Theory (DFT) calculations, particularly for interactively developing new methodologies and workflows. However, effective use of literate programming is hampered by old programming paradigms and the difficulties associated with using high performance computing (HPC) resources. Here we present two Python libraries that aim to remove these hurdles. First, we describe the PyBigDFT library, which can be used to setup materials or molecular systems and provides high-level access to the wavelet based BigDFT code. We then present the related remotemanager library, which is able to serialize and execute arbitrary Python functions on remote supercomputers. We show how together these libraries enable transparent access to HPC based DFT calculations and can serve as building blocks for rapid prototyping and data exploration.
超级计算机上用于量子力学计算的探索性数据科学
有文字的编程--将程序代码和自然语言叙述结合在一起--已成为数据科学领域无处不在的方法。这种方法对于密度泛函理论(DFT)计算领域也很有吸引力,尤其是在交互式开发新方法和工作流程方面。然而,旧的编程范式和使用高性能计算(HPC)资源的困难阻碍了有文化编程的有效使用。在此,我们介绍两个旨在消除这些障碍的 Python 库。首先,我们介绍 PyBigDFT 库,该库可用于设置材料或分子系统,并提供对基于小波的 BigDFT 代码的高级访问。然后,我们介绍了相关的 remotemanager 库,它能够在远程超级计算机上序列化和执行任意 Python 函数。我们展示了这些库如何共同实现对基于 HPC 的 DFT 计算的透明访问,以及如何作为快速原型开发和数据探索的构建模块。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
3.70
自引率
11.50%
发文量
46
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信