Can Leng , Xuguang Chen , Jie Liu , Chunye Gong , Bo Yang , Zhuo Tang , Wangdong Yang , Wei-Qing Huang , Yi-Ge Zhou , Mengxia Mo , Kenli Li , Keqin Li
{"title":"Fully automated high-throughput computer-based catalytic material screening framework and its application on the new-generation Tianhe supercomputer","authors":"Can Leng , Xuguang Chen , Jie Liu , Chunye Gong , Bo Yang , Zhuo Tang , Wangdong Yang , Wei-Qing Huang , Yi-Ge Zhou , Mengxia Mo , Kenli Li , Keqin Li","doi":"10.1016/j.commatsci.2025.113775","DOIUrl":null,"url":null,"abstract":"<div><div>The integration of high-performance computing with machine learning (ML) has established a transformative scientific paradigm that significantly enhances the efficiency of material discovery, particularly in the search for catalysts in alternative energy research. However, significant challenges remain in the utilization of available computational resources to accelerate the screening of catalyst materials. In this study, we implement a high-throughput framework on the new-generation Tianhe supercomputer, featuring the development of a Ping-Fault Recovery algorithm, single-task optimization for Density Functional Theory (DFT) to maximize efficiency, and enhanced task scheduling using a two-level scheduling strategy to ensure efficient utilization of the abundant computational resources of the supercomputer. This framework facilitates the identification of 2,028 candidate surfaces across 868 intermetallics from 2,713,897 unique adsorption sites, achieving a screening speed 193 times faster than traditional methods. Alloys composed of Mo, Nb, and V are used as case studies to provide a detailed elucidation of the process of identifying the most effective catalytic surfaces. The framework achieved the best single-day candidate hit performance on 18,106 nodes, completing in one day what previously took a year. This supercomputer-based framework optimizes the use of computational resources, driving innovation in catalyst material discovery.</div></div>","PeriodicalId":10650,"journal":{"name":"Computational Materials Science","volume":"252 ","pages":"Article 113775"},"PeriodicalIF":3.1000,"publicationDate":"2025-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computational Materials Science","FirstCategoryId":"88","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0927025625001181","RegionNum":3,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATERIALS SCIENCE, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
The integration of high-performance computing with machine learning (ML) has established a transformative scientific paradigm that significantly enhances the efficiency of material discovery, particularly in the search for catalysts in alternative energy research. However, significant challenges remain in the utilization of available computational resources to accelerate the screening of catalyst materials. In this study, we implement a high-throughput framework on the new-generation Tianhe supercomputer, featuring the development of a Ping-Fault Recovery algorithm, single-task optimization for Density Functional Theory (DFT) to maximize efficiency, and enhanced task scheduling using a two-level scheduling strategy to ensure efficient utilization of the abundant computational resources of the supercomputer. This framework facilitates the identification of 2,028 candidate surfaces across 868 intermetallics from 2,713,897 unique adsorption sites, achieving a screening speed 193 times faster than traditional methods. Alloys composed of Mo, Nb, and V are used as case studies to provide a detailed elucidation of the process of identifying the most effective catalytic surfaces. The framework achieved the best single-day candidate hit performance on 18,106 nodes, completing in one day what previously took a year. This supercomputer-based framework optimizes the use of computational resources, driving innovation in catalyst material discovery.
期刊介绍:
The goal of Computational Materials Science is to report on results that provide new or unique insights into, or significantly expand our understanding of, the properties of materials or phenomena associated with their design, synthesis, processing, characterization, and utilization. To be relevant to the journal, the results should be applied or applicable to specific material systems that are discussed within the submission.