AI-driven discovery of high-performance corrosion inhibitors using a BERT-GPT framework for molecular generation

IF 7.4 1区 材料科学 Q1 MATERIALS SCIENCE, MULTIDISCIPLINARY
Haiyan Gong , Lingwei Ma , Diandian Liu , Dawei Zhang
{"title":"AI-driven discovery of high-performance corrosion inhibitors using a BERT-GPT framework for molecular generation","authors":"Haiyan Gong ,&nbsp;Lingwei Ma ,&nbsp;Diandian Liu ,&nbsp;Dawei Zhang","doi":"10.1016/j.corsci.2025.113327","DOIUrl":null,"url":null,"abstract":"<div><div>The discovery of high-efficiency corrosion inhibitors is constrained by the structural diversity of existing molecular entities. This study introduces a framework that integrates BERT and GPT models to predict, screen, and generate molecules with potential corrosion inhibiting properties. Predictive models for inhibition efficiency (IE) and toxicity on carbon steel surfaces in 1 M HCl environments were built by utilizing chemical language embeddings and molecular descriptors. The screening of the PubChem library identified 1,336,971 candidate molecules with a predicted IE greater than 90 % at a 1 mM concentration and a predicted LD50 (lethal dose for 50 % of test animals) exceeding 2000 mg/kg (indicating low acute toxicity). Scaffold analysis highlighted an enrichment of critical scaffold, particularly 6 imidazole scaffolds in conjunction with property constraints (IE = 90 %, LD50 = 1000 mg/kg), informed the conditional generation of molecules using MolGPT. A total of 310,962 valid molecules were generated, with 25.17 % (78,258 molecules) satisfying all specified conditions, which is much higher than the hit ratio (4.08 %) of screening from PubChem database. This study demonstrates an AI-driven molecular generation for corrosion inhibitor discovery.</div></div>","PeriodicalId":290,"journal":{"name":"Corrosion Science","volume":"257 ","pages":"Article 113327"},"PeriodicalIF":7.4000,"publicationDate":"2025-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Corrosion Science","FirstCategoryId":"88","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0010938X25006559","RegionNum":1,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MATERIALS SCIENCE, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

The discovery of high-efficiency corrosion inhibitors is constrained by the structural diversity of existing molecular entities. This study introduces a framework that integrates BERT and GPT models to predict, screen, and generate molecules with potential corrosion inhibiting properties. Predictive models for inhibition efficiency (IE) and toxicity on carbon steel surfaces in 1 M HCl environments were built by utilizing chemical language embeddings and molecular descriptors. The screening of the PubChem library identified 1,336,971 candidate molecules with a predicted IE greater than 90 % at a 1 mM concentration and a predicted LD50 (lethal dose for 50 % of test animals) exceeding 2000 mg/kg (indicating low acute toxicity). Scaffold analysis highlighted an enrichment of critical scaffold, particularly 6 imidazole scaffolds in conjunction with property constraints (IE = 90 %, LD50 = 1000 mg/kg), informed the conditional generation of molecules using MolGPT. A total of 310,962 valid molecules were generated, with 25.17 % (78,258 molecules) satisfying all specified conditions, which is much higher than the hit ratio (4.08 %) of screening from PubChem database. This study demonstrates an AI-driven molecular generation for corrosion inhibitor discovery.
使用BERT-GPT框架进行分子生成的高性能缓蚀剂的ai驱动发现
高效缓蚀剂的发现受到现有分子实体结构多样性的限制。本研究引入了一个框架,该框架集成了BERT和GPT模型,用于预测、筛选和生成具有潜在缓蚀性能的分子。利用化学语言嵌入和分子描述符建立了1 M HCl环境下碳钢表面的缓蚀效率(IE)和毒性预测模型。PubChem库的筛选鉴定出1,336,971个候选分子,在1 mM浓度下,预测IE大于90 %,预测LD50(50% %试验动物的致死剂量)超过2000 mg/kg(表明低急性毒性)。支架分析强调了关键支架的富集,特别是6个咪唑支架,并结合性质限制(IE = 90 %,LD50 = 1000 mg/kg),通知了使用MolGPT有条件地生成分子。总共生成了310,962个有效分子,其中25.17 %(78,258个分子)满足所有指定条件,远远高于PubChem数据库筛选的命中率(4.08 %)。该研究展示了人工智能驱动的缓蚀剂分子生成技术。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Corrosion Science
Corrosion Science 工程技术-材料科学:综合
CiteScore
13.60
自引率
18.10%
发文量
763
审稿时长
46 days
期刊介绍: Corrosion occurrence and its practical control encompass a vast array of scientific knowledge. Corrosion Science endeavors to serve as the conduit for the exchange of ideas, developments, and research across all facets of this field, encompassing both metallic and non-metallic corrosion. The scope of this international journal is broad and inclusive. Published papers span from highly theoretical inquiries to essentially practical applications, covering diverse areas such as high-temperature oxidation, passivity, anodic oxidation, biochemical corrosion, stress corrosion cracking, and corrosion control mechanisms and methodologies. This journal publishes original papers and critical reviews across the spectrum of pure and applied corrosion, material degradation, and surface science and engineering. It serves as a crucial link connecting metallurgists, materials scientists, and researchers investigating corrosion and degradation phenomena. Join us in advancing knowledge and understanding in the vital field of corrosion science.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信