In-silico 3D molecular editing through physics-informed and preference-aligned generative foundation models.

IF 15.7 1区 综合性期刊 Q1 MULTIDISCIPLINARY SCIENCES
Xiaohan Lin, Yijie Xia, Yanheng Li, Yu-Peng Huang, Shuo Liu, Jun Zhang, Yi Qin Gao
{"title":"In-silico 3D molecular editing through physics-informed and preference-aligned generative foundation models.","authors":"Xiaohan Lin, Yijie Xia, Yanheng Li, Yu-Peng Huang, Shuo Liu, Jun Zhang, Yi Qin Gao","doi":"10.1038/s41467-025-61323-x","DOIUrl":null,"url":null,"abstract":"<p><p>Generating molecular structures towards desired properties is a critical task in computer-aided drug and material design. As special 3D entities, molecules inherit non-trivial physical complexity, and many intrinsic properties may not be learnable through pure data-driven approaches, hindering the transaction of powerful generative artificial intelligence (GenAI) to this field. To avoid existing molecular GenAI's heavy reliance on domain-specific models and priors, in this research, we derive theoretical guidelines to bridge the methodological gap between GenAI for images and molecules, allowing pre-training of foundation models for 3D molecular generation. Difficulties due to symmetry, stability and entropy, which are critical for molecules, are overcome through a simple and model-agnostic training protocol. Moreover, we apply physics-informed strategies to force MolEdit, a pre-trained multimodal molecular GenAI, to obey physics laws and align with contextual preferences, and thus suppress undesired model hallucinations. MolEdit can generate valid molecules with comprehensive symmetry, strikes a better balance between configuration stability and conformer diversity, and supports complicated 3D scaffolds which frustrate other methods. Furthermore, MolEdit is applicable for zero-shot lead optimization and linker design following contextual and geometrical specifications. Collectively, as a foundation model, MolEdit offers flexibility and developability for AI-aided editing and manipulation of molecules serving various purposes.</p>","PeriodicalId":19066,"journal":{"name":"Nature Communications","volume":"16 1","pages":"6043"},"PeriodicalIF":15.7000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12217601/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nature Communications","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1038/s41467-025-61323-x","RegionNum":1,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Generating molecular structures towards desired properties is a critical task in computer-aided drug and material design. As special 3D entities, molecules inherit non-trivial physical complexity, and many intrinsic properties may not be learnable through pure data-driven approaches, hindering the transaction of powerful generative artificial intelligence (GenAI) to this field. To avoid existing molecular GenAI's heavy reliance on domain-specific models and priors, in this research, we derive theoretical guidelines to bridge the methodological gap between GenAI for images and molecules, allowing pre-training of foundation models for 3D molecular generation. Difficulties due to symmetry, stability and entropy, which are critical for molecules, are overcome through a simple and model-agnostic training protocol. Moreover, we apply physics-informed strategies to force MolEdit, a pre-trained multimodal molecular GenAI, to obey physics laws and align with contextual preferences, and thus suppress undesired model hallucinations. MolEdit can generate valid molecules with comprehensive symmetry, strikes a better balance between configuration stability and conformer diversity, and supports complicated 3D scaffolds which frustrate other methods. Furthermore, MolEdit is applicable for zero-shot lead optimization and linker design following contextual and geometrical specifications. Collectively, as a foundation model, MolEdit offers flexibility and developability for AI-aided editing and manipulation of molecules serving various purposes.

通过物理信息和偏好对齐的生成基础模型进行硅三维分子编辑。
在计算机辅助药物和材料设计中,生成符合所需性质的分子结构是一项关键任务。作为特殊的三维实体,分子继承了非平凡的物理复杂性,许多内在属性可能无法通过纯数据驱动的方法学习,阻碍了强大的生成式人工智能(GenAI)在该领域的交易。为了避免现有的分子GenAI对特定领域模型和先验的严重依赖,在本研究中,我们推导了理论指导方针,以弥合图像和分子GenAI之间的方法差距,允许对3D分子生成的基础模型进行预训练。由于对称性、稳定性和熵的困难,这对分子是至关重要的,通过一个简单的和模型不可知的训练协议克服。此外,我们应用物理信息策略来强制MolEdit,一个预先训练的多模态分子基因,服从物理定律并与上下文偏好保持一致,从而抑制不希望的模型幻觉。MolEdit可以生成具有全面对称性的有效分子,更好地平衡构型稳定性和构象多样性,支持复杂的3D支架,这是其他方法所无法做到的。此外,MolEdit适用于零射击引线优化和连接器设计遵循上下文和几何规格。总的来说,作为基础模型,MolEdit为人工智能辅助编辑和操作分子提供了灵活性和可发展性,用于各种目的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Nature Communications
Nature Communications Biological Science Disciplines-
CiteScore
24.90
自引率
2.40%
发文量
6928
审稿时长
3.7 months
期刊介绍: Nature Communications, an open-access journal, publishes high-quality research spanning all areas of the natural sciences. Papers featured in the journal showcase significant advances relevant to specialists in each respective field. With a 2-year impact factor of 16.6 (2022) and a median time of 8 days from submission to the first editorial decision, Nature Communications is committed to rapid dissemination of research findings. As a multidisciplinary journal, it welcomes contributions from biological, health, physical, chemical, Earth, social, mathematical, applied, and engineering sciences, aiming to highlight important breakthroughs within each domain.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信