基于专家引导的分子性质预测子结构信息瓶颈。

IF 5.3 2区 化学 Q1 CHEMISTRY, MEDICINAL
Tianyi Jiang,Qiang Yao,Zeyu Wang,Xiaoze Bao,Shanqing Yu,Qi Xuan
{"title":"基于专家引导的分子性质预测子结构信息瓶颈。","authors":"Tianyi Jiang,Qiang Yao,Zeyu Wang,Xiaoze Bao,Shanqing Yu,Qi Xuan","doi":"10.1021/acs.jcim.5c00456","DOIUrl":null,"url":null,"abstract":"Molecular property prediction plays a crucial role in cheminformatics, yet existing methods are constrained by data scarcity and molecular structural heterogeneity. The Mixture of Experts (MoE) framework adopts a divide-and-conquer approach by partitioning the input space and employing expert models. However, current methods primarily rely on scaffold or atomic-level information, often neglecting fine-grained features such as functional groups. Moreover, existing MoE models lack effective mechanisms to filter redundant and noisy information, limiting prediction accuracy and generalization. To address these challenges, we propose a novel Expert-Guided Substructure Information Bottleneck (ESIB-Mol) framework that integrates MoE learning with the Information Bottleneck (IB) principle to optimize molecular representation learning. ESIB-Mol employs substructure-specific experts to focus on key molecular scaffolds and functional groups, which play a crucial role in determining molecular properties such as bioactivity and pharmacokinetics. Meanwhile, the IB principle is leveraged to filter out redundant and irrelevant information, thereby enhancing prediction accuracy and interpretability. Additionally, a dynamic gating mechanism adaptively assigns molecules to the most relevant expert, optimizing computational efficiency. Extensive experiments on benchmark data sets demonstrate the effectiveness of ESIB-Mol, highlighting its superior performance in molecular property prediction.","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":"71 1","pages":""},"PeriodicalIF":5.3000,"publicationDate":"2025-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Expert-Guided Substructure Information Bottleneck for Molecular Property Prediction.\",\"authors\":\"Tianyi Jiang,Qiang Yao,Zeyu Wang,Xiaoze Bao,Shanqing Yu,Qi Xuan\",\"doi\":\"10.1021/acs.jcim.5c00456\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Molecular property prediction plays a crucial role in cheminformatics, yet existing methods are constrained by data scarcity and molecular structural heterogeneity. The Mixture of Experts (MoE) framework adopts a divide-and-conquer approach by partitioning the input space and employing expert models. However, current methods primarily rely on scaffold or atomic-level information, often neglecting fine-grained features such as functional groups. Moreover, existing MoE models lack effective mechanisms to filter redundant and noisy information, limiting prediction accuracy and generalization. To address these challenges, we propose a novel Expert-Guided Substructure Information Bottleneck (ESIB-Mol) framework that integrates MoE learning with the Information Bottleneck (IB) principle to optimize molecular representation learning. ESIB-Mol employs substructure-specific experts to focus on key molecular scaffolds and functional groups, which play a crucial role in determining molecular properties such as bioactivity and pharmacokinetics. Meanwhile, the IB principle is leveraged to filter out redundant and irrelevant information, thereby enhancing prediction accuracy and interpretability. Additionally, a dynamic gating mechanism adaptively assigns molecules to the most relevant expert, optimizing computational efficiency. Extensive experiments on benchmark data sets demonstrate the effectiveness of ESIB-Mol, highlighting its superior performance in molecular property prediction.\",\"PeriodicalId\":44,\"journal\":{\"name\":\"Journal of Chemical Information and Modeling \",\"volume\":\"71 1\",\"pages\":\"\"},\"PeriodicalIF\":5.3000,\"publicationDate\":\"2025-07-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Chemical Information and Modeling \",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://doi.org/10.1021/acs.jcim.5c00456\",\"RegionNum\":2,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CHEMISTRY, MEDICINAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Chemical Information and Modeling ","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1021/acs.jcim.5c00456","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MEDICINAL","Score":null,"Total":0}
引用次数: 0

摘要

分子性质预测在化学信息学中起着至关重要的作用,但现有的方法受到数据缺乏和分子结构异质性的限制。混合专家(MoE)框架通过划分输入空间和使用专家模型,采用分而治之的方法。然而,目前的方法主要依赖于支架或原子级别的信息,往往忽略了细粒度的特征,如官能团。此外,现有的MoE模型缺乏有效的机制来过滤冗余和噪声信息,限制了预测的准确性和泛化。为了解决这些挑战,我们提出了一种新的专家引导子结构信息瓶颈(ESIB-Mol)框架,该框架将MoE学习与信息瓶颈(IB)原理相结合,以优化分子表征学习。ESIB-Mol聘请亚结构特异性专家专注于关键的分子支架和官能团,它们在确定分子特性(如生物活性和药代动力学)中起着至关重要的作用。同时,利用IB原理过滤冗余和不相关信息,提高预测精度和可解释性。此外,动态门控机制自适应地将分子分配给最相关的专家,优化计算效率。在基准数据集上的大量实验证明了ESIB-Mol的有效性,突出了其在分子性质预测方面的优越性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Expert-Guided Substructure Information Bottleneck for Molecular Property Prediction.
Molecular property prediction plays a crucial role in cheminformatics, yet existing methods are constrained by data scarcity and molecular structural heterogeneity. The Mixture of Experts (MoE) framework adopts a divide-and-conquer approach by partitioning the input space and employing expert models. However, current methods primarily rely on scaffold or atomic-level information, often neglecting fine-grained features such as functional groups. Moreover, existing MoE models lack effective mechanisms to filter redundant and noisy information, limiting prediction accuracy and generalization. To address these challenges, we propose a novel Expert-Guided Substructure Information Bottleneck (ESIB-Mol) framework that integrates MoE learning with the Information Bottleneck (IB) principle to optimize molecular representation learning. ESIB-Mol employs substructure-specific experts to focus on key molecular scaffolds and functional groups, which play a crucial role in determining molecular properties such as bioactivity and pharmacokinetics. Meanwhile, the IB principle is leveraged to filter out redundant and irrelevant information, thereby enhancing prediction accuracy and interpretability. Additionally, a dynamic gating mechanism adaptively assigns molecules to the most relevant expert, optimizing computational efficiency. Extensive experiments on benchmark data sets demonstrate the effectiveness of ESIB-Mol, highlighting its superior performance in molecular property prediction.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
9.80
自引率
10.70%
发文量
529
审稿时长
1.4 months
期刊介绍: The Journal of Chemical Information and Modeling publishes papers reporting new methodology and/or important applications in the fields of chemical informatics and molecular modeling. Specific topics include the representation and computer-based searching of chemical databases, molecular modeling, computer-aided molecular design of new materials, catalysts, or ligands, development of new computational methods or efficient algorithms for chemical software, and biopharmaceutical chemistry including analyses of biological activity and other issues related to drug discovery. Astute chemists, computer scientists, and information specialists look to this monthly’s insightful research studies, programming innovations, and software reviews to keep current with advances in this integral, multidisciplinary field. As a subscriber you’ll stay abreast of database search systems, use of graph theory in chemical problems, substructure search systems, pattern recognition and clustering, analysis of chemical and physical data, molecular modeling, graphics and natural language interfaces, bibliometric and citation analysis, and synthesis design and reactions databases.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信