Improved and Interpretable Prediction of Cytochrome P450-Mediated Metabolism by Molecule-Level Graph Modeling and Subgraph Information Bottlenecks.

IF 5.6 2区 化学 Q1 CHEMISTRY, MEDICINAL
Yi Li, Qin-Wei Xu, Guo-Lei Jian, Xiao-Ling Zhang, Hua Wang
{"title":"Improved and Interpretable Prediction of Cytochrome P450-Mediated Metabolism by Molecule-Level Graph Modeling and Subgraph Information Bottlenecks.","authors":"Yi Li, Qin-Wei Xu, Guo-Lei Jian, Xiao-Ling Zhang, Hua Wang","doi":"10.1021/acs.jcim.4c01632","DOIUrl":null,"url":null,"abstract":"<p><p>Accurately identifying sites of metabolism (SoM) mediated by cytochrome P450 (CYP) enzymes, which are responsible for drug metabolism in the body, is critical in the early stage of drug discovery and development. Current computational methods for CYP-mediated SoM prediction face several challenges, including limitations to traditional machine learning models at the atomic level, heavy reliance on complex feature engineering, and the lack of interpretability relevant to medicinal chemistry. Here, we propose GraphCySoM, a novel molecule-level modeling approach based on graph neural networks, utilizing lightweight features and interpretable annotations on substructures, to effectively and interpretably predict CYP-mediated SoM. Unlike computationally expensive atomic descriptors derived from resource-intensive chemistry or even quantum chemistry calculations, we emphasize that graph-based molecular modeling initialized solely with lightweight features enables the adaptive learning of molecular topology through message-passing mechanisms combined with various aggregation kernels. Extensive ablation experiments demonstrate that GraphCySoM significantly outperforms baseline models and achieves superior performance compared with competing methods while exhibiting advantages in computational efficiency. Moreover, the attention mechanism and subgraph information bottlenecks are incorporated to analyze node importance and feature significance, resulting in mining substructures associated with the SoM. To the best of our knowledge, this is the first comprehensive study of CYP-mediated SoM using molecule-level modeling and interpretable technology. Our method achieves new state-of-the-art performance and provides potential insights into the molecular and pharmacological mechanisms underlying drug metabolism catalyzed by CYP enzymes. All source files and trained models are freely available at https://github.com/liyigerry/GraphCySoM.</p>","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":" ","pages":""},"PeriodicalIF":5.6000,"publicationDate":"2024-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Chemical Information and Modeling ","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1021/acs.jcim.4c01632","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MEDICINAL","Score":null,"Total":0}
引用次数: 0

Abstract

Accurately identifying sites of metabolism (SoM) mediated by cytochrome P450 (CYP) enzymes, which are responsible for drug metabolism in the body, is critical in the early stage of drug discovery and development. Current computational methods for CYP-mediated SoM prediction face several challenges, including limitations to traditional machine learning models at the atomic level, heavy reliance on complex feature engineering, and the lack of interpretability relevant to medicinal chemistry. Here, we propose GraphCySoM, a novel molecule-level modeling approach based on graph neural networks, utilizing lightweight features and interpretable annotations on substructures, to effectively and interpretably predict CYP-mediated SoM. Unlike computationally expensive atomic descriptors derived from resource-intensive chemistry or even quantum chemistry calculations, we emphasize that graph-based molecular modeling initialized solely with lightweight features enables the adaptive learning of molecular topology through message-passing mechanisms combined with various aggregation kernels. Extensive ablation experiments demonstrate that GraphCySoM significantly outperforms baseline models and achieves superior performance compared with competing methods while exhibiting advantages in computational efficiency. Moreover, the attention mechanism and subgraph information bottlenecks are incorporated to analyze node importance and feature significance, resulting in mining substructures associated with the SoM. To the best of our knowledge, this is the first comprehensive study of CYP-mediated SoM using molecule-level modeling and interpretable technology. Our method achieves new state-of-the-art performance and provides potential insights into the molecular and pharmacological mechanisms underlying drug metabolism catalyzed by CYP enzymes. All source files and trained models are freely available at https://github.com/liyigerry/GraphCySoM.

通过分子级图谱建模和子图谱信息瓶颈,改进细胞色素 P450 介导的新陈代谢的可解释性预测。
准确识别由细胞色素 P450(CYP)酶介导的代谢位点(SoM)对于药物发现和开发的早期阶段至关重要。目前用于 CYP 介导的 SoM 预测的计算方法面临着一些挑战,包括传统机器学习模型在原子水平上的局限性、对复杂特征工程的严重依赖以及缺乏与药物化学相关的可解释性。在此,我们提出了基于图神经网络的新型分子级建模方法 GraphCySoM,该方法利用轻量级特征和可解释的子结构注释,有效且可解释地预测 CYP 介导的 SoM。与从资源密集型化学甚至量子化学计算中得出的计算昂贵的原子描述符不同,我们强调基于图的分子建模仅用轻量级特征初始化,通过消息传递机制结合各种聚合内核实现分子拓扑的自适应学习。广泛的消融实验证明,GraphCySoM 的性能明显优于基线模型,与其他竞争方法相比性能更优,同时在计算效率方面也有优势。此外,GraphCySoM 还结合了注意力机制和子图信息瓶颈来分析节点重要性和特征重要性,从而挖掘出与 SoM 相关的子结构。据我们所知,这是首次利用分子级建模和可解释技术对 CYP 介导的 SoM 进行全面研究。我们的方法达到了最先进的新性能,为了解 CYP 酶催化药物代谢的分子和药理机制提供了潜在的见解。所有源文件和训练好的模型都可在 https://github.com/liyigerry/GraphCySoM 免费获取。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
9.80
自引率
10.70%
发文量
529
审稿时长
1.4 months
期刊介绍: The Journal of Chemical Information and Modeling publishes papers reporting new methodology and/or important applications in the fields of chemical informatics and molecular modeling. Specific topics include the representation and computer-based searching of chemical databases, molecular modeling, computer-aided molecular design of new materials, catalysts, or ligands, development of new computational methods or efficient algorithms for chemical software, and biopharmaceutical chemistry including analyses of biological activity and other issues related to drug discovery. Astute chemists, computer scientists, and information specialists look to this monthly’s insightful research studies, programming innovations, and software reviews to keep current with advances in this integral, multidisciplinary field. As a subscriber you’ll stay abreast of database search systems, use of graph theory in chemical problems, substructure search systems, pattern recognition and clustering, analysis of chemical and physical data, molecular modeling, graphics and natural language interfaces, bibliometric and citation analysis, and synthesis design and reactions databases.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信