DRML-Ensemble:基于多层集合特征构建的药物再利用方法。

IF 2.1 4区 化学 Q4 BIOCHEMISTRY & MOLECULAR BIOLOGY
Mengfei Zhang, Hongjian He, Jiang Xie, Qing Nie
{"title":"DRML-Ensemble:基于多层集合特征构建的药物再利用方法。","authors":"Mengfei Zhang,&nbsp;Hongjian He,&nbsp;Jiang Xie,&nbsp;Qing Nie","doi":"10.1007/s00894-024-06087-9","DOIUrl":null,"url":null,"abstract":"<div><h3>Context</h3><p>Computational drug repurposing methods have been continuously developed in recent years to alleviate the high costs associated with drug development. As drug targets or the products of disease-related genes, proteins play an important role in drug repurposing. Although the potential has been demonstrated, heterogeneous graphs with proteins as independent nodes have yet to be studied, where extracting high-quality protein features from heterogeneous graphs poses a significant challenge. A novel drug repurposing model based on the feature construction of multi-layer ensemble (DRML-Ensemble) is proposed in this study. The performance of DRML-Ensemble, as evaluated on publicly available datasets, achieves an AUPR value of 0.93 and an AUROC value of 0.92, surpassing those of existing state-of-the-art methods. Additionally, DRML-Ensemble demonstrates its notable ability for drug repurposing in Alzheimer’s disease.</p><h3>Methods</h3><p>DRML-Ensemble is primarily composed of multiple layers of heterogeneous graph feature construction (HGFC). Each HGFC can extract protein features by leveraging the relationships between drugs, diseases, and proteins. These protein features are then utilized in subsequent layers to build drug and disease features, facilitating drug repurposing. By stacking multiple layers, optimal protein features can be obtained from the heterogeneous graph, consequently improving the accuracy of drug repurposing. However, an excessive· stacking of layers usually affect the model’s training process, for example, causing problems such as overfitting; a multi-layer ensemble prediction module is designed to further improve the model’s performance.</p></div>","PeriodicalId":651,"journal":{"name":"Journal of Molecular Modeling","volume":null,"pages":null},"PeriodicalIF":2.1000,"publicationDate":"2024-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"DRML-Ensemble: drug repurposing method based on feature construction of multi-layer ensemble\",\"authors\":\"Mengfei Zhang,&nbsp;Hongjian He,&nbsp;Jiang Xie,&nbsp;Qing Nie\",\"doi\":\"10.1007/s00894-024-06087-9\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Context</h3><p>Computational drug repurposing methods have been continuously developed in recent years to alleviate the high costs associated with drug development. As drug targets or the products of disease-related genes, proteins play an important role in drug repurposing. Although the potential has been demonstrated, heterogeneous graphs with proteins as independent nodes have yet to be studied, where extracting high-quality protein features from heterogeneous graphs poses a significant challenge. A novel drug repurposing model based on the feature construction of multi-layer ensemble (DRML-Ensemble) is proposed in this study. The performance of DRML-Ensemble, as evaluated on publicly available datasets, achieves an AUPR value of 0.93 and an AUROC value of 0.92, surpassing those of existing state-of-the-art methods. Additionally, DRML-Ensemble demonstrates its notable ability for drug repurposing in Alzheimer’s disease.</p><h3>Methods</h3><p>DRML-Ensemble is primarily composed of multiple layers of heterogeneous graph feature construction (HGFC). Each HGFC can extract protein features by leveraging the relationships between drugs, diseases, and proteins. These protein features are then utilized in subsequent layers to build drug and disease features, facilitating drug repurposing. By stacking multiple layers, optimal protein features can be obtained from the heterogeneous graph, consequently improving the accuracy of drug repurposing. However, an excessive· stacking of layers usually affect the model’s training process, for example, causing problems such as overfitting; a multi-layer ensemble prediction module is designed to further improve the model’s performance.</p></div>\",\"PeriodicalId\":651,\"journal\":{\"name\":\"Journal of Molecular Modeling\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":2.1000,\"publicationDate\":\"2024-07-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Molecular Modeling\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s00894-024-06087-9\",\"RegionNum\":4,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"BIOCHEMISTRY & MOLECULAR BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Molecular Modeling","FirstCategoryId":"92","ListUrlMain":"https://link.springer.com/article/10.1007/s00894-024-06087-9","RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

背景:近年来,人们不断开发计算药物再利用方法,以减轻药物开发的高昂成本。作为药物靶点或疾病相关基因的产物,蛋白质在药物再利用中发挥着重要作用。尽管蛋白质的潜力已得到证实,但以蛋白质为独立节点的异构图仍有待研究,而从异构图中提取高质量蛋白质特征则是一项重大挑战。本研究提出了一种基于多层集合特征构建的新型药物再利用模型(DRML-Ensemble)。在公开数据集上评估的 DRML-Ensemble 性能达到了 0.93 的 AUPR 值和 0.92 的 AUROC 值,超过了现有的最先进方法。此外,DRML-Ensemble 还证明了其在阿尔茨海默病药物再利用方面的显著能力:DRML-Ensemble主要由多层异构图特征构建(HGFC)组成。每个 HGFC 都能利用药物、疾病和蛋白质之间的关系提取蛋白质特征。这些蛋白质特征可在后续层中用于构建药物和疾病特征,从而促进药物的再利用。通过多层堆叠,可以从异构图中获得最佳蛋白质特征,从而提高药物再利用的准确性。然而,层的过度堆叠通常会影响模型的训练过程,例如导致过拟合等问题;因此设计了一个多层集合预测模块,以进一步提高模型的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

DRML-Ensemble: drug repurposing method based on feature construction of multi-layer ensemble

DRML-Ensemble: drug repurposing method based on feature construction of multi-layer ensemble

Context

Computational drug repurposing methods have been continuously developed in recent years to alleviate the high costs associated with drug development. As drug targets or the products of disease-related genes, proteins play an important role in drug repurposing. Although the potential has been demonstrated, heterogeneous graphs with proteins as independent nodes have yet to be studied, where extracting high-quality protein features from heterogeneous graphs poses a significant challenge. A novel drug repurposing model based on the feature construction of multi-layer ensemble (DRML-Ensemble) is proposed in this study. The performance of DRML-Ensemble, as evaluated on publicly available datasets, achieves an AUPR value of 0.93 and an AUROC value of 0.92, surpassing those of existing state-of-the-art methods. Additionally, DRML-Ensemble demonstrates its notable ability for drug repurposing in Alzheimer’s disease.

Methods

DRML-Ensemble is primarily composed of multiple layers of heterogeneous graph feature construction (HGFC). Each HGFC can extract protein features by leveraging the relationships between drugs, diseases, and proteins. These protein features are then utilized in subsequent layers to build drug and disease features, facilitating drug repurposing. By stacking multiple layers, optimal protein features can be obtained from the heterogeneous graph, consequently improving the accuracy of drug repurposing. However, an excessive· stacking of layers usually affect the model’s training process, for example, causing problems such as overfitting; a multi-layer ensemble prediction module is designed to further improve the model’s performance.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Molecular Modeling
Journal of Molecular Modeling 化学-化学综合
CiteScore
3.50
自引率
4.50%
发文量
362
审稿时长
2.9 months
期刊介绍: The Journal of Molecular Modeling focuses on "hardcore" modeling, publishing high-quality research and reports. Founded in 1995 as a purely electronic journal, it has adapted its format to include a full-color print edition, and adjusted its aims and scope fit the fast-changing field of molecular modeling, with a particular focus on three-dimensional modeling. Today, the journal covers all aspects of molecular modeling including life science modeling; materials modeling; new methods; and computational chemistry. Topics include computer-aided molecular design; rational drug design, de novo ligand design, receptor modeling and docking; cheminformatics, data analysis, visualization and mining; computational medicinal chemistry; homology modeling; simulation of peptides, DNA and other biopolymers; quantitative structure-activity relationships (QSAR) and ADME-modeling; modeling of biological reaction mechanisms; and combined experimental and computational studies in which calculations play a major role.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信