{"title":"DRML-Ensemble:基于多层集合特征构建的药物再利用方法。","authors":"Mengfei Zhang, Hongjian He, Jiang Xie, Qing Nie","doi":"10.1007/s00894-024-06087-9","DOIUrl":null,"url":null,"abstract":"<div><h3>Context</h3><p>Computational drug repurposing methods have been continuously developed in recent years to alleviate the high costs associated with drug development. As drug targets or the products of disease-related genes, proteins play an important role in drug repurposing. Although the potential has been demonstrated, heterogeneous graphs with proteins as independent nodes have yet to be studied, where extracting high-quality protein features from heterogeneous graphs poses a significant challenge. A novel drug repurposing model based on the feature construction of multi-layer ensemble (DRML-Ensemble) is proposed in this study. The performance of DRML-Ensemble, as evaluated on publicly available datasets, achieves an AUPR value of 0.93 and an AUROC value of 0.92, surpassing those of existing state-of-the-art methods. Additionally, DRML-Ensemble demonstrates its notable ability for drug repurposing in Alzheimer’s disease.</p><h3>Methods</h3><p>DRML-Ensemble is primarily composed of multiple layers of heterogeneous graph feature construction (HGFC). Each HGFC can extract protein features by leveraging the relationships between drugs, diseases, and proteins. These protein features are then utilized in subsequent layers to build drug and disease features, facilitating drug repurposing. By stacking multiple layers, optimal protein features can be obtained from the heterogeneous graph, consequently improving the accuracy of drug repurposing. However, an excessive· stacking of layers usually affect the model’s training process, for example, causing problems such as overfitting; a multi-layer ensemble prediction module is designed to further improve the model’s performance.</p></div>","PeriodicalId":651,"journal":{"name":"Journal of Molecular Modeling","volume":null,"pages":null},"PeriodicalIF":2.1000,"publicationDate":"2024-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"DRML-Ensemble: drug repurposing method based on feature construction of multi-layer ensemble\",\"authors\":\"Mengfei Zhang, Hongjian He, Jiang Xie, Qing Nie\",\"doi\":\"10.1007/s00894-024-06087-9\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Context</h3><p>Computational drug repurposing methods have been continuously developed in recent years to alleviate the high costs associated with drug development. As drug targets or the products of disease-related genes, proteins play an important role in drug repurposing. Although the potential has been demonstrated, heterogeneous graphs with proteins as independent nodes have yet to be studied, where extracting high-quality protein features from heterogeneous graphs poses a significant challenge. A novel drug repurposing model based on the feature construction of multi-layer ensemble (DRML-Ensemble) is proposed in this study. The performance of DRML-Ensemble, as evaluated on publicly available datasets, achieves an AUPR value of 0.93 and an AUROC value of 0.92, surpassing those of existing state-of-the-art methods. Additionally, DRML-Ensemble demonstrates its notable ability for drug repurposing in Alzheimer’s disease.</p><h3>Methods</h3><p>DRML-Ensemble is primarily composed of multiple layers of heterogeneous graph feature construction (HGFC). Each HGFC can extract protein features by leveraging the relationships between drugs, diseases, and proteins. These protein features are then utilized in subsequent layers to build drug and disease features, facilitating drug repurposing. By stacking multiple layers, optimal protein features can be obtained from the heterogeneous graph, consequently improving the accuracy of drug repurposing. However, an excessive· stacking of layers usually affect the model’s training process, for example, causing problems such as overfitting; a multi-layer ensemble prediction module is designed to further improve the model’s performance.</p></div>\",\"PeriodicalId\":651,\"journal\":{\"name\":\"Journal of Molecular Modeling\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":2.1000,\"publicationDate\":\"2024-07-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Molecular Modeling\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s00894-024-06087-9\",\"RegionNum\":4,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"BIOCHEMISTRY & MOLECULAR BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Molecular Modeling","FirstCategoryId":"92","ListUrlMain":"https://link.springer.com/article/10.1007/s00894-024-06087-9","RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
DRML-Ensemble: drug repurposing method based on feature construction of multi-layer ensemble
Context
Computational drug repurposing methods have been continuously developed in recent years to alleviate the high costs associated with drug development. As drug targets or the products of disease-related genes, proteins play an important role in drug repurposing. Although the potential has been demonstrated, heterogeneous graphs with proteins as independent nodes have yet to be studied, where extracting high-quality protein features from heterogeneous graphs poses a significant challenge. A novel drug repurposing model based on the feature construction of multi-layer ensemble (DRML-Ensemble) is proposed in this study. The performance of DRML-Ensemble, as evaluated on publicly available datasets, achieves an AUPR value of 0.93 and an AUROC value of 0.92, surpassing those of existing state-of-the-art methods. Additionally, DRML-Ensemble demonstrates its notable ability for drug repurposing in Alzheimer’s disease.
Methods
DRML-Ensemble is primarily composed of multiple layers of heterogeneous graph feature construction (HGFC). Each HGFC can extract protein features by leveraging the relationships between drugs, diseases, and proteins. These protein features are then utilized in subsequent layers to build drug and disease features, facilitating drug repurposing. By stacking multiple layers, optimal protein features can be obtained from the heterogeneous graph, consequently improving the accuracy of drug repurposing. However, an excessive· stacking of layers usually affect the model’s training process, for example, causing problems such as overfitting; a multi-layer ensemble prediction module is designed to further improve the model’s performance.
期刊介绍:
The Journal of Molecular Modeling focuses on "hardcore" modeling, publishing high-quality research and reports. Founded in 1995 as a purely electronic journal, it has adapted its format to include a full-color print edition, and adjusted its aims and scope fit the fast-changing field of molecular modeling, with a particular focus on three-dimensional modeling.
Today, the journal covers all aspects of molecular modeling including life science modeling; materials modeling; new methods; and computational chemistry.
Topics include computer-aided molecular design; rational drug design, de novo ligand design, receptor modeling and docking; cheminformatics, data analysis, visualization and mining; computational medicinal chemistry; homology modeling; simulation of peptides, DNA and other biopolymers; quantitative structure-activity relationships (QSAR) and ADME-modeling; modeling of biological reaction mechanisms; and combined experimental and computational studies in which calculations play a major role.