Prediction of Drug Pathway-based Disease Classes using Multiple Properties of Drugs

IF 2.9 3区生物学 Q3 BIOCHEMICAL RESEARCH METHODS

Current Bioinformatics Pub Date : 2024-01-29 DOI:10.2174/0115748936284973240105115444

Lei Chen, Linyang Li

{"title":"Prediction of Drug Pathway-based Disease Classes using Multiple Properties of Drugs","authors":"Lei Chen, Linyang Li","doi":"10.2174/0115748936284973240105115444","DOIUrl":null,"url":null,"abstract":"Background:: Drug repositioning now is an important research area in drug discovery as it can accelerate the procedures of discovering novel effects of existing drugs. However, it is challenging to screen out possible effects for given drugs. Designing computational methods are a quick and cheap way to complete this task. Most existing computational methods infer the relationships between drugs and diseases. The pathway-based disease classification reported in KEGG provides us a new way to investigate drug repositioning as such classification can be applied to drugs. A predicted class of a given drug suggests latent diseases it can treat. Objective:: The purpose of this study is to set up efficient multi-label classifiers to predict the classes of drugs. Method:: We adopt three types of drug information to generate drug features, including drug pathway information, label information and drug network. For the first two types, drugs are first encoded into binary vectors, which are further processed by singular value decomposition. For the third type, the network embedding algorithm, Mashup, is employed to yield drug features. Above features are combined and fed into RAndom k-labELsets (RAKEL) to construct multi-label classifiers, where support vector machine is selected as the base classification algorithm. Results:: The ten-fold cross-validation results show that the classifiers provide high performance with accuracy higher than 0.95 and absolute true higher than 0.92. The case study indicates the novel effects of three drugs, i.e., they may treat new diseases. Conclusion:: The proposed classifiers have high performance and are superiority to the classifiers with other classic algorithms and drug information. Furthermore, they have the ability to discover new effects of drugs.","PeriodicalId":10801,"journal":{"name":"Current Bioinformatics","volume":"222 1","pages":""},"PeriodicalIF":2.9000,"publicationDate":"2024-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Current Bioinformatics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.2174/0115748936284973240105115444","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}

引用次数: 0

Abstract

Background:: Drug repositioning now is an important research area in drug discovery as it can accelerate the procedures of discovering novel effects of existing drugs. However, it is challenging to screen out possible effects for given drugs. Designing computational methods are a quick and cheap way to complete this task. Most existing computational methods infer the relationships between drugs and diseases. The pathway-based disease classification reported in KEGG provides us a new way to investigate drug repositioning as such classification can be applied to drugs. A predicted class of a given drug suggests latent diseases it can treat. Objective:: The purpose of this study is to set up efficient multi-label classifiers to predict the classes of drugs. Method:: We adopt three types of drug information to generate drug features, including drug pathway information, label information and drug network. For the first two types, drugs are first encoded into binary vectors, which are further processed by singular value decomposition. For the third type, the network embedding algorithm, Mashup, is employed to yield drug features. Above features are combined and fed into RAndom k-labELsets (RAKEL) to construct multi-label classifiers, where support vector machine is selected as the base classification algorithm. Results:: The ten-fold cross-validation results show that the classifiers provide high performance with accuracy higher than 0.95 and absolute true higher than 0.92. The case study indicates the novel effects of three drugs, i.e., they may treat new diseases. Conclusion:: The proposed classifiers have high performance and are superiority to the classifiers with other classic algorithms and drug information. Furthermore, they have the ability to discover new effects of drugs.

查看原文本刊更多论文

利用药物的多种特性预测基于药物途径的疾病类别

背景目前，药物重新定位是药物发现的一个重要研究领域，因为它可以加快发现现有药物新作用的程序。然而，要筛选出特定药物的可能作用是一项挑战。设计计算方法是完成这项任务的快速而廉价的途径。现有的大多数计算方法都是推断药物与疾病之间的关系。KEGG 中报告的基于通路的疾病分类为我们提供了一种研究药物重新定位的新方法，因为这种分类可以应用于药物。某种药物的预测类别暗示了它可以治疗的潜在疾病。研究目的本研究的目的是建立高效的多标签分类器来预测药物类别。方法：我们采用三种药物信息来生成药物特征，包括药物路径信息、标签信息和药物网络。对于前两种类型，首先将药物编码为二进制向量，然后对其进行奇异值分解处理。对于第三种类型，则采用网络嵌入算法 Mashup 来生成药物特征。上述特征经组合后输入 RAndom k-labELsets (RAKEL) 以构建多标签分类器，并选择支持向量机作为基础分类算法。结果十倍交叉验证结果表明，分类器具有较高的性能，准确率高于 0.95，绝对真实度高于 0.92。案例研究表明了三种药物的新作用，即它们可以治疗新的疾病。结论所提出的分类器具有很高的性能，优于使用其他经典算法和药物信息的分类器。此外，它们还具有发现药物新功效的能力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Current Bioinformatics 生物-生化研究方法

CiteScore

6.60

自引率

2.50%

发文量

审稿时长

>12 weeks

期刊介绍： Current Bioinformatics aims to publish all the latest and outstanding developments in bioinformatics. Each issue contains a series of timely, in-depth/mini-reviews, research papers and guest edited thematic issues written by leaders in the field, covering a wide range of the integration of biology with computer and information science. The journal focuses on advances in computational molecular/structural biology, encompassing areas such as computing in biomedicine and genomics, computational proteomics and systems biology, and metabolic pathway engineering. Developments in these fields have direct implications on key issues related to health care, medicine, genetic disorders, development of agricultural products, renewable energy, environmental protection, etc.