TEMCL:基于Transformer和增强的多视图对比学习的药物-疾病关联预测。

IF 6.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS
Ming-Li Cui, Cui-Na Jiao, Ying-Lian Gao, Junliang Shang, Chun-Hou Zheng, Jin-Xing Liu
{"title":"TEMCL:基于Transformer和增强的多视图对比学习的药物-疾病关联预测。","authors":"Ming-Li Cui, Cui-Na Jiao, Ying-Lian Gao, Junliang Shang, Chun-Hou Zheng, Jin-Xing Liu","doi":"10.1109/JBHI.2025.3564360","DOIUrl":null,"url":null,"abstract":"<p><p>Drug repositioning (DR) has emerged as an effective method of identifying new indications for existing drugs. Many DR methods have demonstrated superior performance. However, most of them utilize a limited number of biological entities, ignoring the critical role of other entities in addressing data sparsity as well as improving model generalization capabilities. In addition, fully capturing high-order information of biological data still needs to be fully explored. To address above issues, a model based on transformer and enhanced multi-view contrastive learning (TEMCL) is proposed for predicting drug-disease associations (DDAs). Firstly, transformer is employed to obtain high-order features of nodes from similarity information. Secondly, based on similarity matrices and association matrices of nodes, two different types of views are constructed, i.e., homogeneous hypergraphs and heterogeneous association graphs. Among them, to alleviate sparsity problem existing in heterogeneous graphs, protein nodes as well as meta-path enhancement strategy are introduced. Thirdly, hypergraph convolutional network and heterogeneous graph transformer are used to extract node features on above two types of views, respectively. Contrastive learning is applied to obtain more representative features. Finally, multilayer perceptron (MLP) is used for predicting DDAs. Experiments show that TEMCL outperforms existing methods on DR task, exhibiting superior performance. In addition, case studies further demonstrate the effectiveness of this model. TEMCL provides new insights for identifying novel DDAs.</p>","PeriodicalId":13073,"journal":{"name":"IEEE Journal of Biomedical and Health Informatics","volume":"PP ","pages":""},"PeriodicalIF":6.7000,"publicationDate":"2025-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"TEMCL: Prediction of Drug-disease Associations Based on Transformer and Enhanced Multi-view Contrastive Learning.\",\"authors\":\"Ming-Li Cui, Cui-Na Jiao, Ying-Lian Gao, Junliang Shang, Chun-Hou Zheng, Jin-Xing Liu\",\"doi\":\"10.1109/JBHI.2025.3564360\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Drug repositioning (DR) has emerged as an effective method of identifying new indications for existing drugs. Many DR methods have demonstrated superior performance. However, most of them utilize a limited number of biological entities, ignoring the critical role of other entities in addressing data sparsity as well as improving model generalization capabilities. In addition, fully capturing high-order information of biological data still needs to be fully explored. To address above issues, a model based on transformer and enhanced multi-view contrastive learning (TEMCL) is proposed for predicting drug-disease associations (DDAs). Firstly, transformer is employed to obtain high-order features of nodes from similarity information. Secondly, based on similarity matrices and association matrices of nodes, two different types of views are constructed, i.e., homogeneous hypergraphs and heterogeneous association graphs. Among them, to alleviate sparsity problem existing in heterogeneous graphs, protein nodes as well as meta-path enhancement strategy are introduced. Thirdly, hypergraph convolutional network and heterogeneous graph transformer are used to extract node features on above two types of views, respectively. Contrastive learning is applied to obtain more representative features. Finally, multilayer perceptron (MLP) is used for predicting DDAs. Experiments show that TEMCL outperforms existing methods on DR task, exhibiting superior performance. In addition, case studies further demonstrate the effectiveness of this model. TEMCL provides new insights for identifying novel DDAs.</p>\",\"PeriodicalId\":13073,\"journal\":{\"name\":\"IEEE Journal of Biomedical and Health Informatics\",\"volume\":\"PP \",\"pages\":\"\"},\"PeriodicalIF\":6.7000,\"publicationDate\":\"2025-04-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Journal of Biomedical and Health Informatics\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1109/JBHI.2025.3564360\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Journal of Biomedical and Health Informatics","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1109/JBHI.2025.3564360","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

摘要

药物重新定位(DR)已成为识别现有药物新适应症的有效方法。许多DR方法都表现出了优异的性能。然而,它们中的大多数利用了有限数量的生物实体,忽略了其他实体在解决数据稀疏性和提高模型泛化能力方面的关键作用。此外,如何充分捕获生物数据的高阶信息还有待进一步探索。针对上述问题,提出了一种基于变形和增强多视图对比学习(TEMCL)的药物-疾病关联预测模型。首先,利用变压器从相似度信息中获取节点的高阶特征;其次,基于节点的相似矩阵和关联矩阵,构造了同构超图和异构关联图两种不同类型的视图;其中,为了缓解异构图存在的稀疏性问题,引入了蛋白质节点和元路径增强策略。第三,利用超图卷积网络和异构图转换器分别提取上述两类视图的节点特征。采用对比学习的方法获得更有代表性的特征。最后,利用多层感知器(MLP)对DDAs进行预测。实验表明,TEMCL算法在DR任务上的性能优于现有算法。此外,案例研究进一步证明了该模型的有效性。TEMCL为识别新的dda提供了新的见解。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
TEMCL: Prediction of Drug-disease Associations Based on Transformer and Enhanced Multi-view Contrastive Learning.

Drug repositioning (DR) has emerged as an effective method of identifying new indications for existing drugs. Many DR methods have demonstrated superior performance. However, most of them utilize a limited number of biological entities, ignoring the critical role of other entities in addressing data sparsity as well as improving model generalization capabilities. In addition, fully capturing high-order information of biological data still needs to be fully explored. To address above issues, a model based on transformer and enhanced multi-view contrastive learning (TEMCL) is proposed for predicting drug-disease associations (DDAs). Firstly, transformer is employed to obtain high-order features of nodes from similarity information. Secondly, based on similarity matrices and association matrices of nodes, two different types of views are constructed, i.e., homogeneous hypergraphs and heterogeneous association graphs. Among them, to alleviate sparsity problem existing in heterogeneous graphs, protein nodes as well as meta-path enhancement strategy are introduced. Thirdly, hypergraph convolutional network and heterogeneous graph transformer are used to extract node features on above two types of views, respectively. Contrastive learning is applied to obtain more representative features. Finally, multilayer perceptron (MLP) is used for predicting DDAs. Experiments show that TEMCL outperforms existing methods on DR task, exhibiting superior performance. In addition, case studies further demonstrate the effectiveness of this model. TEMCL provides new insights for identifying novel DDAs.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
IEEE Journal of Biomedical and Health Informatics
IEEE Journal of Biomedical and Health Informatics COMPUTER SCIENCE, INFORMATION SYSTEMS-COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS
CiteScore
13.60
自引率
6.50%
发文量
1151
期刊介绍: IEEE Journal of Biomedical and Health Informatics publishes original papers presenting recent advances where information and communication technologies intersect with health, healthcare, life sciences, and biomedicine. Topics include acquisition, transmission, storage, retrieval, management, and analysis of biomedical and health information. The journal covers applications of information technologies in healthcare, patient monitoring, preventive care, early disease diagnosis, therapy discovery, and personalized treatment protocols. It explores electronic medical and health records, clinical information systems, decision support systems, medical and biological imaging informatics, wearable systems, body area/sensor networks, and more. Integration-related topics like interoperability, evidence-based medicine, and secure patient data are also addressed.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信