{"title":"TEMCL: Prediction of Drug-disease Associations Based on Transformer and Enhanced Multi-view Contrastive Learning.","authors":"Ming-Li Cui, Cui-Na Jiao, Ying-Lian Gao, Junliang Shang, Chun-Hou Zheng, Jin-Xing Liu","doi":"10.1109/JBHI.2025.3564360","DOIUrl":null,"url":null,"abstract":"<p><p>Drug repositioning (DR) has emerged as an effective method of identifying new indications for existing drugs. Many DR methods have demonstrated superior performance. However, most of them utilize a limited number of biological entities, ignoring the critical role of other entities in addressing data sparsity as well as improving model generalization capabilities. In addition, fully capturing high-order information of biological data still needs to be fully explored. To address above issues, a model based on transformer and enhanced multi-view contrastive learning (TEMCL) is proposed for predicting drug-disease associations (DDAs). Firstly, transformer is employed to obtain high-order features of nodes from similarity information. Secondly, based on similarity matrices and association matrices of nodes, two different types of views are constructed, i.e., homogeneous hypergraphs and heterogeneous association graphs. Among them, to alleviate sparsity problem existing in heterogeneous graphs, protein nodes as well as meta-path enhancement strategy are introduced. Thirdly, hypergraph convolutional network and heterogeneous graph transformer are used to extract node features on above two types of views, respectively. Contrastive learning is applied to obtain more representative features. Finally, multilayer perceptron (MLP) is used for predicting DDAs. Experiments show that TEMCL outperforms existing methods on DR task, exhibiting superior performance. In addition, case studies further demonstrate the effectiveness of this model. TEMCL provides new insights for identifying novel DDAs.</p>","PeriodicalId":13073,"journal":{"name":"IEEE Journal of Biomedical and Health Informatics","volume":"PP ","pages":""},"PeriodicalIF":6.7000,"publicationDate":"2025-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Journal of Biomedical and Health Informatics","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1109/JBHI.2025.3564360","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Drug repositioning (DR) has emerged as an effective method of identifying new indications for existing drugs. Many DR methods have demonstrated superior performance. However, most of them utilize a limited number of biological entities, ignoring the critical role of other entities in addressing data sparsity as well as improving model generalization capabilities. In addition, fully capturing high-order information of biological data still needs to be fully explored. To address above issues, a model based on transformer and enhanced multi-view contrastive learning (TEMCL) is proposed for predicting drug-disease associations (DDAs). Firstly, transformer is employed to obtain high-order features of nodes from similarity information. Secondly, based on similarity matrices and association matrices of nodes, two different types of views are constructed, i.e., homogeneous hypergraphs and heterogeneous association graphs. Among them, to alleviate sparsity problem existing in heterogeneous graphs, protein nodes as well as meta-path enhancement strategy are introduced. Thirdly, hypergraph convolutional network and heterogeneous graph transformer are used to extract node features on above two types of views, respectively. Contrastive learning is applied to obtain more representative features. Finally, multilayer perceptron (MLP) is used for predicting DDAs. Experiments show that TEMCL outperforms existing methods on DR task, exhibiting superior performance. In addition, case studies further demonstrate the effectiveness of this model. TEMCL provides new insights for identifying novel DDAs.
期刊介绍:
IEEE Journal of Biomedical and Health Informatics publishes original papers presenting recent advances where information and communication technologies intersect with health, healthcare, life sciences, and biomedicine. Topics include acquisition, transmission, storage, retrieval, management, and analysis of biomedical and health information. The journal covers applications of information technologies in healthcare, patient monitoring, preventive care, early disease diagnosis, therapy discovery, and personalized treatment protocols. It explores electronic medical and health records, clinical information systems, decision support systems, medical and biological imaging informatics, wearable systems, body area/sensor networks, and more. Integration-related topics like interoperability, evidence-based medicine, and secure patient data are also addressed.