Structure-sensitive transformer and multi-view graph contrastive learning enhanced prediction of drug-related microbes.

IF 3.3 3区生物学 Q2 BIOCHEMICAL RESEARCH METHODS

BMC Bioinformatics Pub Date : 2025-09-26 DOI:10.1186/s12859-025-06199-w

Ping Xuan, Rui Wang, Jing Gu, Hui Cui, Tiangang Zhang

{"title":"Structure-sensitive transformer and multi-view graph contrastive learning enhanced prediction of drug-related microbes.","authors":"Ping Xuan, Rui Wang, Jing Gu, Hui Cui, Tiangang Zhang","doi":"10.1186/s12859-025-06199-w","DOIUrl":null,"url":null,"abstract":"Background: The human microbiome plays a crucial role in regulating the efficacy and toxicity of drugs as well as in developing the drugs. Therefore, predicting the drug-related microbes is beneficial for analyzing the functional mechanisms of drugs. Recently, the graph learning based methods demonstrated their advantages in extracting the node features from the biological heterogeneous graphs. However, the previous methods failed to completely preserve the intrinsic structures of biological data and did not fully utilize the topological and positional information for predicting the drug-microbe associations.Results: We propose a new prediction model, structure-sensitive transformer and multi-view graph contrastive learning for microbe-drug association prediction (SMMDA), to encode and integrate the topological structures, semantics, and multiple-view embedding features of the drugs and microbes. Considering the sparsity of the original features of drugs and microbes, the learnable data augmentation strategy is designed to learn their global representations. Since similar drugs are more likely to associate with the similar microbes, a structure-sensitive transformer is proposed to integrate the topology structures composed of drugs (microbes) to form the multi-view embedding features. We design two contrastive learning strategies to exploit the complementary semantics across multiple views. As the embedding features from multiple views have various semantics, we design view-level attention to adaptively integrate these features.Conclusions: The extensive experimental results show that SMMDA outperforms several state-of-the-art methods for predicting the drug-related candidate microbes. The ablation studies show the effectiveness of the major innovations which include the learnable data augmentation, structure-sensitive transformer-based node feature learning, and multi-view contrastive learning. The case studies on three drugs also demonstrate SMMDA's capability in retrieving the potential microbe candidates for the drugs.","PeriodicalId":8958,"journal":{"name":"BMC Bioinformatics","volume":"26 1","pages":"231"},"PeriodicalIF":3.3000,"publicationDate":"2025-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12465207/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Bioinformatics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s12859-025-06199-w","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}

引用次数: 0

Abstract

Background: The human microbiome plays a crucial role in regulating the efficacy and toxicity of drugs as well as in developing the drugs. Therefore, predicting the drug-related microbes is beneficial for analyzing the functional mechanisms of drugs. Recently, the graph learning based methods demonstrated their advantages in extracting the node features from the biological heterogeneous graphs. However, the previous methods failed to completely preserve the intrinsic structures of biological data and did not fully utilize the topological and positional information for predicting the drug-microbe associations.

Results: We propose a new prediction model, structure-sensitive transformer and multi-view graph contrastive learning for microbe-drug association prediction (SMMDA), to encode and integrate the topological structures, semantics, and multiple-view embedding features of the drugs and microbes. Considering the sparsity of the original features of drugs and microbes, the learnable data augmentation strategy is designed to learn their global representations. Since similar drugs are more likely to associate with the similar microbes, a structure-sensitive transformer is proposed to integrate the topology structures composed of drugs (microbes) to form the multi-view embedding features. We design two contrastive learning strategies to exploit the complementary semantics across multiple views. As the embedding features from multiple views have various semantics, we design view-level attention to adaptively integrate these features.

Conclusions: The extensive experimental results show that SMMDA outperforms several state-of-the-art methods for predicting the drug-related candidate microbes. The ablation studies show the effectiveness of the major innovations which include the learnable data augmentation, structure-sensitive transformer-based node feature learning, and multi-view contrastive learning. The case studies on three drugs also demonstrate SMMDA's capability in retrieving the potential microbe candidates for the drugs.

查看原文本刊更多论文

结构敏感变压器和多视图图对比学习增强了药物相关微生物的预测。

背景：人体微生物组在调节药物的疗效和毒性以及开发药物方面起着至关重要的作用。因此，预测药物相关微生物有助于分析药物的作用机制。近年来，基于图学习的方法在生物异构图的节点特征提取方面显示出其优势。然而，以往的方法未能完全保留生物数据的内在结构，并没有充分利用拓扑和位置信息来预测药物-微生物关联。结果：我们提出了一种新的微生物-药物关联预测模型——结构敏感转换和多视图图对比学习，对药物和微生物的拓扑结构、语义和多视图嵌入特征进行编码和整合。考虑到药物和微生物原始特征的稀疏性，设计了可学习数据增强策略来学习它们的全局表示。针对相似药物更容易与相似微生物关联的特点，提出了一种结构敏感转换器，将药物（微生物）组成的拓扑结构进行整合，形成多视图嵌入特征。我们设计了两种对比学习策略来利用跨多个视图的互补语义。由于来自多个视图的嵌入特征具有不同的语义，我们设计了视图级关注来自适应地集成这些特征。结论：广泛的实验结果表明，SMMDA在预测药物相关候选微生物方面优于几种最先进的方法。消融研究表明了主要创新的有效性，包括可学习数据增强、基于结构敏感变压器的节点特征学习和多视图对比学习。对三种药物的案例研究也证明了SMMDA在检索药物潜在候选微生物方面的能力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

BMC Bioinformatics 生物-生化研究方法

CiteScore

5.70

自引率

3.30%

发文量

506

审稿时长

4.3 months

期刊介绍： BMC Bioinformatics is an open access, peer-reviewed journal that considers articles on all aspects of the development, testing and novel application of computational and statistical methods for the modeling and analysis of all kinds of biological data, as well as other areas of computational biology. BMC Bioinformatics is part of the BMC series which publishes subject-specific journals focused on the needs of individual research communities across all areas of biology and medicine. We offer an efficient, fair and friendly peer review service, and are committed to publishing all sound science, provided that there is some advance in knowledge presented by the work.