PMLocMSCAM: Predicting miRNA Subcellular Localisations by miRNA Similarities and Cross-Attention Mechanism

IF 1.9 4区 生物学 Q4 CELL BIOLOGY
Jipu Jiang, Cheng Yan
{"title":"PMLocMSCAM: Predicting miRNA Subcellular Localisations by miRNA Similarities and Cross-Attention Mechanism","authors":"Jipu Jiang,&nbsp;Cheng Yan","doi":"10.1049/syb2.70023","DOIUrl":null,"url":null,"abstract":"<p>Many studies have shown that microRNAs (miRNAs) play key roles in some important processes and human complicated diseases. In addition, they also have specific physiological roles at different cellular sites. Therefore, identifying their subcellular localisation is very urgent to systemically understand their physiological functions. In this study, we propose a computational method, called PMLocMSCAM, to predict miRNA subcellular localisation based on miRNA similarities and cross-attention mechanism. PMLocMSCAM implements a multimodal integration framework that systematically processes miRNA sequence data, miRNA-mRNA association networks with mRNA subcellular localisation annotations, miRNA-disease associations, and miRNA-drug association networks. The architecture initiates with intrinsic feature extraction through Smith-Waterman alignment for sequence similarity computation and disease ontology-based functional similarity derivation. Subsequent heterogeneous network embedding employs Node2vec for topological feature learning across three interaction modalities (miRNA-disease, miRNA-drug, and miRNA-mRNA networks), enhanced by hypergraph convolution to capture higher-order relationships through incidence matrix decomposition. Localisation-specific patterns are propagated via miRNA-mRNA interaction weights, culminating in a multi-head attention mechanism that dynamically fuses five feature matrices—miRNA sequence features, miRNA-disease association features, miRNA-drug association features, miRNA-mRNA association features, and miRNA-mRNA localisation features. These integrated representations are processed through residual-connected multilayer perceptrons to generate probabilistic predictions across seven subcellular compartments, establishing an end-to-end computational paradigm for multimodal miRNA localisation analysis. In order to assess the prediction performance of our method and compare it with other miRNA subcellular localisation computational methods, we conduct 10-fold cross validation (10-CV) and independent test dataset. The AUC (area of receiver operating characteristic curve) and AUPR (area of precision-recall curve) are used as metrics. The experiment results show that the average AUC and AUPR values exceed 0.9182 and 0.8487 on 10-CV, respectively. The AUC and AUPR values also reach 0.9157 and 0.8469 on independent test dataset, respectively. It is superior with compared methods. The ablation experiment results also further that PMLocMSCAM can effective predict miRNA subcellular localisations and provide help to understand their physiological functions.</p>","PeriodicalId":50379,"journal":{"name":"IET Systems Biology","volume":"19 1","pages":""},"PeriodicalIF":1.9000,"publicationDate":"2025-06-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/syb2.70023","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IET Systems Biology","FirstCategoryId":"99","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1049/syb2.70023","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"CELL BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Many studies have shown that microRNAs (miRNAs) play key roles in some important processes and human complicated diseases. In addition, they also have specific physiological roles at different cellular sites. Therefore, identifying their subcellular localisation is very urgent to systemically understand their physiological functions. In this study, we propose a computational method, called PMLocMSCAM, to predict miRNA subcellular localisation based on miRNA similarities and cross-attention mechanism. PMLocMSCAM implements a multimodal integration framework that systematically processes miRNA sequence data, miRNA-mRNA association networks with mRNA subcellular localisation annotations, miRNA-disease associations, and miRNA-drug association networks. The architecture initiates with intrinsic feature extraction through Smith-Waterman alignment for sequence similarity computation and disease ontology-based functional similarity derivation. Subsequent heterogeneous network embedding employs Node2vec for topological feature learning across three interaction modalities (miRNA-disease, miRNA-drug, and miRNA-mRNA networks), enhanced by hypergraph convolution to capture higher-order relationships through incidence matrix decomposition. Localisation-specific patterns are propagated via miRNA-mRNA interaction weights, culminating in a multi-head attention mechanism that dynamically fuses five feature matrices—miRNA sequence features, miRNA-disease association features, miRNA-drug association features, miRNA-mRNA association features, and miRNA-mRNA localisation features. These integrated representations are processed through residual-connected multilayer perceptrons to generate probabilistic predictions across seven subcellular compartments, establishing an end-to-end computational paradigm for multimodal miRNA localisation analysis. In order to assess the prediction performance of our method and compare it with other miRNA subcellular localisation computational methods, we conduct 10-fold cross validation (10-CV) and independent test dataset. The AUC (area of receiver operating characteristic curve) and AUPR (area of precision-recall curve) are used as metrics. The experiment results show that the average AUC and AUPR values exceed 0.9182 and 0.8487 on 10-CV, respectively. The AUC and AUPR values also reach 0.9157 and 0.8469 on independent test dataset, respectively. It is superior with compared methods. The ablation experiment results also further that PMLocMSCAM can effective predict miRNA subcellular localisations and provide help to understand their physiological functions.

PMLocMSCAM:通过miRNA相似性和交叉注意机制预测miRNA亚细胞定位
许多研究表明,microRNAs (miRNAs)在一些重要过程和人类复杂疾病中起着关键作用。此外,它们在不同的细胞部位也有特定的生理作用。因此,确定它们的亚细胞定位对于系统地了解它们的生理功能是非常迫切的。在这项研究中,我们提出了一种称为PMLocMSCAM的计算方法,基于miRNA相似性和交叉注意机制来预测miRNA亚细胞定位。PMLocMSCAM实现了一个多模式集成框架,系统地处理miRNA序列数据、带有mRNA亚细胞定位注释的miRNA-mRNA关联网络、miRNA-疾病关联和miRNA-药物关联网络。该体系结构首先通过Smith-Waterman比对进行序列相似度计算和基于疾病本体的功能相似度派生的内在特征提取。随后的异构网络嵌入采用Node2vec在三种相互作用模式(mirna -疾病、mirna -药物和miRNA-mRNA网络)中进行拓扑特征学习,并通过超图卷积增强,通过关联矩阵分解捕获高阶关系。定位特异性模式通过miRNA-mRNA相互作用权重传播,最终形成一个多头注意机制,该机制动态融合了五个特征矩阵:mirna序列特征、mirna -疾病关联特征、mirna -药物关联特征、miRNA-mRNA关联特征和miRNA-mRNA定位特征。这些综合表征通过残差连接多层感知器进行处理,生成跨七个亚细胞区室的概率预测,为多模态miRNA定位分析建立端到端计算范式。为了评估我们的方法的预测性能并将其与其他miRNA亚细胞定位计算方法进行比较,我们进行了10倍交叉验证(10-CV)和独立的测试数据集。以接收者工作特征曲线面积(AUC)和精确召回率曲线面积(AUPR)为指标。实验结果表明,在10-CV条件下,平均AUC和AUPR值分别超过0.9182和0.8487。在独立测试数据集上,AUC和AUPR值也分别达到0.9157和0.8469。与比较方法相比,该方法具有优越性。消融实验结果也进一步证实了PMLocMSCAM能够有效预测miRNA亚细胞定位,为了解其生理功能提供帮助。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
IET Systems Biology
IET Systems Biology 生物-数学与计算生物学
CiteScore
4.20
自引率
4.30%
发文量
17
审稿时长
>12 weeks
期刊介绍: IET Systems Biology covers intra- and inter-cellular dynamics, using systems- and signal-oriented approaches. Papers that analyse genomic data in order to identify variables and basic relationships between them are considered if the results provide a basis for mathematical modelling and simulation of cellular dynamics. Manuscripts on molecular and cell biological studies are encouraged if the aim is a systems approach to dynamic interactions within and between cells. The scope includes the following topics: Genomics, transcriptomics, proteomics, metabolomics, cells, tissue and the physiome; molecular and cellular interaction, gene, cell and protein function; networks and pathways; metabolism and cell signalling; dynamics, regulation and control; systems, signals, and information; experimental data analysis; mathematical modelling, simulation and theoretical analysis; biological modelling, simulation, prediction and control; methodologies, databases, tools and algorithms for modelling and simulation; modelling, analysis and control of biological networks; synthetic biology and bioengineering based on systems biology.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信