Fusion of multi-source relationships and topology to infer lncRNA-protein interactions

IF 6.5 2区 医学 Q1 MEDICINE, RESEARCH & EXPERIMENTAL
Xinyu Zhang, Mingzhe Liu, Zhen Li, Linlin Zhuo, Xiangzheng Fu, Quan Zou
{"title":"Fusion of multi-source relationships and topology to infer lncRNA-protein interactions","authors":"Xinyu Zhang, Mingzhe Liu, Zhen Li, Linlin Zhuo, Xiangzheng Fu, Quan Zou","doi":"10.1016/j.omtn.2024.102187","DOIUrl":null,"url":null,"abstract":"Long non-coding RNAs (lncRNAs) are important factors involved in biological regulatory networks. Accurately predicting lncRNA-protein interactions (LPIs) is vital for clarifying lncRNA’s functions and pathogenic mechanisms. Existing deep learning models have yet to yield satisfactory results in LPI prediction. Recently, graph autoencoders (GAEs) have seen rapid development, excelling in tasks like link prediction and node classification. We employed GAE technology for LPI prediction, devising the FMSRT-LPI model based on path masking and degree regression strategies and thereby achieving satisfactory outcomes. This represents the first known integration of path masking and degree regression strategies into the GAE framework for potential LPI inference. The effectiveness of our FMSRT-LPI model primarily relies on four key aspects. First, within the GAE framework, our model integrates multi-source relationships of lncRNAs and proteins with LPN’s topological data. Second, the implemented masking strategy efficiently identifies LPN’s key paths, reconstructs the network, and reduces the impact of redundant or incorrect data. Third, the integrated degree decoder balances degree and structural information, enhancing node representation. Fourth, the PolyLoss function we introduced is more appropriate for LPI prediction tasks. The results on multiple public datasets further demonstrate our model’s potential in LPI prediction. Our code and data can be freely accessed at .","PeriodicalId":18821,"journal":{"name":"Molecular Therapy. Nucleic Acids","volume":"38 1","pages":""},"PeriodicalIF":6.5000,"publicationDate":"2024-04-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Molecular Therapy. Nucleic Acids","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.omtn.2024.102187","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MEDICINE, RESEARCH & EXPERIMENTAL","Score":null,"Total":0}
引用次数: 0

Abstract

Long non-coding RNAs (lncRNAs) are important factors involved in biological regulatory networks. Accurately predicting lncRNA-protein interactions (LPIs) is vital for clarifying lncRNA’s functions and pathogenic mechanisms. Existing deep learning models have yet to yield satisfactory results in LPI prediction. Recently, graph autoencoders (GAEs) have seen rapid development, excelling in tasks like link prediction and node classification. We employed GAE technology for LPI prediction, devising the FMSRT-LPI model based on path masking and degree regression strategies and thereby achieving satisfactory outcomes. This represents the first known integration of path masking and degree regression strategies into the GAE framework for potential LPI inference. The effectiveness of our FMSRT-LPI model primarily relies on four key aspects. First, within the GAE framework, our model integrates multi-source relationships of lncRNAs and proteins with LPN’s topological data. Second, the implemented masking strategy efficiently identifies LPN’s key paths, reconstructs the network, and reduces the impact of redundant or incorrect data. Third, the integrated degree decoder balances degree and structural information, enhancing node representation. Fourth, the PolyLoss function we introduced is more appropriate for LPI prediction tasks. The results on multiple public datasets further demonstrate our model’s potential in LPI prediction. Our code and data can be freely accessed at .
融合多源关系和拓扑结构推断 lncRNA 与蛋白质的相互作用
长非编码 RNA(lncRNA)是参与生物调控网络的重要因子。准确预测lncRNA与蛋白质的相互作用(LPIs)对于阐明lncRNA的功能和致病机制至关重要。现有的深度学习模型在 LPI 预测方面尚未取得令人满意的结果。最近,图自动编码器(GAE)得到了快速发展,在链接预测和节点分类等任务中表现出色。我们将 GAE 技术用于 LPI 预测,设计了基于路径屏蔽和度回归策略的 FMSRT-LPI 模型,从而取得了令人满意的结果。这是首次将路径屏蔽和度回归策略整合到 GAE 框架中,用于潜在的 LPI 推断。我们的 FMSRT-LPI 模型的有效性主要依赖于四个关键方面。首先,在 GAE 框架内,我们的模型将 lncRNA 和蛋白质的多源关系与 LPN 的拓扑数据整合在一起。其次,实施的掩码策略能有效识别 LPN 的关键路径,重建网络,并减少冗余或错误数据的影响。第三,集成的度解码器平衡了度和结构信息,增强了节点的代表性。第四,我们引入的 PolyLoss 函数更适合 LPI 预测任务。在多个公共数据集上的结果进一步证明了我们的模型在 LPI 预测方面的潜力。我们的代码和数据可在以下网址免费获取。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Molecular Therapy. Nucleic Acids
Molecular Therapy. Nucleic Acids MEDICINE, RESEARCH & EXPERIMENTAL-
CiteScore
15.40
自引率
1.10%
发文量
336
审稿时长
20 weeks
期刊介绍: Molecular Therapy Nucleic Acids is an international, open-access journal that publishes high-quality research in nucleic-acid-based therapeutics to treat and correct genetic and acquired diseases. It is the official journal of the American Society of Gene & Cell Therapy and is built upon the success of Molecular Therapy. The journal focuses on gene- and oligonucleotide-based therapies and publishes peer-reviewed research, reviews, and commentaries. Its impact factor for 2022 is 8.8. The subject areas covered include the development of therapeutics based on nucleic acids and their derivatives, vector development for RNA-based therapeutics delivery, utilization of gene-modifying agents like Zn finger nucleases and triplex-forming oligonucleotides, pre-clinical target validation, safety and efficacy studies, and clinical trials.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信