基于mRNA数据,利用深度神经网络和LASSO回归预测miRNA表达变化。

IF 3.9 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY
Frontiers in bioinformatics Pub Date : 2025-07-08 eCollection Date: 2025-01-01 DOI:10.3389/fbinf.2025.1566162
Franz Leonard Böge, Helena U Zacharias, Stefanie C Becker, Klaus Jung
{"title":"基于mRNA数据,利用深度神经网络和LASSO回归预测miRNA表达变化。","authors":"Franz Leonard Böge, Helena U Zacharias, Stefanie C Becker, Klaus Jung","doi":"10.3389/fbinf.2025.1566162","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>Since the rise of molecular high-throughput technologies, many diseases are now studied on multiple omics layers in parallel. Understanding the interplay between microRNAs (miRNA) and their target mRNAs is important to understand the molecular level of diseases. While much public data from mRNA experiments are available for many diseases, few paired datasets with both miRNA and mRNA expression profiles are available. This study aimed to assess the possibility of predicting miRNA expression data based on mRNA expression data, serving as a proof of principle that such cross-omics predictions are feasible. Furthermore, current research relies on target databases where information about miRNA-target relationships is provided based on experimental and computational studies.</p><p><strong>Methods: </strong>To make use of publicly available mRNA profiles, we investigate the ability of artificial deep neural networks and linear least absolute shrinkage and selection operator (LASSO) regression to predict unknown miRNA expression profiles. We evaluate the approach using seven paired miRNA/mRNA expression datasets, four from studies on West Nile virus infection in mouse tissues and three from human immunodeficiency virus (HIV) infection in human tissues. We assessed the performance of each model first by within-data evaluations and second by cross-study evaluations. Furthermore, we investigated whether data augmentation or separate models for data from diseased and non-diseased samples can improve the prediction performance.</p><p><strong>Results: </strong>In general, most settings achieved strong correlations at the Level of individual samples. In some datasets and settings, correlations of log-fold changes and p-values from differential expression analysis (DEA) between true and predicted miRNA profiles can be observed. Correlation between log fold changes could also be seen in a cross-study evaluation for the HIV datasets. Data augmentation consistently improved performance in neural networks, while its impact on LASSO models was not significant.</p><p><strong>Discussion: </strong>Overall, cross-omics prediction of expression profiles appears possible, even with some correlations on the Level of the differential expression analysis.</p>","PeriodicalId":73066,"journal":{"name":"Frontiers in bioinformatics","volume":"5 ","pages":"1566162"},"PeriodicalIF":3.9000,"publicationDate":"2025-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12279838/pdf/","citationCount":"0","resultStr":"{\"title\":\"Using deep neural networks and LASSO regression to predict miRNA expression changes based on mRNA data.\",\"authors\":\"Franz Leonard Böge, Helena U Zacharias, Stefanie C Becker, Klaus Jung\",\"doi\":\"10.3389/fbinf.2025.1566162\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Introduction: </strong>Since the rise of molecular high-throughput technologies, many diseases are now studied on multiple omics layers in parallel. Understanding the interplay between microRNAs (miRNA) and their target mRNAs is important to understand the molecular level of diseases. While much public data from mRNA experiments are available for many diseases, few paired datasets with both miRNA and mRNA expression profiles are available. This study aimed to assess the possibility of predicting miRNA expression data based on mRNA expression data, serving as a proof of principle that such cross-omics predictions are feasible. Furthermore, current research relies on target databases where information about miRNA-target relationships is provided based on experimental and computational studies.</p><p><strong>Methods: </strong>To make use of publicly available mRNA profiles, we investigate the ability of artificial deep neural networks and linear least absolute shrinkage and selection operator (LASSO) regression to predict unknown miRNA expression profiles. We evaluate the approach using seven paired miRNA/mRNA expression datasets, four from studies on West Nile virus infection in mouse tissues and three from human immunodeficiency virus (HIV) infection in human tissues. We assessed the performance of each model first by within-data evaluations and second by cross-study evaluations. Furthermore, we investigated whether data augmentation or separate models for data from diseased and non-diseased samples can improve the prediction performance.</p><p><strong>Results: </strong>In general, most settings achieved strong correlations at the Level of individual samples. In some datasets and settings, correlations of log-fold changes and p-values from differential expression analysis (DEA) between true and predicted miRNA profiles can be observed. Correlation between log fold changes could also be seen in a cross-study evaluation for the HIV datasets. Data augmentation consistently improved performance in neural networks, while its impact on LASSO models was not significant.</p><p><strong>Discussion: </strong>Overall, cross-omics prediction of expression profiles appears possible, even with some correlations on the Level of the differential expression analysis.</p>\",\"PeriodicalId\":73066,\"journal\":{\"name\":\"Frontiers in bioinformatics\",\"volume\":\"5 \",\"pages\":\"1566162\"},\"PeriodicalIF\":3.9000,\"publicationDate\":\"2025-07-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12279838/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Frontiers in bioinformatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3389/fbinf.2025.1566162\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q2\",\"JCRName\":\"MATHEMATICAL & COMPUTATIONAL BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in bioinformatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/fbinf.2025.1566162","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

导读:随着分子高通量技术的兴起,许多疾病在多个组学层面上并行研究。了解microRNAs (miRNA)及其靶mrna之间的相互作用对于了解疾病的分子水平非常重要。虽然从mRNA实验中获得的许多公开数据可用于许多疾病,但很少有miRNA和mRNA表达谱的配对数据集可用。本研究旨在评估基于mRNA表达数据预测miRNA表达数据的可能性,从原则上证明这种跨组学预测是可行的。此外,目前的研究依赖于靶标数据库,其中关于mirna -靶标关系的信息是基于实验和计算研究提供的。方法:为了利用公开可用的mRNA谱,我们研究了人工深度神经网络和线性最小绝对收缩和选择算子(LASSO)回归预测未知miRNA表达谱的能力。我们使用7个配对的miRNA/mRNA表达数据集来评估该方法,其中4个来自小鼠组织中西尼罗病毒感染的研究,3个来自人类组织中人类免疫缺陷病毒(HIV)感染的研究。我们首先通过数据内评估和交叉研究评估来评估每个模型的性能。此外,我们还研究了数据增强或对患病和非患病样本的数据进行分离模型是否可以提高预测性能。结果:一般来说,大多数设置在单个样本水平上实现了强相关性。在一些数据集和设置中,可以观察到真实和预测miRNA谱之间的对数倍变化和差异表达分析(DEA)的p值的相关性。对数折线变化之间的相关性也可以在HIV数据集的交叉研究评估中看到。数据增强持续提高神经网络的性能,而其对LASSO模型的影响不显著。讨论:总的来说,跨组学预测表达谱似乎是可能的,即使在差异表达分析水平上存在一些相关性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Using deep neural networks and LASSO regression to predict miRNA expression changes based on mRNA data.

Using deep neural networks and LASSO regression to predict miRNA expression changes based on mRNA data.

Using deep neural networks and LASSO regression to predict miRNA expression changes based on mRNA data.

Using deep neural networks and LASSO regression to predict miRNA expression changes based on mRNA data.

Introduction: Since the rise of molecular high-throughput technologies, many diseases are now studied on multiple omics layers in parallel. Understanding the interplay between microRNAs (miRNA) and their target mRNAs is important to understand the molecular level of diseases. While much public data from mRNA experiments are available for many diseases, few paired datasets with both miRNA and mRNA expression profiles are available. This study aimed to assess the possibility of predicting miRNA expression data based on mRNA expression data, serving as a proof of principle that such cross-omics predictions are feasible. Furthermore, current research relies on target databases where information about miRNA-target relationships is provided based on experimental and computational studies.

Methods: To make use of publicly available mRNA profiles, we investigate the ability of artificial deep neural networks and linear least absolute shrinkage and selection operator (LASSO) regression to predict unknown miRNA expression profiles. We evaluate the approach using seven paired miRNA/mRNA expression datasets, four from studies on West Nile virus infection in mouse tissues and three from human immunodeficiency virus (HIV) infection in human tissues. We assessed the performance of each model first by within-data evaluations and second by cross-study evaluations. Furthermore, we investigated whether data augmentation or separate models for data from diseased and non-diseased samples can improve the prediction performance.

Results: In general, most settings achieved strong correlations at the Level of individual samples. In some datasets and settings, correlations of log-fold changes and p-values from differential expression analysis (DEA) between true and predicted miRNA profiles can be observed. Correlation between log fold changes could also be seen in a cross-study evaluation for the HIV datasets. Data augmentation consistently improved performance in neural networks, while its impact on LASSO models was not significant.

Discussion: Overall, cross-omics prediction of expression profiles appears possible, even with some correlations on the Level of the differential expression analysis.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
2.60
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信