Using flux theory in dynamic omics data sets to identify differentially changing signals using DPoP.

IF 2.9 3区生物学 Q2 BIOCHEMICAL RESEARCH METHODS

BMC Bioinformatics Pub Date : 2024-09-27 DOI:10.1186/s12859-024-05938-9

Harley Edwards, Joseph Zavorskas, Walker Huso, Alexander G Doan, Caton Silbiger, Steven Harris, Ranjan Srivastava, Mark R Marten

{"title":"Using flux theory in dynamic omics data sets to identify differentially changing signals using DPoP.","authors":"Harley Edwards, Joseph Zavorskas, Walker Huso, Alexander G Doan, Caton Silbiger, Steven Harris, Ranjan Srivastava, Mark R Marten","doi":"10.1186/s12859-024-05938-9","DOIUrl":null,"url":null,"abstract":"Background: Derivative profiling is a novel approach to identify differential signals from dynamic omics data sets. This approach applies variable step-size differentiation to time dynamic omics data. This work assumes that there is a general omics derivative that is a useful and descriptive feature of dynamic omics experiments. We assert that this omics derivative, or omics flux, is a valuable descriptor that can be used instead of, or with, fold change calculations.Results: The results of derivative profiling are compared to established methods such as Multivariate Adaptive Regression Splines, significance versus fold change analysis (Volcano), and an adjusted ratio over intensity (M/A) analysis to find that there is a statistically significant similarity between the results. This comparison is repeated for transcriptomic and phosphoproteomic expression profiles previously characterized in Aspergillus nidulans. This method has been packaged in an open-source, GUI-based MATLAB app, the Derivative Profiling omics Package (DPoP). Gene Ontology (GO) term enrichment has been included in the app so that a user can automatically/programmatically describe the over/under-represented GO terms in the derivative profiling results using domain specific knowledge found in their organism's specific GO database file. The advantage of the DPoP analysis is that it is computationally inexpensive, it does not require fold change calculations, it describes both instantaneous as well as overall behavior, and it achieves statistical confidence with signal trajectories of a single bio-replicate over four or more points.Conclusions: While we apply this method to time dynamic transcriptomic and phosphoproteomic datasets, it is a numerically generalizable technique that can be applied to any organism and any field interested in time series data analysis. The app described in this work enables omics researchers with no computer science background to apply derivative profiling to their data sets, while also allowing multidisciplined users to build on the nascent idea of profiling derivatives in omics.","PeriodicalId":8958,"journal":{"name":"BMC Bioinformatics","volume":"25 1","pages":"312"},"PeriodicalIF":2.9000,"publicationDate":"2024-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11437665/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Bioinformatics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s12859-024-05938-9","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}

引用次数: 0

Abstract

Background: Derivative profiling is a novel approach to identify differential signals from dynamic omics data sets. This approach applies variable step-size differentiation to time dynamic omics data. This work assumes that there is a general omics derivative that is a useful and descriptive feature of dynamic omics experiments. We assert that this omics derivative, or omics flux, is a valuable descriptor that can be used instead of, or with, fold change calculations.

Results: The results of derivative profiling are compared to established methods such as Multivariate Adaptive Regression Splines, significance versus fold change analysis (Volcano), and an adjusted ratio over intensity (M/A) analysis to find that there is a statistically significant similarity between the results. This comparison is repeated for transcriptomic and phosphoproteomic expression profiles previously characterized in Aspergillus nidulans. This method has been packaged in an open-source, GUI-based MATLAB app, the Derivative Profiling omics Package (DPoP). Gene Ontology (GO) term enrichment has been included in the app so that a user can automatically/programmatically describe the over/under-represented GO terms in the derivative profiling results using domain specific knowledge found in their organism's specific GO database file. The advantage of the DPoP analysis is that it is computationally inexpensive, it does not require fold change calculations, it describes both instantaneous as well as overall behavior, and it achieves statistical confidence with signal trajectories of a single bio-replicate over four or more points.

Conclusions: While we apply this method to time dynamic transcriptomic and phosphoproteomic datasets, it is a numerically generalizable technique that can be applied to any organism and any field interested in time series data analysis. The app described in this work enables omics researchers with no computer science background to apply derivative profiling to their data sets, while also allowing multidisciplined users to build on the nascent idea of profiling derivatives in omics.

查看原文本刊更多论文

在动态 omics 数据集中使用通量理论，利用 DPoP 识别差异变化信号。

背景：衍生分析是一种从动态 omics 数据集中识别差异信号的新方法。这种方法将步长可变的微分应用于时间动态 omics 数据。这项工作假定存在一种通用的 omics 衍生物，它是动态 omics 实验的一个有用的描述性特征。我们认为，这种 omics 衍生物或 omics 通量是一种有价值的描述符，可用于替代折叠变化计算或与之一起使用：结果：我们将衍生分析的结果与多元自适应回归样条曲线、显著性与折叠变化分析（火山）以及调整后的强度比（M/A）分析等成熟方法进行了比较，发现这些结果在统计学上具有显著的相似性。对黑曲霉先前表征的转录组和磷酸蛋白组表达谱进行重复比较。这种方法已被打包成一个开源的、基于图形用户界面的 MATLAB 应用程序--衍射剖析 omics 软件包（DPoP）。该程序还包含基因本体（GO）术语富集功能，这样用户就可以利用生物特定 GO 数据库文件中的特定领域知识，自动/编程描述衍生剖析结果中代表性过高/过低的 GO 术语。DPoP 分析法的优点是计算成本低廉，不需要折叠变化计算，既能描述瞬时行为，也能描述整体行为，而且能通过单个生物复制品在四个或更多点上的信号轨迹实现统计置信度：我们将这种方法应用于时间动态转录组和磷酸蛋白组数据集，但它是一种数值通用技术，可应用于任何生物体和对时间序列数据分析感兴趣的任何领域。这项工作中描述的应用程序能让没有计算机科学背景的全局组学研究人员将导数剖析应用到他们的数据集，同时还能让多学科用户在全局组学导数剖析这一新兴理念的基础上更上一层楼。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

BMC Bioinformatics 生物-生化研究方法

CiteScore

5.70

自引率

3.30%

发文量

506

审稿时长

4.3 months

期刊介绍： BMC Bioinformatics is an open access, peer-reviewed journal that considers articles on all aspects of the development, testing and novel application of computational and statistical methods for the modeling and analysis of all kinds of biological data, as well as other areas of computational biology. BMC Bioinformatics is part of the BMC series which publishes subject-specific journals focused on the needs of individual research communities across all areas of biology and medicine. We offer an efficient, fair and friendly peer review service, and are committed to publishing all sound science, provided that there is some advance in knowledge presented by the work.