Statistical approaches applicable in managing OMICS data: Urinary proteomics as exemplary case
De-Wei An, Yu-Ling Yu, Dries S. Martens, Agnieszka Latosinska, Zhen-Yu Zhang, Harald Mischak, Tim S. Nawrot, Jan A. Staessen
Mass Spectrometry Reviews, volume 43, issue 6, pages 1237-1254. Published 2023-05-04. DOI: 10.1002/mas.21849
https://onlinelibrary.wiley.com/doi/10.1002/mas.21849
Abstract
With urinary proteomics profiling (UPP) as an exemplary omics technology, this review describes a workflow for the analysis of omics data in large study populations. The proposed workflow includes: (i) planning omics studies and sample size considerations; (ii) preparing the data for analysis; (iii) preprocessing the UPP data; (iv) the basic statistical steps required for data curation; (v) the selection of covariables; (vi) relating continuously distributed or categorical outcomes to a series of single markers (e.g., sequenced urinary peptide fragments identifying the parental proteins); (vii) showing the added diagnostic or prognostic value of the UPP markers over and above classical risk factors; and (viii) pathway analysis to identify targets for personalized intervention in disease prevention or treatment. Additionally, two short sections address multiomics studies and machine learning, respectively. In conclusion, the analysis of adverse health outcomes in relation to omics biomarkers rests on the same statistical principles as the analysis of any other data collected in large population or patient cohorts. The large number of biomarkers that have to be considered simultaneously requires planning ahead how the study database will be structured, curated, and imported into statistical software packages, and how analysis results will be triaged for clinical relevance and presented.
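Step (vi) of the workflow relates an outcome to many markers one at a time, which yields one p-value per marker. The abstract does not say how this multiplicity is handled. As a minimal sketch, assuming the Benjamini-Hochberg false discovery rate procedure is an acceptable choice, the adjustment could look like the following. The marker names and p-values are hypothetical placeholders, not results from the review.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted p-values in the original order."""
    m = len(p_values)
    # Indices of the p-values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so adjusted values stay monotone.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, p_values[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted


# Hypothetical per-marker p-values from single-marker models (illustration only).
marker_p = {
    "peptide_A": 0.001,
    "peptide_B": 0.02,
    "peptide_C": 0.03,
    "peptide_D": 0.5,
}

adjusted = benjamini_hochberg(list(marker_p.values()))
# Keep markers whose adjusted p-value is at or below an assumed 5% FDR threshold.
selected = [name for name, q in zip(marker_p, adjusted) if q <= 0.05]
print(selected)
```

Other corrections, such as Bonferroni, are stricter and may suit studies where false positives are costlier than missed markers.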
About the journal:
The aim of the journal Mass Spectrometry Reviews is to publish well-written reviews on selected topics in the various sub-fields of mass spectrometry as a means to summarize the research that has been performed in that area, to focus the attention of other researchers, to critically review the published material, and to stimulate further research in that area.
The scope of the published reviews includes, but is not limited to, topics such as theoretical treatments, instrumental design, ionization methods, analyzers, detectors, application to the qualitative and quantitative analysis of various compounds or elements, basic ion chemistry and structure studies, ion energetic studies, and studies on biomolecules, polymers, etc.