缩小非靶向代谢组学中采集后样本归一化的知识差距。

IF 4.6 Q1 CHEMISTRY, ANALYTICAL
ACS Measurement Science Au Pub Date : 2024-10-14 eCollection Date: 2024-12-18 DOI:10.1021/acsmeasuresciau.4c00047
Brian Low, Yukai Wang, Tingting Zhao, Huaxu Yu, Tao Huan
{"title":"缩小非靶向代谢组学中采集后样本归一化的知识差距。","authors":"Brian Low, Yukai Wang, Tingting Zhao, Huaxu Yu, Tao Huan","doi":"10.1021/acsmeasuresciau.4c00047","DOIUrl":null,"url":null,"abstract":"<p><p>Sample normalization is a crucial step in metabolomics for fair quantitative comparisons. It aims to minimize sample-to-sample variations due to differences in the total metabolite amount. When samples lack a specific metabolic quantity to accurately represent their total metabolite amounts, post-acquisition sample normalization becomes essential. Despite many proposed normalization algorithms, understanding remains limited of their differences, hindering the selection of the most suitable one for a given metabolomics study. This study bridges this knowledge gap by employing data simulation, experimental simulation, and real experiments to elucidate the differences in the mechanism and performance among common post-acquisition sample normalization methods. Using public datasets, we first demonstrated the dramatic discrepancies between the outcomes of different sample normalization methods. Then, we benchmarked six normalization methods: sum, median, probabilistic quotient normalization (PQN), maximal density fold change (MDFC), quantile, and class-specific quantile. Our results show that most normalization methods are biased when there is unbalanced data, a phenomenon where the percentages of up- and downregulated metabolites are unequal. Notably, unbalanced data can be sourced from the underlying biological differences, experimental perturbations, and metabolic interference. Beyond normalization algorithms and data structure, our study also emphasizes the importance of considering additional factors contributed by data quality, such as background noise, signal saturation, and missingness. Based on these findings, we propose an evidence-based normalization strategy to maximize sample normalization outcomes, providing a robust bioinformatic solution for advancing metabolomics research with a fair quantitative comparison.</p>","PeriodicalId":29800,"journal":{"name":"ACS Measurement Science Au","volume":"4 6","pages":"702-711"},"PeriodicalIF":4.6000,"publicationDate":"2024-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11659990/pdf/","citationCount":"0","resultStr":"{\"title\":\"Closing the Knowledge Gap of Post-Acquisition Sample Normalization in Untargeted Metabolomics.\",\"authors\":\"Brian Low, Yukai Wang, Tingting Zhao, Huaxu Yu, Tao Huan\",\"doi\":\"10.1021/acsmeasuresciau.4c00047\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Sample normalization is a crucial step in metabolomics for fair quantitative comparisons. It aims to minimize sample-to-sample variations due to differences in the total metabolite amount. When samples lack a specific metabolic quantity to accurately represent their total metabolite amounts, post-acquisition sample normalization becomes essential. Despite many proposed normalization algorithms, understanding remains limited of their differences, hindering the selection of the most suitable one for a given metabolomics study. This study bridges this knowledge gap by employing data simulation, experimental simulation, and real experiments to elucidate the differences in the mechanism and performance among common post-acquisition sample normalization methods. Using public datasets, we first demonstrated the dramatic discrepancies between the outcomes of different sample normalization methods. Then, we benchmarked six normalization methods: sum, median, probabilistic quotient normalization (PQN), maximal density fold change (MDFC), quantile, and class-specific quantile. Our results show that most normalization methods are biased when there is unbalanced data, a phenomenon where the percentages of up- and downregulated metabolites are unequal. Notably, unbalanced data can be sourced from the underlying biological differences, experimental perturbations, and metabolic interference. Beyond normalization algorithms and data structure, our study also emphasizes the importance of considering additional factors contributed by data quality, such as background noise, signal saturation, and missingness. Based on these findings, we propose an evidence-based normalization strategy to maximize sample normalization outcomes, providing a robust bioinformatic solution for advancing metabolomics research with a fair quantitative comparison.</p>\",\"PeriodicalId\":29800,\"journal\":{\"name\":\"ACS Measurement Science Au\",\"volume\":\"4 6\",\"pages\":\"702-711\"},\"PeriodicalIF\":4.6000,\"publicationDate\":\"2024-10-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11659990/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ACS Measurement Science Au\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1021/acsmeasuresciau.4c00047\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/12/18 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q1\",\"JCRName\":\"CHEMISTRY, ANALYTICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Measurement Science Au","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1021/acsmeasuresciau.4c00047","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/12/18 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"CHEMISTRY, ANALYTICAL","Score":null,"Total":0}
引用次数: 0

摘要

样本归一化是代谢组学中公平定量比较的关键步骤。它的目的是尽量减少样品之间的差异,由于总代谢物量的差异。当样品缺乏特定的代谢量来准确地表示其总代谢物量时,采集后的样品规范化就变得至关重要。尽管提出了许多归一化算法,但对它们之间差异的理解仍然有限,这阻碍了为给定代谢组学研究选择最合适的算法。本研究通过数据模拟、实验模拟和实际实验来弥补这一知识差距,阐明了常见的采集后样本归一化方法在机制和性能上的差异。使用公共数据集,我们首先证明了不同样本归一化方法的结果之间的巨大差异。然后,我们对六种归一化方法进行了基准测试:和、中位数、概率商归一化(PQN)、最大密度折叠变化(MDFC)、分位数和类别特定分位数。我们的结果表明,当存在不平衡数据时,大多数归一化方法是有偏差的,这是一种向上和向下调节代谢物的百分比不相等的现象。值得注意的是,不平衡数据可能来自潜在的生物学差异、实验扰动和代谢干扰。除了归一化算法和数据结构之外,我们的研究还强调了考虑数据质量带来的其他因素的重要性,例如背景噪声、信号饱和度和缺失。基于这些发现,我们提出了一种基于证据的归一化策略,以最大化样本归一化结果,为推进代谢组学研究提供了一个强大的生物信息学解决方案,并进行了公平的定量比较。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Closing the Knowledge Gap of Post-Acquisition Sample Normalization in Untargeted Metabolomics.

Sample normalization is a crucial step in metabolomics for fair quantitative comparisons. It aims to minimize sample-to-sample variations due to differences in the total metabolite amount. When samples lack a specific metabolic quantity to accurately represent their total metabolite amounts, post-acquisition sample normalization becomes essential. Despite many proposed normalization algorithms, understanding remains limited of their differences, hindering the selection of the most suitable one for a given metabolomics study. This study bridges this knowledge gap by employing data simulation, experimental simulation, and real experiments to elucidate the differences in the mechanism and performance among common post-acquisition sample normalization methods. Using public datasets, we first demonstrated the dramatic discrepancies between the outcomes of different sample normalization methods. Then, we benchmarked six normalization methods: sum, median, probabilistic quotient normalization (PQN), maximal density fold change (MDFC), quantile, and class-specific quantile. Our results show that most normalization methods are biased when there is unbalanced data, a phenomenon where the percentages of up- and downregulated metabolites are unequal. Notably, unbalanced data can be sourced from the underlying biological differences, experimental perturbations, and metabolic interference. Beyond normalization algorithms and data structure, our study also emphasizes the importance of considering additional factors contributed by data quality, such as background noise, signal saturation, and missingness. Based on these findings, we propose an evidence-based normalization strategy to maximize sample normalization outcomes, providing a robust bioinformatic solution for advancing metabolomics research with a fair quantitative comparison.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
ACS Measurement Science Au
ACS Measurement Science Au 化学计量学-
CiteScore
5.20
自引率
0.00%
发文量
0
期刊介绍: ACS Measurement Science Au is an open access journal that publishes experimental computational or theoretical research in all areas of chemical measurement science. Short letters comprehensive articles reviews and perspectives are welcome on topics that report on any phase of analytical operations including sampling measurement and data analysis. This includes:Chemical Reactions and SelectivityChemometrics and Data ProcessingElectrochemistryElemental and Molecular CharacterizationImagingInstrumentationMass SpectrometryMicroscale and Nanoscale systemsOmics (Genomics Proteomics Metabonomics Metabolomics and Bioinformatics)Sensors and Sensing (Biosensors Chemical Sensors Gas Sensors Intracellular Sensors Single-Molecule Sensors Cell Chips Arrays Microfluidic Devices)SeparationsSpectroscopySurface analysisPapers dealing with established methods need to offer a significantly improved original application of the method.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信