液相色谱-高分辨率质谱数据增强峰检测的局部非对称高斯拟合算法

IF 6.7 1区 化学 Q1 CHEMISTRY, ANALYTICAL
Shengsi Zou, Qingxiao Cui, Jinyue Liu, Qiong Wu, Lijia Zhu, Da Chen, Yiping Du, Ting Wu
{"title":"液相色谱-高分辨率质谱数据增强峰检测的局部非对称高斯拟合算法","authors":"Shengsi Zou, Qingxiao Cui, Jinyue Liu, Qiong Wu, Lijia Zhu, Da Chen, Yiping Du, Ting Wu","doi":"10.1021/acs.analchem.5c00060","DOIUrl":null,"url":null,"abstract":"Feature detection is a crucial step in the data preprocessing workflow of liquid chromatography–mass spectrometry (LC–MS). However, many existing methods are hindered by intricate parameter adjustments and high false positive rates during extracted ion chromatogram (EIC) construction and peak detection, which challenges the identification of spurious and missing compounds. This study introduces a novel algorithm, local asymmetric Gaussian fitting (LAGF), for peak detection. LAGF integrates with the “data points bins” EIC extraction algorithm to enhance the feature detection efficiency. By using a 1 Da data points bin for EIC extraction, computational time is significantly reduced, making the method well-suited for batch metabolomics analysis. LAGF minimizes parameter numbers of generalized two-sided asymmetric Gaussian fitting by automatically determining the peak center (μ) and height (α) while accommodating two-sided standard deviations (σ<sub>1</sub> and σ<sub>2</sub>) to self-adaptively model peak patterns. Features are filtered based on a goodness-of-fit threshold of 0.5. The performance of LAGF was validated using standard mixtures and serum samples at different concentrations in reversed-phase or hydrophilic interaction LC mode. In most cases, LAGF outperformed conventional tools in terms of determination coefficient (<i>R</i><sup>2</sup>) and relative standard deviation for automatically detected peak areas. The LAGF algorithm is available as open-source Python code alongside an interactive interface, facilitating implementation in both nontargeted and targeted LC–MS analysis to enhance peak detection and compound identification.","PeriodicalId":27,"journal":{"name":"Analytical Chemistry","volume":"47 1","pages":""},"PeriodicalIF":6.7000,"publicationDate":"2025-05-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Local Asymmetric Gaussian Fitting Algorithm for Enhanced Peak Detection of Liquid Chromatography–High Resolution Mass Spectrometry Data\",\"authors\":\"Shengsi Zou, Qingxiao Cui, Jinyue Liu, Qiong Wu, Lijia Zhu, Da Chen, Yiping Du, Ting Wu\",\"doi\":\"10.1021/acs.analchem.5c00060\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Feature detection is a crucial step in the data preprocessing workflow of liquid chromatography–mass spectrometry (LC–MS). However, many existing methods are hindered by intricate parameter adjustments and high false positive rates during extracted ion chromatogram (EIC) construction and peak detection, which challenges the identification of spurious and missing compounds. This study introduces a novel algorithm, local asymmetric Gaussian fitting (LAGF), for peak detection. LAGF integrates with the “data points bins” EIC extraction algorithm to enhance the feature detection efficiency. By using a 1 Da data points bin for EIC extraction, computational time is significantly reduced, making the method well-suited for batch metabolomics analysis. LAGF minimizes parameter numbers of generalized two-sided asymmetric Gaussian fitting by automatically determining the peak center (μ) and height (α) while accommodating two-sided standard deviations (σ<sub>1</sub> and σ<sub>2</sub>) to self-adaptively model peak patterns. Features are filtered based on a goodness-of-fit threshold of 0.5. The performance of LAGF was validated using standard mixtures and serum samples at different concentrations in reversed-phase or hydrophilic interaction LC mode. In most cases, LAGF outperformed conventional tools in terms of determination coefficient (<i>R</i><sup>2</sup>) and relative standard deviation for automatically detected peak areas. The LAGF algorithm is available as open-source Python code alongside an interactive interface, facilitating implementation in both nontargeted and targeted LC–MS analysis to enhance peak detection and compound identification.\",\"PeriodicalId\":27,\"journal\":{\"name\":\"Analytical Chemistry\",\"volume\":\"47 1\",\"pages\":\"\"},\"PeriodicalIF\":6.7000,\"publicationDate\":\"2025-05-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Analytical Chemistry\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://doi.org/10.1021/acs.analchem.5c00060\",\"RegionNum\":1,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CHEMISTRY, ANALYTICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Analytical Chemistry","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1021/acs.analchem.5c00060","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, ANALYTICAL","Score":null,"Total":0}
引用次数: 0

摘要

特征检测是液相色谱-质谱(LC-MS)数据预处理流程中的关键步骤。然而,许多现有的方法在提取离子色谱(EIC)构建和峰检测过程中受到复杂的参数调整和高假阳性率的阻碍,这给假化合物和缺失化合物的识别带来了挑战。本文提出了一种新的峰值检测算法——局部不对称高斯拟合(LAGF)。LAGF与“数据点箱”EIC提取算法相结合,提高了特征检测效率。通过使用1 Da数据点提取EIC,大大减少了计算时间,使该方法非常适合批量代谢组学分析。LAGF通过自动确定峰中心(μ)和峰高(α),并适应自适应峰模式的双边标准差(σ1和σ2),实现了广义双面非对称高斯拟合参数数的最小化。特征是基于0.5的拟合优度阈值进行过滤的。使用标准混合物和不同浓度的血清样品在反相或亲水性相互作用LC模式下验证LAGF的性能。在大多数情况下,LAGF在确定系数(R2)和自动检测峰面积的相对标准偏差方面优于传统工具。LAGF算法以开源Python代码的形式提供,附带一个交互界面,便于在非靶向和靶向LC-MS分析中实现,以增强峰检测和化合物鉴定。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Local Asymmetric Gaussian Fitting Algorithm for Enhanced Peak Detection of Liquid Chromatography–High Resolution Mass Spectrometry Data

Local Asymmetric Gaussian Fitting Algorithm for Enhanced Peak Detection of Liquid Chromatography–High Resolution Mass Spectrometry Data
Feature detection is a crucial step in the data preprocessing workflow of liquid chromatography–mass spectrometry (LC–MS). However, many existing methods are hindered by intricate parameter adjustments and high false positive rates during extracted ion chromatogram (EIC) construction and peak detection, which challenges the identification of spurious and missing compounds. This study introduces a novel algorithm, local asymmetric Gaussian fitting (LAGF), for peak detection. LAGF integrates with the “data points bins” EIC extraction algorithm to enhance the feature detection efficiency. By using a 1 Da data points bin for EIC extraction, computational time is significantly reduced, making the method well-suited for batch metabolomics analysis. LAGF minimizes parameter numbers of generalized two-sided asymmetric Gaussian fitting by automatically determining the peak center (μ) and height (α) while accommodating two-sided standard deviations (σ1 and σ2) to self-adaptively model peak patterns. Features are filtered based on a goodness-of-fit threshold of 0.5. The performance of LAGF was validated using standard mixtures and serum samples at different concentrations in reversed-phase or hydrophilic interaction LC mode. In most cases, LAGF outperformed conventional tools in terms of determination coefficient (R2) and relative standard deviation for automatically detected peak areas. The LAGF algorithm is available as open-source Python code alongside an interactive interface, facilitating implementation in both nontargeted and targeted LC–MS analysis to enhance peak detection and compound identification.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Analytical Chemistry
Analytical Chemistry 化学-分析化学
CiteScore
12.10
自引率
12.20%
发文量
1949
审稿时长
1.4 months
期刊介绍: Analytical Chemistry, a peer-reviewed research journal, focuses on disseminating new and original knowledge across all branches of analytical chemistry. Fundamental articles may explore general principles of chemical measurement science and need not directly address existing or potential analytical methodology. They can be entirely theoretical or report experimental results. Contributions may cover various phases of analytical operations, including sampling, bioanalysis, electrochemistry, mass spectrometry, microscale and nanoscale systems, environmental analysis, separations, spectroscopy, chemical reactions and selectivity, instrumentation, imaging, surface analysis, and data processing. Papers discussing known analytical methods should present a significant, original application of the method, a notable improvement, or results on an important analyte.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信