Shengsi Zou, Qingxiao Cui, Jinyue Liu, Qiong Wu, Lijia Zhu, Da Chen, Yiping Du, Ting Wu
{"title":"液相色谱-高分辨率质谱数据增强峰检测的局部非对称高斯拟合算法","authors":"Shengsi Zou, Qingxiao Cui, Jinyue Liu, Qiong Wu, Lijia Zhu, Da Chen, Yiping Du, Ting Wu","doi":"10.1021/acs.analchem.5c00060","DOIUrl":null,"url":null,"abstract":"Feature detection is a crucial step in the data preprocessing workflow of liquid chromatography–mass spectrometry (LC–MS). However, many existing methods are hindered by intricate parameter adjustments and high false positive rates during extracted ion chromatogram (EIC) construction and peak detection, which challenges the identification of spurious and missing compounds. This study introduces a novel algorithm, local asymmetric Gaussian fitting (LAGF), for peak detection. LAGF integrates with the “data points bins” EIC extraction algorithm to enhance the feature detection efficiency. By using a 1 Da data points bin for EIC extraction, computational time is significantly reduced, making the method well-suited for batch metabolomics analysis. LAGF minimizes parameter numbers of generalized two-sided asymmetric Gaussian fitting by automatically determining the peak center (μ) and height (α) while accommodating two-sided standard deviations (σ<sub>1</sub> and σ<sub>2</sub>) to self-adaptively model peak patterns. Features are filtered based on a goodness-of-fit threshold of 0.5. The performance of LAGF was validated using standard mixtures and serum samples at different concentrations in reversed-phase or hydrophilic interaction LC mode. In most cases, LAGF outperformed conventional tools in terms of determination coefficient (<i>R</i><sup>2</sup>) and relative standard deviation for automatically detected peak areas. The LAGF algorithm is available as open-source Python code alongside an interactive interface, facilitating implementation in both nontargeted and targeted LC–MS analysis to enhance peak detection and compound identification.","PeriodicalId":27,"journal":{"name":"Analytical Chemistry","volume":"47 1","pages":""},"PeriodicalIF":6.7000,"publicationDate":"2025-05-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Local Asymmetric Gaussian Fitting Algorithm for Enhanced Peak Detection of Liquid Chromatography–High Resolution Mass Spectrometry Data\",\"authors\":\"Shengsi Zou, Qingxiao Cui, Jinyue Liu, Qiong Wu, Lijia Zhu, Da Chen, Yiping Du, Ting Wu\",\"doi\":\"10.1021/acs.analchem.5c00060\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Feature detection is a crucial step in the data preprocessing workflow of liquid chromatography–mass spectrometry (LC–MS). However, many existing methods are hindered by intricate parameter adjustments and high false positive rates during extracted ion chromatogram (EIC) construction and peak detection, which challenges the identification of spurious and missing compounds. This study introduces a novel algorithm, local asymmetric Gaussian fitting (LAGF), for peak detection. LAGF integrates with the “data points bins” EIC extraction algorithm to enhance the feature detection efficiency. By using a 1 Da data points bin for EIC extraction, computational time is significantly reduced, making the method well-suited for batch metabolomics analysis. LAGF minimizes parameter numbers of generalized two-sided asymmetric Gaussian fitting by automatically determining the peak center (μ) and height (α) while accommodating two-sided standard deviations (σ<sub>1</sub> and σ<sub>2</sub>) to self-adaptively model peak patterns. Features are filtered based on a goodness-of-fit threshold of 0.5. The performance of LAGF was validated using standard mixtures and serum samples at different concentrations in reversed-phase or hydrophilic interaction LC mode. In most cases, LAGF outperformed conventional tools in terms of determination coefficient (<i>R</i><sup>2</sup>) and relative standard deviation for automatically detected peak areas. The LAGF algorithm is available as open-source Python code alongside an interactive interface, facilitating implementation in both nontargeted and targeted LC–MS analysis to enhance peak detection and compound identification.\",\"PeriodicalId\":27,\"journal\":{\"name\":\"Analytical Chemistry\",\"volume\":\"47 1\",\"pages\":\"\"},\"PeriodicalIF\":6.7000,\"publicationDate\":\"2025-05-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Analytical Chemistry\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://doi.org/10.1021/acs.analchem.5c00060\",\"RegionNum\":1,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CHEMISTRY, ANALYTICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Analytical Chemistry","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1021/acs.analchem.5c00060","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, ANALYTICAL","Score":null,"Total":0}
Local Asymmetric Gaussian Fitting Algorithm for Enhanced Peak Detection of Liquid Chromatography–High Resolution Mass Spectrometry Data
Feature detection is a crucial step in the data preprocessing workflow of liquid chromatography–mass spectrometry (LC–MS). However, many existing methods are hindered by intricate parameter adjustments and high false positive rates during extracted ion chromatogram (EIC) construction and peak detection, which challenges the identification of spurious and missing compounds. This study introduces a novel algorithm, local asymmetric Gaussian fitting (LAGF), for peak detection. LAGF integrates with the “data points bins” EIC extraction algorithm to enhance the feature detection efficiency. By using a 1 Da data points bin for EIC extraction, computational time is significantly reduced, making the method well-suited for batch metabolomics analysis. LAGF minimizes parameter numbers of generalized two-sided asymmetric Gaussian fitting by automatically determining the peak center (μ) and height (α) while accommodating two-sided standard deviations (σ1 and σ2) to self-adaptively model peak patterns. Features are filtered based on a goodness-of-fit threshold of 0.5. The performance of LAGF was validated using standard mixtures and serum samples at different concentrations in reversed-phase or hydrophilic interaction LC mode. In most cases, LAGF outperformed conventional tools in terms of determination coefficient (R2) and relative standard deviation for automatically detected peak areas. The LAGF algorithm is available as open-source Python code alongside an interactive interface, facilitating implementation in both nontargeted and targeted LC–MS analysis to enhance peak detection and compound identification.
期刊介绍:
Analytical Chemistry, a peer-reviewed research journal, focuses on disseminating new and original knowledge across all branches of analytical chemistry. Fundamental articles may explore general principles of chemical measurement science and need not directly address existing or potential analytical methodology. They can be entirely theoretical or report experimental results. Contributions may cover various phases of analytical operations, including sampling, bioanalysis, electrochemistry, mass spectrometry, microscale and nanoscale systems, environmental analysis, separations, spectroscopy, chemical reactions and selectivity, instrumentation, imaging, surface analysis, and data processing. Papers discussing known analytical methods should present a significant, original application of the method, a notable improvement, or results on an important analyte.