A fast region of interest algorithm for efficient data compression and improved peak detection in high-resolution mass spectrometry.

IF 3.8 2区 化学 Q1 BIOCHEMICAL RESEARCH METHODS
Oskar Munk Kronik, Jan H Christensen, Nikoline Juul Nielsen, Selina Tisler, Giorgio Tomasi
{"title":"A fast region of interest algorithm for efficient data compression and improved peak detection in high-resolution mass spectrometry.","authors":"Oskar Munk Kronik, Jan H Christensen, Nikoline Juul Nielsen, Selina Tisler, Giorgio Tomasi","doi":"10.1007/s00216-024-05718-7","DOIUrl":null,"url":null,"abstract":"<p><p>Liquid chromatography coupled to high-resolution mass spectrometry (LC-HRMS) is commonly used for identification of compounds in complex samples due to the high chromatographic and mass spectral resolution provided. In subsequent data processing workflows, it is imperative to preserve this resolution to fully exploit the data. \"Region of interest\" (ROI) algorithms were introduced as a better alternative to equidistant binning for compressing HRMS data because they better preserve the mass spectral resolution. In this paper, we present a new ROI algorithm that improves on the selection of contiguous m/z traces, amongst others by introducing the concept of chromatographic filter, allows for an automated approach to optimise the admissible mass-to-charge deviation (δ<sub>m/z</sub>) and can be used to match ROIs across multiple samples. The algorithm was tested on a LC-HRMS dataset comprised of 21 replicate injections of a wastewater effluent extract and assessed on its ability to correctly retrieve the ROI's relative to 57 compounds and match them across all injections. In summary, it achieved a ten-fold compression rate in on-disk storage at a noise threshold of 200 counts, and the median ROI length matched the observed chromatographic peak width (12-23 points). Correct ROI matching with a mass accuracy of 9 ppm was observed for 52 compounds across all 21 injections with only one compound split between two adjacent m/z traces in six runs. Overall, the new algorithm performed favourably compared to the ROI algorithm currently used in the well-established ROI-MCR (multivariate curve resolution) workflow for deconvolution of HRMS chromatographic data.</p>","PeriodicalId":462,"journal":{"name":"Analytical and Bioanalytical Chemistry","volume":" ","pages":""},"PeriodicalIF":3.8000,"publicationDate":"2025-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Analytical and Bioanalytical Chemistry","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1007/s00216-024-05718-7","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0

Abstract

Liquid chromatography coupled to high-resolution mass spectrometry (LC-HRMS) is commonly used for identification of compounds in complex samples due to the high chromatographic and mass spectral resolution provided. In subsequent data processing workflows, it is imperative to preserve this resolution to fully exploit the data. "Region of interest" (ROI) algorithms were introduced as a better alternative to equidistant binning for compressing HRMS data because they better preserve the mass spectral resolution. In this paper, we present a new ROI algorithm that improves on the selection of contiguous m/z traces, amongst others by introducing the concept of chromatographic filter, allows for an automated approach to optimise the admissible mass-to-charge deviation (δm/z) and can be used to match ROIs across multiple samples. The algorithm was tested on a LC-HRMS dataset comprised of 21 replicate injections of a wastewater effluent extract and assessed on its ability to correctly retrieve the ROI's relative to 57 compounds and match them across all injections. In summary, it achieved a ten-fold compression rate in on-disk storage at a noise threshold of 200 counts, and the median ROI length matched the observed chromatographic peak width (12-23 points). Correct ROI matching with a mass accuracy of 9 ppm was observed for 52 compounds across all 21 injections with only one compound split between two adjacent m/z traces in six runs. Overall, the new algorithm performed favourably compared to the ROI algorithm currently used in the well-established ROI-MCR (multivariate curve resolution) workflow for deconvolution of HRMS chromatographic data.

一种快速感兴趣区域算法,用于高效的数据压缩和改进的高分辨率质谱峰检测。
液相色谱联用高分辨率质谱法(LC-HRMS)通常用于鉴定复杂样品中的化合物,因为它提供了高色谱和质谱分辨率。在随后的数据处理工作流程中,必须保持该分辨率以充分利用数据。“感兴趣区域”(ROI)算法作为压缩HRMS数据的一种更好的替代方法,因为它们可以更好地保持质谱分辨率。在本文中,我们提出了一种新的ROI算法,该算法通过引入色谱过滤器的概念,改进了连续m/z走线的选择,允许自动化方法来优化可接受的质电荷偏差(δm/z),并可用于匹配多个样品的ROI。该算法在LC-HRMS数据集上进行了测试,该数据集包括21次废水萃取物的重复注射,并评估了其正确检索相对于57种化合物的ROI的能力,并在所有注射中进行了匹配。总之,在噪声阈值为200个计数时,它在磁盘存储中实现了10倍的压缩率,并且ROI长度中位数与观察到的色谱峰宽(12-23点)匹配。在所有21次注入中,52种化合物的ROI匹配正确,质量精度为9ppm,在6次注入中,只有一种化合物在两个相邻的m/z道之间分裂。总的来说,与目前用于HRMS色谱数据反卷积的完善的ROI- mcr(多元曲线分辨率)工作流程中的ROI算法相比,新算法表现良好。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
8.00
自引率
4.70%
发文量
638
审稿时长
2.1 months
期刊介绍: Analytical and Bioanalytical Chemistry’s mission is the rapid publication of excellent and high-impact research articles on fundamental and applied topics of analytical and bioanalytical measurement science. Its scope is broad, and ranges from novel measurement platforms and their characterization to multidisciplinary approaches that effectively address important scientific problems. The Editors encourage submissions presenting innovative analytical research in concept, instrumentation, methods, and/or applications, including: mass spectrometry, spectroscopy, and electroanalysis; advanced separations; analytical strategies in “-omics” and imaging, bioanalysis, and sampling; miniaturized devices, medical diagnostics, sensors; analytical characterization of nano- and biomaterials; chemometrics and advanced data analysis.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信