Towards automated discrimination of lipids versus peptides from full scan mass spectra

Q4 Biochemistry, Genetics and Molecular Biology
Piotr Dittwald , Trung Nghia Vu , Glenn A. Harris , Richard M. Caprioli , Raf Van de Plas , Kris Laukens , Anna Gambin , Dirk Valkenborg
{"title":"Towards automated discrimination of lipids versus peptides from full scan mass spectra","authors":"Piotr Dittwald ,&nbsp;Trung Nghia Vu ,&nbsp;Glenn A. Harris ,&nbsp;Richard M. Caprioli ,&nbsp;Raf Van de Plas ,&nbsp;Kris Laukens ,&nbsp;Anna Gambin ,&nbsp;Dirk Valkenborg","doi":"10.1016/j.euprot.2014.05.002","DOIUrl":null,"url":null,"abstract":"<div><p>Although physicochemical fractionation techniques play a crucial role in the analysis of complex mixtures, they are not necessarily the best solution to separate specific molecular classes, such as lipids and peptides. Any physical fractionation step such as, for example, those based on liquid chromatography, will introduce its own variation and noise. In this paper we investigate to what extent the high sensitivity and resolution of contemporary mass spectrometers offers viable opportunities for computational separation of signals in full scan spectra. We introduce an automatic method that can discriminate peptide from lipid peaks in full scan mass spectra, based on their isotopic properties. We systematically evaluate which features maximally contribute to a peptide versus lipid classification. The selected features are subsequently used to build a random forest classifier that enables almost perfect separation between lipid and peptide signals without requiring ion fragmentation and classical tandem MS-based identification approaches. The classifier is trained on <em>in silico</em> data, but is also capable of discriminating signals in real world experiments. We evaluate the influence of typical data inaccuracies of common classes of mass spectrometry instruments on the optimal set of discriminant features. Finally, the method is successfully extended towards the classification of individual lipid classes from full scan mass spectral features, based on input data defined by the Lipid Maps Consortium.</p></div>","PeriodicalId":38260,"journal":{"name":"EuPA Open Proteomics","volume":"4 ","pages":"Pages 87-100"},"PeriodicalIF":0.0000,"publicationDate":"2014-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.euprot.2014.05.002","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"EuPA Open Proteomics","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S221296851400035X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Biochemistry, Genetics and Molecular Biology","Score":null,"Total":0}
引用次数: 5

Abstract

Although physicochemical fractionation techniques play a crucial role in the analysis of complex mixtures, they are not necessarily the best solution to separate specific molecular classes, such as lipids and peptides. Any physical fractionation step such as, for example, those based on liquid chromatography, will introduce its own variation and noise. In this paper we investigate to what extent the high sensitivity and resolution of contemporary mass spectrometers offers viable opportunities for computational separation of signals in full scan spectra. We introduce an automatic method that can discriminate peptide from lipid peaks in full scan mass spectra, based on their isotopic properties. We systematically evaluate which features maximally contribute to a peptide versus lipid classification. The selected features are subsequently used to build a random forest classifier that enables almost perfect separation between lipid and peptide signals without requiring ion fragmentation and classical tandem MS-based identification approaches. The classifier is trained on in silico data, but is also capable of discriminating signals in real world experiments. We evaluate the influence of typical data inaccuracies of common classes of mass spectrometry instruments on the optimal set of discriminant features. Finally, the method is successfully extended towards the classification of individual lipid classes from full scan mass spectral features, based on input data defined by the Lipid Maps Consortium.

Abstract Image

Abstract Image

Abstract Image

脂质与多肽全扫描质谱自动鉴别研究
尽管物化分馏技术在复杂混合物的分析中起着至关重要的作用,但它们不一定是分离特定分子类别(如脂质和肽)的最佳解决方案。任何物理分馏步骤,例如基于液相色谱的分馏步骤,都会引入其自身的变化和噪声。本文研究了当代质谱仪的高灵敏度和高分辨率在多大程度上为全扫描光谱信号的计算分离提供了可行的机会。我们介绍了一种自动方法,可以区分肽和脂质峰在全扫描质谱,基于它们的同位素性质。我们系统地评估哪些特征最大程度地有助于肽与脂质分类。选择的特征随后用于构建随机森林分类器,使脂质和肽信号之间几乎完美的分离,而不需要离子碎片化和经典的串联质谱识别方法。该分类器是在计算机数据上训练的,但也能够在现实世界的实验中识别信号。我们评估了常见类型质谱仪器的典型数据不准确性对最佳鉴别特征集的影响。最后,基于脂质图谱联盟定义的输入数据,该方法成功地扩展到从全扫描质谱特征中对单个脂类进行分类。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
EuPA Open Proteomics
EuPA Open Proteomics Biochemistry, Genetics and Molecular Biology-Biochemistry
自引率
0.00%
发文量
0
审稿时长
103 days
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信