An algorithm for peptide de novo sequencing from a group of SILAC labeled MS/MS spectra.

IF 0.7 4区生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Journal of Bioinformatics and Computational Biology Pub Date : 2025-06-01 Epub Date: 2025-07-15 DOI:10.1142/S0219720025500076

Fang Han, Kaizhong Zhang

{"title":"An algorithm for peptide de novo sequencing from a group of SILAC labeled MS/MS spectra.","authors":"Fang Han, Kaizhong Zhang","doi":"10.1142/S0219720025500076","DOIUrl":null,"url":null,"abstract":"<p><p>Shotgun proteomics coupled with high-performance liquid chromatography and mass spectrometry has been instrumental in identifying proteins in complex mixtures. Effective computational approaches are required to automate the spectra interpretation process to handle the vast amount of data collected in a single Liquid Chromatography-Tandem Mass Spectrometry (LC-MS/MS) run. De novo sequencing from MS/MS has emerged as a vital technology for peptide sequencing in proteomics. To enhance the accuracy and practicality of de novo sequencing, previous algorithms have utilized multiple spectra to identify peptide sequences. Here, our study focuses on de novo sequencing of multiple tandem mass spectra of peptides with stable isotope labeling with amino acids in cell culture (SILAC) by incorporating different isotope-labeled amino acids into newly synthesized proteins. Multiple MS/MS spectra for the same peptide sequence are produced by the spectrometer after the SILAC samples undergo processing by LC-MS/MS shotgun proteomics. Taking into consideration the factors such as retention time and precursor ion mass, we aim to identify the peptide sequence with specific SILAC modifications and their locations. To do so, we propose de novo sequencing algorithms to compute the potential candidate peptide sequence by using similarity scores, followed by refinement algorithms to evaluate them. We also use real experimental data to test the algorithms.</p>","PeriodicalId":48910,"journal":{"name":"Journal of Bioinformatics and Computational Biology","volume":" ","pages":"2550007"},"PeriodicalIF":0.7000,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Bioinformatics and Computational Biology","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1142/S0219720025500076","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/7/15 0:00:00","PubModel":"Epub","JCR":"Q4","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Shotgun proteomics coupled with high-performance liquid chromatography and mass spectrometry has been instrumental in identifying proteins in complex mixtures. Effective computational approaches are required to automate the spectra interpretation process to handle the vast amount of data collected in a single Liquid Chromatography-Tandem Mass Spectrometry (LC-MS/MS) run. De novo sequencing from MS/MS has emerged as a vital technology for peptide sequencing in proteomics. To enhance the accuracy and practicality of de novo sequencing, previous algorithms have utilized multiple spectra to identify peptide sequences. Here, our study focuses on de novo sequencing of multiple tandem mass spectra of peptides with stable isotope labeling with amino acids in cell culture (SILAC) by incorporating different isotope-labeled amino acids into newly synthesized proteins. Multiple MS/MS spectra for the same peptide sequence are produced by the spectrometer after the SILAC samples undergo processing by LC-MS/MS shotgun proteomics. Taking into consideration the factors such as retention time and precursor ion mass, we aim to identify the peptide sequence with specific SILAC modifications and their locations. To do so, we propose de novo sequencing algorithms to compute the potential candidate peptide sequence by using similarity scores, followed by refinement algorithms to evaluate them. We also use real experimental data to test the algorithms.

查看原文本刊更多论文

从一组SILAC标记的MS/MS光谱中进行肽从头测序的算法。

霰弹枪蛋白质组学与高效液相色谱和质谱相结合，在鉴定复杂混合物中的蛋白质方面发挥了重要作用。需要有效的计算方法来自动化光谱解释过程，以处理在单次液相色谱-串联质谱（LC-MS/MS）运行中收集的大量数据。MS/MS从头测序已成为蛋白质组学中肽段测序的重要技术。为了提高从头测序的准确性和实用性，以前的算法利用多光谱来识别肽序列。在这里，我们的研究重点是通过将不同的同位素标记的氨基酸加入到新合成的蛋白质中，对具有稳定同位素标记的细胞培养氨基酸（SILAC）肽的多个串联质谱进行从头测序。SILAC样品经LC-MS/MS霰弹枪蛋白质组学处理后，谱仪可生成多个相同肽序列的MS/MS谱图。考虑到保留时间和前体离子质量等因素，我们的目标是确定具有特定SILAC修饰的肽序列及其位置。为此，我们提出了从头测序算法，通过使用相似性评分来计算潜在的候选肽序列，然后使用改进算法来评估它们。我们还用真实的实验数据对算法进行了验证。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Bioinformatics and Computational Biology MATHEMATICAL & COMPUTATIONAL BIOLOGY-

CiteScore

2.10

自引率

0.00%

发文量

期刊介绍： The Journal of Bioinformatics and Computational Biology aims to publish high quality, original research articles, expository tutorial papers and review papers as well as short, critical comments on technical issues associated with the analysis of cellular information. The research papers will be technical presentations of new assertions, discoveries and tools, intended for a narrower specialist community. The tutorials, reviews and critical commentary will be targeted at a broader readership of biologists who are interested in using computers but are not knowledgeable about scientific computing, and equally, computer scientists who have an interest in biology but are not familiar with current thrusts nor the language of biology. Such carefully chosen tutorials and articles should greatly accelerate the rate of entry of these new creative scientists into the field.