Deep Coverage and Extended Sequence Reads Obtained with a Single Archaeal Protease Expedite de novo Protein Sequencing by Mass Spectrometry.

bioRxiv : the preprint server for biology Pub Date : 2025-05-29 DOI:10.1101/2025.05.26.656138

Tatiana M Shamorkina, Laura Pérez Pañeda, Tereza Kadavá, Douwe Schulte, Patrick Pribil, Sibylle Heidelberger, Allison Michele Narlock-Brand, Steven M Yannone, Joost Snijder, Albert J R Heck

{"title":"Deep Coverage and Extended Sequence Reads Obtained with a Single Archaeal Protease Expedite de novo Protein Sequencing by Mass Spectrometry.","authors":"Tatiana M Shamorkina, Laura Pérez Pañeda, Tereza Kadavá, Douwe Schulte, Patrick Pribil, Sibylle Heidelberger, Allison Michele Narlock-Brand, Steven M Yannone, Joost Snijder, Albert J R Heck","doi":"10.1101/2025.05.26.656138","DOIUrl":null,"url":null,"abstract":"The ability to sequence proteins without reliance on a genomic template defines a critical frontier in modern proteomics. This approach, known as de novo protein sequencing, is essential for applications such as antibody sequencing, microbiome proteomics, and antigen discovery, which require accurate reconstruction of peptide and protein sequences. While trypsin remains the gold-standard protease in proteomics, its restricted cleavage specificity limits peptide diversity. This constraint is especially problematic in antibody sequencing, where the functionally critical regions often lack canonical tryptic sites. As a result, exclusively trypsin-based approaches yield sparse reads, leading to sequence gaps. Multi-protease and hybrid-fragmentation strategies can improve the sequence coverage, but they add complexity, compromise scalability and reproducibility. Here, we explore two HyperThermoacidic Archaeal (HTA)-proteases as single-enzyme solutions for de novo antibody sequencing. Each HTA-protease generated about five times more unique peptide reads than trypsin or chymotrypsin, providing high redundancy across all CDRs. Combined with EAciD fragmentation on a ZenoTOF 7600 system, this approach enabled complete, unambiguous antibody sequencing. De novo analysis using PEAKS/DeepNovo and Stitch showed up to fourfold higher alignment scores and reduced the sequence errors within the HTA-generated data. Additionally, the HTA-EAciD approach offers short digestion times, eliminates extensive cleanup, and enables analysis in a single LC-MS/MS run. This streamlined, single-protease approach delivers therefore performance comparable to multi-enzyme workflows, offering a scalable and efficient strategy for de novo protein sequencing across diverse applications.","PeriodicalId":519960,"journal":{"name":"bioRxiv : the preprint server for biology","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12154832/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"bioRxiv : the preprint server for biology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2025.05.26.656138","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

The ability to sequence proteins without reliance on a genomic template defines a critical frontier in modern proteomics. This approach, known as de novo protein sequencing, is essential for applications such as antibody sequencing, microbiome proteomics, and antigen discovery, which require accurate reconstruction of peptide and protein sequences. While trypsin remains the gold-standard protease in proteomics, its restricted cleavage specificity limits peptide diversity. This constraint is especially problematic in antibody sequencing, where the functionally critical regions often lack canonical tryptic sites. As a result, exclusively trypsin-based approaches yield sparse reads, leading to sequence gaps. Multi-protease and hybrid-fragmentation strategies can improve the sequence coverage, but they add complexity, compromise scalability and reproducibility. Here, we explore two HyperThermoacidic Archaeal (HTA)-proteases as single-enzyme solutions for de novo antibody sequencing. Each HTA-protease generated about five times more unique peptide reads than trypsin or chymotrypsin, providing high redundancy across all CDRs. Combined with EAciD fragmentation on a ZenoTOF 7600 system, this approach enabled complete, unambiguous antibody sequencing. De novo analysis using PEAKS/DeepNovo and Stitch showed up to fourfold higher alignment scores and reduced the sequence errors within the HTA-generated data. Additionally, the HTA-EAciD approach offers short digestion times, eliminates extensive cleanup, and enables analysis in a single LC-MS/MS run. This streamlined, single-protease approach delivers therefore performance comparable to multi-enzyme workflows, offering a scalable and efficient strategy for de novo protein sequencing across diverse applications.

Abstract Image

查看原文本刊更多论文

通过质谱法获得单一古菌蛋白酶加速从头蛋白测序的深度覆盖和扩展序列读取。

在不依赖基因组模板的情况下对蛋白质进行测序的能力定义了现代蛋白质组学的一个关键前沿。这种方法被称为从头蛋白质测序，对于抗体测序、微生物组蛋白质组学和抗原发现等需要精确重建肽和蛋白质序列的应用至关重要。虽然胰蛋白酶仍然是蛋白质组学中的金标准蛋白酶，但其有限的裂解特异性限制了肽的多样性。这种限制在抗体测序中尤其成问题，其中功能关键区域通常缺乏典型的色氨酸位点。因此，完全基于胰蛋白酶的方法产生稀疏读取，导致序列间隙。多蛋白酶和混合片段策略可以提高序列覆盖率，但它们增加了复杂性，损害了可扩展性和可重复性。在这里，我们探索了两种超热酸性古细菌（HTA）蛋白酶作为单酶溶液进行从头抗体测序。每个hta蛋白酶产生的独特肽序列是胰蛋白酶或凝乳胰蛋白酶的5倍，从而在所有cdr中提供高冗余。结合ZenoTOF 7600系统上的EAciD片段，该方法实现了完整、明确的抗体测序。使用PEAKS/DeepNovo和Stitch进行的从头分析显示，比对分数提高了四倍，并减少了hta生成数据中的序列错误。此外，HTA-EAciD方法可缩短消化时间，消除大量清理，并可在单次LC-MS/MS运行中进行分析。因此，这种简化的单蛋白酶方法提供了与多酶工作流程相当的性能，为跨不同应用的从头蛋白质测序提供了可扩展和高效的策略。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

bioRxiv : the preprint server for biology

自引率

0.00%

发文量