{"title":"Amplifiable protein identification via residue-resolved barcoding and composition code counting.","authors":"Weiming Guo, Yuan Liu, Yu Han, Huan Tang, Xinyuan Fan, Chu Wang, Peng R Chen","doi":"10.1093/nsr/nwae183","DOIUrl":null,"url":null,"abstract":"<p><p>Ultrasensitive protein identification is of paramount importance in basic research and clinical diagnostics but remains extremely challenging. A key bottleneck in preventing single-molecule protein sequencing is that, unlike the revolutionary nucleic acid sequencing methods that rely on the polymerase chain reaction (PCR) to amplify DNA and RNA molecules, protein molecules cannot be directly amplified. Decoding the proteins via amplification of certain fingerprints rather than the intact protein sequence thus represents an appealing alternative choice to address this formidable challenge. Herein, we report a proof-of-concept method that relies on residue-resolved DNA barcoding and composition code counting for amplifiable protein fingerprinting (AmproCode). In AmproCode, selective types of residues on peptides or proteins are chemically labeled with a DNA barcode, which can be amplified and quantified via quantitative PCR. The operation generates a relative ratio as the residue-resolved 'composition code' for each target protein that can be utilized as the fingerprint to determine its identity from the proteome database. We developed a database searching algorithm and applied it to assess the coverage of the whole proteome and secretome via computational simulations, proving the theoretical feasibility of AmproCode. We then designed the residue-specific DNA barcoding and amplification workflow, and identified different synthetic model peptides found in the secretome at as low as the fmol/L level for demonstration. These results build the foundation for an unprecedented amplifiable protein fingerprinting method. We believe that, in the future, AmproCode could ultimately realize single-molecule amplifiable identification of trace complex samples without further purification, and it may open a new avenue in the development of next-generation protein sequencing techniques.</p>","PeriodicalId":18842,"journal":{"name":"National Science Review","volume":"11 7","pages":"nwae183"},"PeriodicalIF":16.3000,"publicationDate":"2024-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11272068/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"National Science Review","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1093/nsr/nwae183","RegionNum":1,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/7/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Ultrasensitive protein identification is of paramount importance in basic research and clinical diagnostics but remains extremely challenging. A key bottleneck in preventing single-molecule protein sequencing is that, unlike the revolutionary nucleic acid sequencing methods that rely on the polymerase chain reaction (PCR) to amplify DNA and RNA molecules, protein molecules cannot be directly amplified. Decoding the proteins via amplification of certain fingerprints rather than the intact protein sequence thus represents an appealing alternative choice to address this formidable challenge. Herein, we report a proof-of-concept method that relies on residue-resolved DNA barcoding and composition code counting for amplifiable protein fingerprinting (AmproCode). In AmproCode, selective types of residues on peptides or proteins are chemically labeled with a DNA barcode, which can be amplified and quantified via quantitative PCR. The operation generates a relative ratio as the residue-resolved 'composition code' for each target protein that can be utilized as the fingerprint to determine its identity from the proteome database. We developed a database searching algorithm and applied it to assess the coverage of the whole proteome and secretome via computational simulations, proving the theoretical feasibility of AmproCode. We then designed the residue-specific DNA barcoding and amplification workflow, and identified different synthetic model peptides found in the secretome at as low as the fmol/L level for demonstration. These results build the foundation for an unprecedented amplifiable protein fingerprinting method. We believe that, in the future, AmproCode could ultimately realize single-molecule amplifiable identification of trace complex samples without further purification, and it may open a new avenue in the development of next-generation protein sequencing techniques.
期刊介绍:
National Science Review (NSR; ISSN abbreviation: Natl. Sci. Rev.) is an English-language peer-reviewed multidisciplinary open-access scientific journal published by Oxford University Press under the auspices of the Chinese Academy of Sciences.According to Journal Citation Reports, its 2021 impact factor was 23.178.
National Science Review publishes both review articles and perspectives as well as original research in the form of brief communications and research articles.