Dean Bryant, Benjamin Sale, Giorgia Chiodin, Dylan Tatterton, Benjamin Stevens, Alyssa Adlaon, Erin Snook, James Batchelor, Alberto Orfao, Francesco Forconi
{"title":"Identification, assembly and characterization of tumor immunoglobulin transcripts from RNA sequencing data using IgSeqR.","authors":"Dean Bryant, Benjamin Sale, Giorgia Chiodin, Dylan Tatterton, Benjamin Stevens, Alyssa Adlaon, Erin Snook, James Batchelor, Alberto Orfao, Francesco Forconi","doi":"10.1038/s41596-025-01172-6","DOIUrl":null,"url":null,"abstract":"<p><p>Immunoglobulin gene analysis provides fundamental insight into B cell receptor structure and function. In B cell tumors, it can provide information on the cell of origin and predict clinical outcomes. Its clinical value has been established in the two main types of chronic lymphocytic leukemia, which are distinguished by the expression of unmutated or mutated immunoglobulin heavy chain variable region (IGHV) genes, and is emerging in other B cell tumors. The traditional PCR and Sanger sequencing-based techniques for immunoglobulin gene analysis are labor-intensive and rely on attaining either a dominant sequence or a small number of subclonal sequences. Extraction of the expressed tumor immunoglobulin transcripts by using high-throughput RNA-sequencing (RNA-seq) can be faster, allow the collection of the tumor immunoglobulin sequence and match this with the rest of the RNA-seq data. Analytical tools are regularly sought to increase the accuracy, depth and speed of acquisition of the immunoglobulin transcript sequences and combine the immunoglobulin characteristics with other tumor features. We provide here a user-friendly protocol for the rapid (~1 h) de novo assembly, identification and accurate characterization of the full (leader to constant region) tumor immunoglobulin templated and non-templated transcript sequence from RNA-seq data ( https://github.com/ForconiLab/IgSeqR ). The derived amino acid sequences can be interrogated for their physicochemical characteristics and, in certain lymphomas, be used to predict tumor glycan types occupying acquired N-glycosylation sites. These features will then be available for association studies with the tumor transcriptome. The resulting information can also help refine diagnosis, prognosis and potential therapeutic targeting in the most common lymphomas.</p>","PeriodicalId":18901,"journal":{"name":"Nature Protocols","volume":" ","pages":""},"PeriodicalIF":13.1000,"publicationDate":"2025-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nature Protocols","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1038/s41596-025-01172-6","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
Immunoglobulin gene analysis provides fundamental insight into B cell receptor structure and function. In B cell tumors, it can provide information on the cell of origin and predict clinical outcomes. Its clinical value has been established in the two main types of chronic lymphocytic leukemia, which are distinguished by the expression of unmutated or mutated immunoglobulin heavy chain variable region (IGHV) genes, and is emerging in other B cell tumors. The traditional PCR and Sanger sequencing-based techniques for immunoglobulin gene analysis are labor-intensive and rely on attaining either a dominant sequence or a small number of subclonal sequences. Extraction of the expressed tumor immunoglobulin transcripts by using high-throughput RNA-sequencing (RNA-seq) can be faster, allow the collection of the tumor immunoglobulin sequence and match this with the rest of the RNA-seq data. Analytical tools are regularly sought to increase the accuracy, depth and speed of acquisition of the immunoglobulin transcript sequences and combine the immunoglobulin characteristics with other tumor features. We provide here a user-friendly protocol for the rapid (~1 h) de novo assembly, identification and accurate characterization of the full (leader to constant region) tumor immunoglobulin templated and non-templated transcript sequence from RNA-seq data ( https://github.com/ForconiLab/IgSeqR ). The derived amino acid sequences can be interrogated for their physicochemical characteristics and, in certain lymphomas, be used to predict tumor glycan types occupying acquired N-glycosylation sites. These features will then be available for association studies with the tumor transcriptome. The resulting information can also help refine diagnosis, prognosis and potential therapeutic targeting in the most common lymphomas.
期刊介绍:
Nature Protocols focuses on publishing protocols used to address significant biological and biomedical science research questions, including methods grounded in physics and chemistry with practical applications to biological problems. The journal caters to a primary audience of research scientists and, as such, exclusively publishes protocols with research applications. Protocols primarily aimed at influencing patient management and treatment decisions are not featured.
The specific techniques covered encompass a wide range, including but not limited to: Biochemistry, Cell biology, Cell culture, Chemical modification, Computational biology, Developmental biology, Epigenomics, Genetic analysis, Genetic modification, Genomics, Imaging, Immunology, Isolation, purification, and separation, Lipidomics, Metabolomics, Microbiology, Model organisms, Nanotechnology, Neuroscience, Nucleic-acid-based molecular biology, Pharmacology, Plant biology, Protein analysis, Proteomics, Spectroscopy, Structural biology, Synthetic chemistry, Tissue culture, Toxicology, and Virology.