Dean Bryant, Benjamin Sale, Giorgia Chiodin, Dylan Tatterton, Benjamin Stevens, Alyssa Adlaon, Erin Snook, James Batchelor, Alberto Orfao, Francesco Forconi
{"title":"利用IgSeqR从RNA测序数据中鉴定、组装和表征肿瘤免疫球蛋白转录物。","authors":"Dean Bryant, Benjamin Sale, Giorgia Chiodin, Dylan Tatterton, Benjamin Stevens, Alyssa Adlaon, Erin Snook, James Batchelor, Alberto Orfao, Francesco Forconi","doi":"10.1038/s41596-025-01172-6","DOIUrl":null,"url":null,"abstract":"<p><p>Immunoglobulin gene analysis provides fundamental insight into B cell receptor structure and function. In B cell tumors, it can provide information on the cell of origin and predict clinical outcomes. Its clinical value has been established in the two main types of chronic lymphocytic leukemia, which are distinguished by the expression of unmutated or mutated immunoglobulin heavy chain variable region (IGHV) genes, and is emerging in other B cell tumors. The traditional PCR and Sanger sequencing-based techniques for immunoglobulin gene analysis are labor-intensive and rely on attaining either a dominant sequence or a small number of subclonal sequences. Extraction of the expressed tumor immunoglobulin transcripts by using high-throughput RNA-sequencing (RNA-seq) can be faster, allow the collection of the tumor immunoglobulin sequence and match this with the rest of the RNA-seq data. Analytical tools are regularly sought to increase the accuracy, depth and speed of acquisition of the immunoglobulin transcript sequences and combine the immunoglobulin characteristics with other tumor features. We provide here a user-friendly protocol for the rapid (~1 h) de novo assembly, identification and accurate characterization of the full (leader to constant region) tumor immunoglobulin templated and non-templated transcript sequence from RNA-seq data ( https://github.com/ForconiLab/IgSeqR ). The derived amino acid sequences can be interrogated for their physicochemical characteristics and, in certain lymphomas, be used to predict tumor glycan types occupying acquired N-glycosylation sites. These features will then be available for association studies with the tumor transcriptome. The resulting information can also help refine diagnosis, prognosis and potential therapeutic targeting in the most common lymphomas.</p>","PeriodicalId":18901,"journal":{"name":"Nature Protocols","volume":" ","pages":""},"PeriodicalIF":13.1000,"publicationDate":"2025-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Identification, assembly and characterization of tumor immunoglobulin transcripts from RNA sequencing data using IgSeqR.\",\"authors\":\"Dean Bryant, Benjamin Sale, Giorgia Chiodin, Dylan Tatterton, Benjamin Stevens, Alyssa Adlaon, Erin Snook, James Batchelor, Alberto Orfao, Francesco Forconi\",\"doi\":\"10.1038/s41596-025-01172-6\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Immunoglobulin gene analysis provides fundamental insight into B cell receptor structure and function. In B cell tumors, it can provide information on the cell of origin and predict clinical outcomes. Its clinical value has been established in the two main types of chronic lymphocytic leukemia, which are distinguished by the expression of unmutated or mutated immunoglobulin heavy chain variable region (IGHV) genes, and is emerging in other B cell tumors. The traditional PCR and Sanger sequencing-based techniques for immunoglobulin gene analysis are labor-intensive and rely on attaining either a dominant sequence or a small number of subclonal sequences. Extraction of the expressed tumor immunoglobulin transcripts by using high-throughput RNA-sequencing (RNA-seq) can be faster, allow the collection of the tumor immunoglobulin sequence and match this with the rest of the RNA-seq data. Analytical tools are regularly sought to increase the accuracy, depth and speed of acquisition of the immunoglobulin transcript sequences and combine the immunoglobulin characteristics with other tumor features. We provide here a user-friendly protocol for the rapid (~1 h) de novo assembly, identification and accurate characterization of the full (leader to constant region) tumor immunoglobulin templated and non-templated transcript sequence from RNA-seq data ( https://github.com/ForconiLab/IgSeqR ). The derived amino acid sequences can be interrogated for their physicochemical characteristics and, in certain lymphomas, be used to predict tumor glycan types occupying acquired N-glycosylation sites. These features will then be available for association studies with the tumor transcriptome. The resulting information can also help refine diagnosis, prognosis and potential therapeutic targeting in the most common lymphomas.</p>\",\"PeriodicalId\":18901,\"journal\":{\"name\":\"Nature Protocols\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":13.1000,\"publicationDate\":\"2025-04-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Nature Protocols\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1038/s41596-025-01172-6\",\"RegionNum\":1,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"BIOCHEMICAL RESEARCH METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nature Protocols","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1038/s41596-025-01172-6","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0
摘要
免疫球蛋白基因分析提供了对B细胞受体结构和功能的基本认识。在B细胞肿瘤中,它可以提供关于细胞起源的信息并预测临床结果。其临床价值已在两种主要类型的慢性淋巴细胞白血病中确立,这两种类型以未突变或突变的免疫球蛋白重链可变区(immunoglobulin heavy chain variable region, IGHV)基因的表达为区别,并且正在其他B细胞肿瘤中出现。传统的基于PCR和Sanger测序的免疫球蛋白基因分析技术是劳动密集型的,并且依赖于获得显性序列或少量亚克隆序列。使用高通量rna测序(RNA-seq)提取表达的肿瘤免疫球蛋白转录本可以更快,允许收集肿瘤免疫球蛋白序列并将其与其余RNA-seq数据进行匹配。人们经常寻求分析工具,以提高免疫球蛋白转录序列获取的准确性、深度和速度,并将免疫球蛋白特征与其他肿瘤特征结合起来。我们在此提供了一种用户友好的方案,用于快速(~1小时)从头组装,鉴定和准确表征来自RNA-seq数据的完整(从先导区到恒定区)肿瘤免疫球蛋白模板和非模板转录序列(https://github.com/ForconiLab/IgSeqR)。衍生的氨基酸序列可以询问其物理化学特征,并且在某些淋巴瘤中,可用于预测占据获得性n糖基化位点的肿瘤聚糖类型。这些特征将可用于与肿瘤转录组的关联研究。由此产生的信息也可以帮助改进诊断,预后和潜在的治疗目标在最常见的淋巴瘤。
Identification, assembly and characterization of tumor immunoglobulin transcripts from RNA sequencing data using IgSeqR.
Immunoglobulin gene analysis provides fundamental insight into B cell receptor structure and function. In B cell tumors, it can provide information on the cell of origin and predict clinical outcomes. Its clinical value has been established in the two main types of chronic lymphocytic leukemia, which are distinguished by the expression of unmutated or mutated immunoglobulin heavy chain variable region (IGHV) genes, and is emerging in other B cell tumors. The traditional PCR and Sanger sequencing-based techniques for immunoglobulin gene analysis are labor-intensive and rely on attaining either a dominant sequence or a small number of subclonal sequences. Extraction of the expressed tumor immunoglobulin transcripts by using high-throughput RNA-sequencing (RNA-seq) can be faster, allow the collection of the tumor immunoglobulin sequence and match this with the rest of the RNA-seq data. Analytical tools are regularly sought to increase the accuracy, depth and speed of acquisition of the immunoglobulin transcript sequences and combine the immunoglobulin characteristics with other tumor features. We provide here a user-friendly protocol for the rapid (~1 h) de novo assembly, identification and accurate characterization of the full (leader to constant region) tumor immunoglobulin templated and non-templated transcript sequence from RNA-seq data ( https://github.com/ForconiLab/IgSeqR ). The derived amino acid sequences can be interrogated for their physicochemical characteristics and, in certain lymphomas, be used to predict tumor glycan types occupying acquired N-glycosylation sites. These features will then be available for association studies with the tumor transcriptome. The resulting information can also help refine diagnosis, prognosis and potential therapeutic targeting in the most common lymphomas.
期刊介绍:
Nature Protocols focuses on publishing protocols used to address significant biological and biomedical science research questions, including methods grounded in physics and chemistry with practical applications to biological problems. The journal caters to a primary audience of research scientists and, as such, exclusively publishes protocols with research applications. Protocols primarily aimed at influencing patient management and treatment decisions are not featured.
The specific techniques covered encompass a wide range, including but not limited to: Biochemistry, Cell biology, Cell culture, Chemical modification, Computational biology, Developmental biology, Epigenomics, Genetic analysis, Genetic modification, Genomics, Imaging, Immunology, Isolation, purification, and separation, Lipidomics, Metabolomics, Microbiology, Model organisms, Nanotechnology, Neuroscience, Nucleic-acid-based molecular biology, Pharmacology, Plant biology, Protein analysis, Proteomics, Spectroscopy, Structural biology, Synthetic chemistry, Tissue culture, Toxicology, and Virology.