Corey T. Watson , Andrew M. Collins , Mats Ohlin , James M. Heather , Ayelet Peres , William D. Lees , Gur Yaari
{"title":"Building immunoglobulin and T cell receptor gene databases for the future","authors":"Corey T. Watson , Andrew M. Collins , Mats Ohlin , James M. Heather , Ayelet Peres , William D. Lees , Gur Yaari","doi":"10.1016/j.immuno.2025.100059","DOIUrl":null,"url":null,"abstract":"<div><div>Genetic databases for immunoglobulin (IG) and T cell receptor (TR) genes have evolved from small catalogs to critical resources underpinning immunogenetic research. Accurate annotation enables the analysis of repertoire diversity, somatic hypermutation, clonal relationships, and lineage development. Recent advances in high-throughput repertoire sequencing and long-read genomics now allow for unprecedented discovery of germline variation across populations and species, but they also expose limitations of existing resources. Here, we discuss the historical evolution of IG/TR databases, highlight the challenges and opportunities presented by changing data landscapes, and outline strategies for building future databases that integrate genomic and expression data, support population diversity, and align with evolving nomenclature frameworks. Enhanced germline resources will be essential for accurate annotation, reproducible research, and the next generation of immunological discovery and clinical translation.</div></div>","PeriodicalId":73343,"journal":{"name":"Immunoinformatics (Amsterdam, Netherlands)","volume":"19 ","pages":"Article 100059"},"PeriodicalIF":0.0000,"publicationDate":"2025-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Immunoinformatics (Amsterdam, Netherlands)","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2667119025000126","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Genetic databases for immunoglobulin (IG) and T cell receptor (TR) genes have evolved from small catalogs to critical resources underpinning immunogenetic research. Accurate annotation enables the analysis of repertoire diversity, somatic hypermutation, clonal relationships, and lineage development. Recent advances in high-throughput repertoire sequencing and long-read genomics now allow for unprecedented discovery of germline variation across populations and species, but they also expose limitations of existing resources. Here, we discuss the historical evolution of IG/TR databases, highlight the challenges and opportunities presented by changing data landscapes, and outline strategies for building future databases that integrate genomic and expression data, support population diversity, and align with evolving nomenclature frameworks. Enhanced germline resources will be essential for accurate annotation, reproducible research, and the next generation of immunological discovery and clinical translation.