AIRR community curation and standardised representation for immunoglobulin and T cell receptor germline sets

William D. Lees, Scott Christley, Ayelet Peres, Justin T. Kos, Brian Corrie, Duncan Ralph, Felix Breden, Lindsay G. Cowell, Gur Yaari, Martin Corcoran, Gunilla B. Karlsson Hedestam, Mats Ohlin, Andrew M. Collins, Corey T. Watson, Christian E. Busse, The AIRR Community
Immunoinformatics (Amsterdam, Netherlands), volume 10, Article 100025. Published 2023-06-01.
DOI: 10.1016/j.immuno.2023.100025
Publisher page: https://www.sciencedirect.com/science/article/pii/S2667119023000058
PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10310305/pdf/

Abstract

Analysis of an individual's immunoglobulin or T cell receptor gene repertoire can provide important insights into immune function. High-quality analysis of adaptive immune receptor repertoire sequencing data depends upon accurate and relatively complete germline sets, but current sets are known to be incomplete. Established processes for the review and systematic naming of receptor germline genes and alleles require specific evidence and data types, but the discovery landscape is rapidly changing. To exploit the potential of emerging data, and to provide the field with improved state-of-the-art germline sets, an intermediate approach is needed that will allow the rapid publication of consolidated sets derived from these emerging sources. These sets must use a consistent naming scheme and allow refinement and consolidation into genes as new information emerges. Name changes should be minimised, but, where changes occur, the naming history of a sequence must be traceable. Here we outline the current issues and opportunities for the curation of germline IG/TR genes and present a forward-looking data model for building out more robust germline sets that can dovetail with current established processes. We describe interoperability standards for germline sets, and an approach to transparency based on principles of findability, accessibility, interoperability, and reusability.
