免疫球蛋白和T细胞受体种系集合的AIRR社区管理和标准化代表

Immunoinformatics (Amsterdam, Netherlands) Pub Date : 2023-06-01 DOI:10.1016/j.immuno.2023.100025

William D. Lees , Scott Christley , Ayelet Peres , Justin T. Kos , Brian Corrie , Duncan Ralph , Felix Breden , Lindsay G. Cowell , Gur Yaari , Martin Corcoran , Gunilla B. Karlsson Hedestam , Mats Ohlin , Andrew M. Collins , Corey T. Watson , Christian E. Busse , The AIRR Community

{"title":"免疫球蛋白和T细胞受体种系集合的AIRR社区管理和标准化代表","authors":"William D. Lees , Scott Christley , Ayelet Peres , Justin T. Kos , Brian Corrie , Duncan Ralph , Felix Breden , Lindsay G. Cowell , Gur Yaari , Martin Corcoran , Gunilla B. Karlsson Hedestam , Mats Ohlin , Andrew M. Collins , Corey T. Watson , Christian E. Busse , The AIRR Community","doi":"10.1016/j.immuno.2023.100025","DOIUrl":null,"url":null,"abstract":"<div><p>Analysis of an individual's immunoglobulin or T cell receptor gene repertoire can provide important insights into immune function. High-quality analysis of adaptive immune receptor repertoire sequencing data depends upon accurate and relatively complete germline sets, but current sets are known to be incomplete. Established processes for the review and systematic naming of receptor germline genes and alleles require specific evidence and data types, but the discovery landscape is rapidly changing. To exploit the potential of emerging data, and to provide the field with improved state-of-the-art germline sets, an intermediate approach is needed that will allow the rapid publication of consolidated sets derived from these emerging sources. These sets must use a consistent naming scheme and allow refinement and consolidation into genes as new information emerges. Name changes should be minimised, but, where changes occur, the naming history of a sequence must be traceable. Here we outline the current issues and opportunities for the curation of germline IG/TR genes and present a forward-looking data model for building out more robust germline sets that can dovetail with current established processes. We describe interoperability standards for germline sets, and an approach to transparency based on principles of findability, accessibility, interoperability, and reusability.</p></div>","PeriodicalId":73343,"journal":{"name":"Immunoinformatics (Amsterdam, Netherlands)","volume":"10 ","pages":"Article 100025"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10310305/pdf/","citationCount":"1","resultStr":"{\"title\":\"AIRR community curation and standardised representation for immunoglobulin and T cell receptor germline sets\",\"authors\":\"William D. Lees , Scott Christley , Ayelet Peres , Justin T. Kos , Brian Corrie , Duncan Ralph , Felix Breden , Lindsay G. Cowell , Gur Yaari , Martin Corcoran , Gunilla B. Karlsson Hedestam , Mats Ohlin , Andrew M. Collins , Corey T. Watson , Christian E. Busse , The AIRR Community\",\"doi\":\"10.1016/j.immuno.2023.100025\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Analysis of an individual's immunoglobulin or T cell receptor gene repertoire can provide important insights into immune function. High-quality analysis of adaptive immune receptor repertoire sequencing data depends upon accurate and relatively complete germline sets, but current sets are known to be incomplete. Established processes for the review and systematic naming of receptor germline genes and alleles require specific evidence and data types, but the discovery landscape is rapidly changing. To exploit the potential of emerging data, and to provide the field with improved state-of-the-art germline sets, an intermediate approach is needed that will allow the rapid publication of consolidated sets derived from these emerging sources. These sets must use a consistent naming scheme and allow refinement and consolidation into genes as new information emerges. Name changes should be minimised, but, where changes occur, the naming history of a sequence must be traceable. Here we outline the current issues and opportunities for the curation of germline IG/TR genes and present a forward-looking data model for building out more robust germline sets that can dovetail with current established processes. We describe interoperability standards for germline sets, and an approach to transparency based on principles of findability, accessibility, interoperability, and reusability.</p></div>\",\"PeriodicalId\":73343,\"journal\":{\"name\":\"Immunoinformatics (Amsterdam, Netherlands)\",\"volume\":\"10 \",\"pages\":\"Article 100025\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10310305/pdf/\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Immunoinformatics (Amsterdam, Netherlands)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2667119023000058\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Immunoinformatics (Amsterdam, Netherlands)","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2667119023000058","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

分析个体的免疫球蛋白或T细胞受体基因库可以为免疫功能提供重要的见解。适应性免疫受体库测序数据的高质量分析依赖于准确和相对完整的生殖系集，但目前的集已知是不完整的。对受体生殖系基因和等位基因进行审查和系统命名的既定过程需要特定的证据和数据类型，但发现领域正在迅速变化。为了利用新出现的数据的潜力，并向该领域提供改进的最先进的生殖系集，需要一种中间方法，以便能够迅速出版从这些新出现的来源获得的综合集。这些集合必须使用一致的命名方案，并允许随着新信息的出现而细化和整合到基因中。应该尽量减少名称更改，但是，在发生更改的地方，序列的命名历史必须是可跟踪的。在这里，我们概述了生殖系IG/TR基因管理的当前问题和机遇，并提出了一个前瞻性的数据模型，用于构建更强大的生殖系集，可以与当前已建立的过程相吻合。我们描述了生殖系集的互操作性标准，以及基于可查找性、可访问性、互操作性和可重用性原则的透明性方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

AIRR community curation and standardised representation for immunoglobulin and T cell receptor germline sets

查看原文本刊更多论文

AIRR community curation and standardised representation for immunoglobulin and T cell receptor germline sets

Analysis of an individual's immunoglobulin or T cell receptor gene repertoire can provide important insights into immune function. High-quality analysis of adaptive immune receptor repertoire sequencing data depends upon accurate and relatively complete germline sets, but current sets are known to be incomplete. Established processes for the review and systematic naming of receptor germline genes and alleles require specific evidence and data types, but the discovery landscape is rapidly changing. To exploit the potential of emerging data, and to provide the field with improved state-of-the-art germline sets, an intermediate approach is needed that will allow the rapid publication of consolidated sets derived from these emerging sources. These sets must use a consistent naming scheme and allow refinement and consolidation into genes as new information emerges. Name changes should be minimised, but, where changes occur, the naming history of a sequence must be traceable. Here we outline the current issues and opportunities for the curation of germline IG/TR genes and present a forward-looking data model for building out more robust germline sets that can dovetail with current established processes. We describe interoperability standards for germline sets, and an approach to transparency based on principles of findability, accessibility, interoperability, and reusability.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Immunoinformatics (Amsterdam, Netherlands) Immunology, Computer Science Applications

自引率

0.00%

发文量

审稿时长

60 days