通过对 HLA I 类和 II 类等位基因进行扩展和全长测序,解决 IPD-IMGT/HLA 数据库中的未知核苷酸问题。

IF 2.9 4区 医学 Q2 GENETICS & HEREDITY
Immunogenetics Pub Date : 2024-04-01 Epub Date: 2024-02-24 DOI:10.1007/s00251-024-01333-z
Christina E M Voorter, Mathijs Groeneweg, Timo I Olieslagers, Ingrid Fae, Gottfried F Fischer, Marco Andreani, Maria Troiano, Blanka Vidan-Jeras, Sendi Montanic, Bouke G Hepkema, Laura B Bungener, Marcel G J Tilanus, Lotte Wieten
{"title":"通过对 HLA I 类和 II 类等位基因进行扩展和全长测序,解决 IPD-IMGT/HLA 数据库中的未知核苷酸问题。","authors":"Christina E M Voorter, Mathijs Groeneweg, Timo I Olieslagers, Ingrid Fae, Gottfried F Fischer, Marco Andreani, Maria Troiano, Blanka Vidan-Jeras, Sendi Montanic, Bouke G Hepkema, Laura B Bungener, Marcel G J Tilanus, Lotte Wieten","doi":"10.1007/s00251-024-01333-z","DOIUrl":null,"url":null,"abstract":"<p><p>In the past, identification of HLA alleles was limited to sequencing the region of the gene coding for the peptide binding groove, resulting in a lack of sequence information in the HLA database, challenging HLA allele assignment software programs. We investigated full-length sequences of 19 HLA class I and 7 HLA class II alleles, and we extended another 47 HLA class I alleles with sequences of 5' and 3' UTR regions that were all not yet available in the IPD-IMGT/HLA database. We resolved 8638 unknown nucleotides in the coding sequence of HLA class I and 2139 of HLA class II. Furthermore, with full-length sequencing of the 26 alleles, more than 90 kb of sequence information was added to the non-coding sequences, whereas extension of the 47 alleles resulted in the addition of 5.5 kb unknown nucleotides to the 5' UTR and > 31.7 kb to the 3' UTR region. With this information, some interesting features were observed, like possible recombination events and lineage evolutionary origins. The continuing increase in the availability of full-length sequences in the HLA database will enable the identification of the evolutionary origin and will help the community to improve the alignment and assignment accuracy of HLA alleles.</p>","PeriodicalId":13446,"journal":{"name":"Immunogenetics","volume":null,"pages":null},"PeriodicalIF":2.9000,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10944811/pdf/","citationCount":"0","resultStr":"{\"title\":\"Resolving unknown nucleotides in the IPD-IMGT/HLA database by extended and full-length sequencing of HLA class I and II alleles.\",\"authors\":\"Christina E M Voorter, Mathijs Groeneweg, Timo I Olieslagers, Ingrid Fae, Gottfried F Fischer, Marco Andreani, Maria Troiano, Blanka Vidan-Jeras, Sendi Montanic, Bouke G Hepkema, Laura B Bungener, Marcel G J Tilanus, Lotte Wieten\",\"doi\":\"10.1007/s00251-024-01333-z\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>In the past, identification of HLA alleles was limited to sequencing the region of the gene coding for the peptide binding groove, resulting in a lack of sequence information in the HLA database, challenging HLA allele assignment software programs. We investigated full-length sequences of 19 HLA class I and 7 HLA class II alleles, and we extended another 47 HLA class I alleles with sequences of 5' and 3' UTR regions that were all not yet available in the IPD-IMGT/HLA database. We resolved 8638 unknown nucleotides in the coding sequence of HLA class I and 2139 of HLA class II. Furthermore, with full-length sequencing of the 26 alleles, more than 90 kb of sequence information was added to the non-coding sequences, whereas extension of the 47 alleles resulted in the addition of 5.5 kb unknown nucleotides to the 5' UTR and > 31.7 kb to the 3' UTR region. With this information, some interesting features were observed, like possible recombination events and lineage evolutionary origins. The continuing increase in the availability of full-length sequences in the HLA database will enable the identification of the evolutionary origin and will help the community to improve the alignment and assignment accuracy of HLA alleles.</p>\",\"PeriodicalId\":13446,\"journal\":{\"name\":\"Immunogenetics\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":2.9000,\"publicationDate\":\"2024-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10944811/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Immunogenetics\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1007/s00251-024-01333-z\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/2/24 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q2\",\"JCRName\":\"GENETICS & HEREDITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Immunogenetics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s00251-024-01333-z","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/2/24 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0

摘要

过去,HLA 等位基因的鉴定仅限于对编码肽结合沟的基因区域进行测序,导致 HLA 数据库中缺乏序列信息,给 HLA 等位基因分配软件程序带来了挑战。我们研究了 19 个 HLA I 类等位基因和 7 个 HLA II 类等位基因的全长序列,并用 IPD-IMGT/HLA 数据库中尚未提供的 5' 和 3' UTR 区域的序列扩展了另外 47 个 HLA I 类等位基因。我们解决了 HLA I 类编码序列中的 8638 个未知核苷酸和 HLA II 类编码序列中的 2139 个未知核苷酸。此外,通过对 26 个等位基因进行全长测序,非编码序列中增加了超过 90 kb 的序列信息,而对 47 个等位基因进行延伸后,5'UTR 中增加了 5.5 kb 的未知核苷酸,3'UTR 区域增加了超过 31.7 kb 的未知核苷酸。根据这些信息,我们观察到了一些有趣的特征,如可能的重组事件和品系进化起源。随着 HLA 数据库中全长序列的不断增加,我们将能够确定其进化起源,并帮助社区提高 HLA 等位基因的比对和赋值准确性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Resolving unknown nucleotides in the IPD-IMGT/HLA database by extended and full-length sequencing of HLA class I and II alleles.

Resolving unknown nucleotides in the IPD-IMGT/HLA database by extended and full-length sequencing of HLA class I and II alleles.

In the past, identification of HLA alleles was limited to sequencing the region of the gene coding for the peptide binding groove, resulting in a lack of sequence information in the HLA database, challenging HLA allele assignment software programs. We investigated full-length sequences of 19 HLA class I and 7 HLA class II alleles, and we extended another 47 HLA class I alleles with sequences of 5' and 3' UTR regions that were all not yet available in the IPD-IMGT/HLA database. We resolved 8638 unknown nucleotides in the coding sequence of HLA class I and 2139 of HLA class II. Furthermore, with full-length sequencing of the 26 alleles, more than 90 kb of sequence information was added to the non-coding sequences, whereas extension of the 47 alleles resulted in the addition of 5.5 kb unknown nucleotides to the 5' UTR and > 31.7 kb to the 3' UTR region. With this information, some interesting features were observed, like possible recombination events and lineage evolutionary origins. The continuing increase in the availability of full-length sequences in the HLA database will enable the identification of the evolutionary origin and will help the community to improve the alignment and assignment accuracy of HLA alleles.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Immunogenetics
Immunogenetics 医学-免疫学
CiteScore
6.20
自引率
6.20%
发文量
48
审稿时长
1 months
期刊介绍: Immunogenetics publishes original papers, brief communications, and reviews on research in the following areas: genetics and evolution of the immune system; genetic control of immune response and disease susceptibility; bioinformatics of the immune system; structure of immunologically important molecules; and immunogenetics of reproductive biology, tissue differentiation, and development.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信