Nanopore sequencing with unique molecular identifiers enables accurate mutation analysis and haplotyping in the complex lipoprotein(a) KIV-2 VNTR.

IF 10.4 1区 生物学 Q1 GENETICS & HEREDITY
Stephan Amstler, Gertraud Streiter, Cathrin Pfurtscheller, Lukas Forer, Silvia Di Maio, Hansi Weissensteiner, Bernhard Paulweber, Sebastian Schönherr, Florian Kronenberg, Stefan Coassin
{"title":"Nanopore sequencing with unique molecular identifiers enables accurate mutation analysis and haplotyping in the complex lipoprotein(a) KIV-2 VNTR.","authors":"Stephan Amstler, Gertraud Streiter, Cathrin Pfurtscheller, Lukas Forer, Silvia Di Maio, Hansi Weissensteiner, Bernhard Paulweber, Sebastian Schönherr, Florian Kronenberg, Stefan Coassin","doi":"10.1186/s13073-024-01391-8","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Repetitive genome regions, such as variable number of tandem repeats (VNTR) or short tandem repeats (STR), are major constituents of the uncharted dark genome and evade conventional sequencing approaches. The protein-coding LPA kringle IV type-2 (KIV-2) VNTR (5.6 kb per unit, 1-40 units per allele) is a medically highly relevant example with a particularly intricate structure, multiple haplotypes, intragenic homologies, and an intra-VNTR STR. It is the primary regulator of plasma lipoprotein(a) [Lp(a)] concentrations, an important cardiovascular risk factor. Lp(a) concentrations vary widely between individuals and ancestries. Multiple variants and functional haplotypes in the LPA gene and especially in the KIV-2 VNTR strongly contribute to this variance.</p><p><strong>Methods: </strong>We evaluated the performance of amplicon-based nanopore sequencing with unique molecular identifiers (UMI-ONT-Seq) for SNP detection, haplotype mapping, VNTR unit consensus sequence generation, and copy number estimation via coverage-corrected haplotypes quantification in the KIV-2 VNTR. We used 15 human samples and low-level mixtures (0.5 to 5%) of KIV-2 plasmids as a validation set. We then applied UMI-ONT-Seq to extract KIV-2 VNTR haplotypes in 48 multi-ancestry 1000 Genome samples and analyzed at scale a poorly characterized STR within the KIV-2 VNTR.</p><p><strong>Results: </strong>UMI-ONT-Seq detected KIV-2 SNPs down to 1% variant level with high sensitivity, specificity, and precision (0.977 ± 0.018; 1.000 ± 0.0005; 0.993 ± 0.02) and accurately retrieved the full-length haplotype of each VNTR unit. Human variant levels were highly correlated with next-generation sequencing (R<sup>2</sup> = 0.983) without bias across the whole variant level range. Six reads per UMI produced sequences of each KIV-2 unit with Q40 quality. The KIV-2 repeat number determined by coverage-corrected unique haplotype counting was in close agreement with droplet digital PCR (ddPCR), with 70% of the samples falling even within the narrow confidence interval of ddPCR. We then analyzed 62,679 intra-KIV-2 STR sequences and explored KIV-2 SNP haplotype patterns across five ancestries.</p><p><strong>Conclusions: </strong>UMI-ONT-Seq accurately retrieves the SNP haplotype and precisely quantifies the VNTR copy number of each repeat unit of the complex KIV-2 VNTR region across multiple ancestries. This study utilizes the KIV-2 VNTR, presenting a novel and potent tool for comprehensive characterization of medically relevant complex genome regions at scale.</p>","PeriodicalId":12645,"journal":{"name":"Genome Medicine","volume":"16 1","pages":"117"},"PeriodicalIF":10.4000,"publicationDate":"2024-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11462820/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Genome Medicine","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s13073-024-01391-8","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Repetitive genome regions, such as variable number of tandem repeats (VNTR) or short tandem repeats (STR), are major constituents of the uncharted dark genome and evade conventional sequencing approaches. The protein-coding LPA kringle IV type-2 (KIV-2) VNTR (5.6 kb per unit, 1-40 units per allele) is a medically highly relevant example with a particularly intricate structure, multiple haplotypes, intragenic homologies, and an intra-VNTR STR. It is the primary regulator of plasma lipoprotein(a) [Lp(a)] concentrations, an important cardiovascular risk factor. Lp(a) concentrations vary widely between individuals and ancestries. Multiple variants and functional haplotypes in the LPA gene and especially in the KIV-2 VNTR strongly contribute to this variance.

Methods: We evaluated the performance of amplicon-based nanopore sequencing with unique molecular identifiers (UMI-ONT-Seq) for SNP detection, haplotype mapping, VNTR unit consensus sequence generation, and copy number estimation via coverage-corrected haplotypes quantification in the KIV-2 VNTR. We used 15 human samples and low-level mixtures (0.5 to 5%) of KIV-2 plasmids as a validation set. We then applied UMI-ONT-Seq to extract KIV-2 VNTR haplotypes in 48 multi-ancestry 1000 Genome samples and analyzed at scale a poorly characterized STR within the KIV-2 VNTR.

Results: UMI-ONT-Seq detected KIV-2 SNPs down to 1% variant level with high sensitivity, specificity, and precision (0.977 ± 0.018; 1.000 ± 0.0005; 0.993 ± 0.02) and accurately retrieved the full-length haplotype of each VNTR unit. Human variant levels were highly correlated with next-generation sequencing (R2 = 0.983) without bias across the whole variant level range. Six reads per UMI produced sequences of each KIV-2 unit with Q40 quality. The KIV-2 repeat number determined by coverage-corrected unique haplotype counting was in close agreement with droplet digital PCR (ddPCR), with 70% of the samples falling even within the narrow confidence interval of ddPCR. We then analyzed 62,679 intra-KIV-2 STR sequences and explored KIV-2 SNP haplotype patterns across five ancestries.

Conclusions: UMI-ONT-Seq accurately retrieves the SNP haplotype and precisely quantifies the VNTR copy number of each repeat unit of the complex KIV-2 VNTR region across multiple ancestries. This study utilizes the KIV-2 VNTR, presenting a novel and potent tool for comprehensive characterization of medically relevant complex genome regions at scale.

利用独特的分子标识符进行纳米孔测序,可对复杂的脂蛋白(a)KIV-2 VNTR 进行准确的突变分析和单倍分型。
背景:重复基因组区域,如可变串联重复序列(VNTR)或短串联重复序列(STR),是未知黑暗基因组的主要组成成分,可避开常规测序方法。编码蛋白质的 LPA kringle IV 类型-2(KIV-2)VNTR(每个单位 5.6 kb,每个等位基因 1-40 个单位)是一个与医学高度相关的例子,它具有特别复杂的结构、多种单倍型、基因内同源性和 VNTR 内 STR。它是血浆脂蛋白(a)[Lp(a)]浓度的主要调节因子,是重要的心血管风险因素。脂蛋白(a)浓度在不同个体和不同血统之间差异很大。LPA 基因尤其是 KIV-2 VNTR 中的多种变异和功能单倍型是造成这种差异的主要原因:我们评估了基于扩增子的纳米孔测序与独特分子标识符(UMI-ONT-Seq)在 SNP 检测、单倍型图谱绘制、VNTR 单元共识序列生成以及通过 KIV-2 VNTR 中覆盖校正的单倍型量化估算拷贝数方面的性能。我们使用 15 份人类样本和 KIV-2 质粒的低水平混合物(0.5%-5%)作为验证集。然后,我们应用 UMI-ONT-Seq 提取了 48 个多嵌套 1000 基因组样本中的 KIV-2 VNTR 单倍型,并大规模分析了 KIV-2 VNTR 中一个特征不明显的 STR:UMI-ONT-Seq检测到的KIV-2 SNP低至1%的变异水平,灵敏度、特异性和精确度都很高(0.977±0.018;1.000±0.0005;0.993±0.02),并准确检索到每个VNTR单元的全长单倍型。人类变异水平与新一代测序高度相关(R2 = 0.983),在整个变异水平范围内无偏差。每个 UMI 的 6 个读数产生了每个 KIV-2 单元的序列,质量为 Q40。通过覆盖校正独特单倍型计数确定的 KIV-2 重复序列数与液滴数字 PCR(ddPCR)的结果非常接近,70% 的样本甚至在 ddPCR 狭窄的置信区间内。然后,我们分析了62,679个KIV-2内STR序列,并探讨了五个祖先的KIV-2 SNP单倍型模式:结论:UMI-ONT-Seq 能准确检索 SNP 单倍型,并精确量化多个血统中复杂的 KIV-2 VNTR 区域每个重复单元的 VNTR 拷贝数。这项研究利用 KIV-2 VNTR,为大规模全面鉴定医学相关的复杂基因组区域提供了一种新颖而有效的工具。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Genome Medicine
Genome Medicine GENETICS & HEREDITY-
CiteScore
20.80
自引率
0.80%
发文量
128
审稿时长
6-12 weeks
期刊介绍: Genome Medicine is an open access journal that publishes outstanding research applying genetics, genomics, and multi-omics to understand, diagnose, and treat disease. Bridging basic science and clinical research, it covers areas such as cancer genomics, immuno-oncology, immunogenomics, infectious disease, microbiome, neurogenomics, systems medicine, clinical genomics, gene therapies, precision medicine, and clinical trials. The journal publishes original research, methods, software, and reviews to serve authors and promote broad interest and importance in the field.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信