Comparing Gold-Standard Sanger Sequencing with Two Next-Generation Sequencing Platforms of HIV-1 gp160 Single Genome Amplicons.

IF 1.5 4区 医学 Q4 IMMUNOLOGY
AIDS research and human retroviruses Pub Date : 2024-11-01 Epub Date: 2024-07-10 DOI:10.1089/AID.2024.0012
David J Nolan, Jonathan DaRoza, Robin Brody, Krishna Ganta, Katherine Luzuriaga, Chris Huston, Simon Rosenthal, Susanna L Lamers, Rebecca Rose
{"title":"Comparing Gold-Standard Sanger Sequencing with Two Next-Generation Sequencing Platforms of HIV-1 <i>gp160</i> Single Genome Amplicons.","authors":"David J Nolan, Jonathan DaRoza, Robin Brody, Krishna Ganta, Katherine Luzuriaga, Chris Huston, Simon Rosenthal, Susanna L Lamers, Rebecca Rose","doi":"10.1089/AID.2024.0012","DOIUrl":null,"url":null,"abstract":"<p><p>Our goal was to assess the accuracy of next generation sequencing (NGS) compared with Sanger. We performed single genome amplification (SGA) of HIV-1 <i>gp160</i> on extracted tissue DNA from two HIV+ individuals. Amplicons (<i>n</i> = 30) were sequenced with Sanger or reamplified with barcoded primers and pooled before sequencing using Oxford Nanopore Technologies (ONT) and Pacific Biosciences (PB). For each amplicon, a consensus sequence for NGS reads was obtained by (1) mapping reads to the Sanger sequence when available (\"reference-based\") or (2) mapping reads to a \"pseudo-reference\" sequence, i.e., a consensus sequence of a subset of NGS reads (\"reference-free\"). PB reads were clustered based on genetic similarity. A Sanger consensus sequence was obtained for 23/30 amplicons, for which all NGS consensus sequences were identical (<i>n</i> = 9) or nearly identical (<i>n</i> = 14) compared with Sanger. For the nine mismatches between Sanger/NGS, the nucleotide in the NGS sequence matched all other sequences from that patient. Of the 7/30 amplicons without a Sanger sequence, NGS sequences had ≥35 ambiguous calls in five amplicons and 0 ambiguities in two amplicons. Analysis of the electropherograms showed failure of a single sequencing primer for the latter two amplicons (consistent with a single template) and overlapping peaks for the other five (consistent with multiple templates). Clustering results closely followed the Sanger/NGS consensus results, where amplicons derived from a single template also had a single cluster and vice versa (with one exception, which could be the result of barcode misidentification). Representative sequences from the clusters contained 2-13 differences compared with Sanger/NGS. In summary, we show that both ONT and PB can produce amplicon consensus sequences with similar or higher accuracy compared with Sanger and, importantly, without the need for a known reference sequence. Clustering could be useful in some circumstances to predict or confirm the presence of multiple starting templates.</p>","PeriodicalId":7544,"journal":{"name":"AIDS research and human retroviruses","volume":" ","pages":"659-669"},"PeriodicalIF":1.5000,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"AIDS research and human retroviruses","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1089/AID.2024.0012","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/7/10 0:00:00","PubModel":"Epub","JCR":"Q4","JCRName":"IMMUNOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Our goal was to assess the accuracy of next generation sequencing (NGS) compared with Sanger. We performed single genome amplification (SGA) of HIV-1 gp160 on extracted tissue DNA from two HIV+ individuals. Amplicons (n = 30) were sequenced with Sanger or reamplified with barcoded primers and pooled before sequencing using Oxford Nanopore Technologies (ONT) and Pacific Biosciences (PB). For each amplicon, a consensus sequence for NGS reads was obtained by (1) mapping reads to the Sanger sequence when available ("reference-based") or (2) mapping reads to a "pseudo-reference" sequence, i.e., a consensus sequence of a subset of NGS reads ("reference-free"). PB reads were clustered based on genetic similarity. A Sanger consensus sequence was obtained for 23/30 amplicons, for which all NGS consensus sequences were identical (n = 9) or nearly identical (n = 14) compared with Sanger. For the nine mismatches between Sanger/NGS, the nucleotide in the NGS sequence matched all other sequences from that patient. Of the 7/30 amplicons without a Sanger sequence, NGS sequences had ≥35 ambiguous calls in five amplicons and 0 ambiguities in two amplicons. Analysis of the electropherograms showed failure of a single sequencing primer for the latter two amplicons (consistent with a single template) and overlapping peaks for the other five (consistent with multiple templates). Clustering results closely followed the Sanger/NGS consensus results, where amplicons derived from a single template also had a single cluster and vice versa (with one exception, which could be the result of barcode misidentification). Representative sequences from the clusters contained 2-13 differences compared with Sanger/NGS. In summary, we show that both ONT and PB can produce amplicon consensus sequences with similar or higher accuracy compared with Sanger and, importantly, without the need for a known reference sequence. Clustering could be useful in some circumstances to predict or confirm the presence of multiple starting templates.

比较金标准桑格测序与 HIV-1 gp160 单模板扩增子的两种下一代测序平台。
我们的目标是评估新一代测序 (NGS) 与 Sanger 相比的准确性。我们对两名 HIV 感染者提取的组织 DNA 进行了 HIV-1 gp160 的单基因组扩增(SGA)。扩增子(n=30)用 Sanger 测序,或用条形码引物重新扩增,并在测序前用牛津纳米孔技术公司(ONT)和太平洋生物科学公司(PB)汇集。对于每个扩增子,通过以下方法获得 NGS 读数的共识序列:(1)将读数映射到可用的 Sanger 序列("基于参考")或(2)将读数映射到 "伪参考 "序列,即 NGS 读数子集的共识序列("无参考")。根据遗传相似性对 PB 读数进行聚类。23/30 个扩增子获得了 Sanger 共识序列,其中所有 NGS 共识序列与 Sanger 序列相同 [n=9] 或几乎相同 [n=14]。对于 Sanger/NGS 之间的 9 个不匹配序列,NGS 序列中的核苷酸与该患者的所有其他序列相匹配。在 7/30 个没有 Sanger 序列的扩增子中,NGS 序列在 5 个扩增子中出现了 35 模糊调用,在 2 个扩增子中出现了 0 模糊调用。对电泳图的分析表明,后两个扩增子的单一测序引物失效(与单一模板一致),其他五个扩增子的峰值重叠(与多模板一致)。聚类结果紧跟 Sanger/NGS 的共识结果,即来自单一模板的扩增子也有一个聚类,反之亦然(只有一个例外,可能是条形码识别错误的结果)。与 Sanger/NGS 相比,簇的代表性序列有 2-13 个差异。总之,我们发现 ONT 和 PB 都能产生与 Sanger 相似或更高准确度的扩增片段共识序列,而且重要的是,无需已知参考序列。在某些情况下,聚类可用于预测或确认多个起始模板的存在。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
3.10
自引率
6.70%
发文量
201
审稿时长
3-6 weeks
期刊介绍: AIDS Research and Human Retroviruses was the very first AIDS publication in the field over 30 years ago, and today it is still the critical resource advancing research in retroviruses, including AIDS. The Journal provides the broadest coverage from molecular biology to clinical studies and outcomes research, focusing on developments in prevention science, novel therapeutics, and immune-restorative approaches. Cutting-edge papers on the latest progress and research advances through clinical trials and examination of targeted antiretroviral agents lead to improvements in translational medicine for optimal treatment outcomes. AIDS Research and Human Retroviruses coverage includes: HIV cure research HIV prevention science - Vaccine research - Systemic and Topical PreP Molecular and cell biology of HIV and SIV Developments in HIV pathogenesis and comorbidities Molecular biology, immunology, and epidemiology of HTLV Pharmacology of HIV therapy Social and behavioral science Rapid publication of emerging sequence information.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信