Thoai Dotrang, Brad T Sherman, Lisheng Dai, Muhammad Ayub Khan, Helene C Highbarger, Whitney Bruchey, Sylvain Laverdure, Michael W Baseler, Tomozumi Imamichi, Robin L Dewar, Weizhong Chang
{"title":"HIVGenoPipe: a nextflow pipeline for the detection of HIV-1 drug resistance using a real-time sample-specific reference sequence.","authors":"Thoai Dotrang, Brad T Sherman, Lisheng Dai, Muhammad Ayub Khan, Helene C Highbarger, Whitney Bruchey, Sylvain Laverdure, Michael W Baseler, Tomozumi Imamichi, Robin L Dewar, Weizhong Chang","doi":"10.1186/s12859-025-06201-5","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The emergence of HIV drug resistance is a challenge in controlling the acquired immunodeficiency syndrome (AIDS) pandemic caused by human immunodeficiency virus-1 (HIV-1) infection. Detection of drug resistance variants at minor frequencies can help to formulate successful antiretroviral therapy (ART) regimens for people living with HIV (PLWH) and reduce the emergence of drug resistance. Therefore, a pipeline which can accurately produce consensus nucleotide sequences and identify drug resistance mutations (DRMs) at defined frequency thresholds will be helpful in the treatment of PLWH, analysis of virus evolution, and the control of the pandemic.</p><p><strong>Results: </strong>We have developed a pipeline, HIVGenoPipe, to determine HIV drug resistance variants within the gag-pol region above user-defined frequencies for HIV-1 samples sequenced using Illumina technology. The pipeline has been validated by comparing its results with the results generated by a widely used pipeline, HyDRA, which is limited to the pol region, and with the results generated by Sanger sequencing technology using the same set of 30 samples. The variant frequency used to generate ambiguous consensus sequences in HIVGenoPipe is more accurate than other pipelines because a sample-specific reference, which is generated in real-time with a novel hybrid strategy of de novo and reference-based assembly, is used for the frequency calculation, leading to more accurate drug resistance calls for use by clinicians. In addition, since Nextflow is used as the pipeline platform, HIVGenoPipe inherently has great portability, scalability and reproducibility; and the components can be updated or replaced independently if required.</p><p><strong>Conclusions: </strong>We developed HIVGenoPipe for the detection of HIV-1 drug resistance. It constructs more accurate gag-pol consensus sequences, leading to improved detection of DRMs. HIVGenoPipe is open source and freely available under the MIT license at https://github.com/LHRI-Bioinformatics/HIVGenoPipe . The current release (v1.0.1) is archived and available at https://doi.org/ https://doi.org/10.5281/zenodo.15528502 .</p>","PeriodicalId":8958,"journal":{"name":"BMC Bioinformatics","volume":"26 1","pages":"168"},"PeriodicalIF":3.3000,"publicationDate":"2025-07-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12235847/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Bioinformatics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s12859-025-06201-5","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
Background: The emergence of HIV drug resistance is a challenge in controlling the acquired immunodeficiency syndrome (AIDS) pandemic caused by human immunodeficiency virus-1 (HIV-1) infection. Detection of drug resistance variants at minor frequencies can help to formulate successful antiretroviral therapy (ART) regimens for people living with HIV (PLWH) and reduce the emergence of drug resistance. Therefore, a pipeline which can accurately produce consensus nucleotide sequences and identify drug resistance mutations (DRMs) at defined frequency thresholds will be helpful in the treatment of PLWH, analysis of virus evolution, and the control of the pandemic.
Results: We have developed a pipeline, HIVGenoPipe, to determine HIV drug resistance variants within the gag-pol region above user-defined frequencies for HIV-1 samples sequenced using Illumina technology. The pipeline has been validated by comparing its results with the results generated by a widely used pipeline, HyDRA, which is limited to the pol region, and with the results generated by Sanger sequencing technology using the same set of 30 samples. The variant frequency used to generate ambiguous consensus sequences in HIVGenoPipe is more accurate than other pipelines because a sample-specific reference, which is generated in real-time with a novel hybrid strategy of de novo and reference-based assembly, is used for the frequency calculation, leading to more accurate drug resistance calls for use by clinicians. In addition, since Nextflow is used as the pipeline platform, HIVGenoPipe inherently has great portability, scalability and reproducibility; and the components can be updated or replaced independently if required.
Conclusions: We developed HIVGenoPipe for the detection of HIV-1 drug resistance. It constructs more accurate gag-pol consensus sequences, leading to improved detection of DRMs. HIVGenoPipe is open source and freely available under the MIT license at https://github.com/LHRI-Bioinformatics/HIVGenoPipe . The current release (v1.0.1) is archived and available at https://doi.org/ https://doi.org/10.5281/zenodo.15528502 .
期刊介绍:
BMC Bioinformatics is an open access, peer-reviewed journal that considers articles on all aspects of the development, testing and novel application of computational and statistical methods for the modeling and analysis of all kinds of biological data, as well as other areas of computational biology.
BMC Bioinformatics is part of the BMC series which publishes subject-specific journals focused on the needs of individual research communities across all areas of biology and medicine. We offer an efficient, fair and friendly peer review service, and are committed to publishing all sound science, provided that there is some advance in knowledge presented by the work.