Vid Mlakar, Isabelle Dupanloup, Yvonne Gloor, Marc Ansari
{"title":"利用长读纳米孔测序进行单倍型推断:应用于 GSTA1 Promoter。","authors":"Vid Mlakar, Isabelle Dupanloup, Yvonne Gloor, Marc Ansari","doi":"10.1007/s12033-024-01213-7","DOIUrl":null,"url":null,"abstract":"<p><p>Recovering true haplotypes can have important clinical consequences. The laboratory process is difficult and is, therefore, most often done through inference. In this paper, we show that when using the Oxford nanopore sequencing technology, we could recover the true haplotypes of the GSTA1 promoter region. Eight LCL cell lines with potentially ambiguous haplotypes were used to characterize the efficacy of Oxford nanopore sequencing to phase the correct GSTA1 promoter haplotypes. The results were compared to Sanger sequencing and inferred haplotypes in the 1000 genomes project. The average read length was 813 bp out of a total PCR length of 1336 bp. The best coverage of sequencing was in the middle of the PCR product and decreased to 50% at the PCR ends. SNPs separated by less than 200 bp showed > 90% of correct haplotypes, while at the distance of 1089 bp, this proportion still exceeded 58%. The number of cycles influences the generation of hybrid haplotypes but not extension or annealing time. The results demonstrate that this long sequencing reads methodology, can accurately determine the haplotypes without the need for inference. The technology proved to be robust but the success of phasing nonetheless depends on the distances and frequencies of SNPs.</p>","PeriodicalId":18865,"journal":{"name":"Molecular Biotechnology","volume":" ","pages":"2512-2519"},"PeriodicalIF":2.4000,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12055866/pdf/","citationCount":"0","resultStr":"{\"title\":\"Haplotype Inference Using Long-Read Nanopore Sequencing: Application to GSTA1 Promoter.\",\"authors\":\"Vid Mlakar, Isabelle Dupanloup, Yvonne Gloor, Marc Ansari\",\"doi\":\"10.1007/s12033-024-01213-7\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Recovering true haplotypes can have important clinical consequences. The laboratory process is difficult and is, therefore, most often done through inference. In this paper, we show that when using the Oxford nanopore sequencing technology, we could recover the true haplotypes of the GSTA1 promoter region. Eight LCL cell lines with potentially ambiguous haplotypes were used to characterize the efficacy of Oxford nanopore sequencing to phase the correct GSTA1 promoter haplotypes. The results were compared to Sanger sequencing and inferred haplotypes in the 1000 genomes project. The average read length was 813 bp out of a total PCR length of 1336 bp. The best coverage of sequencing was in the middle of the PCR product and decreased to 50% at the PCR ends. SNPs separated by less than 200 bp showed > 90% of correct haplotypes, while at the distance of 1089 bp, this proportion still exceeded 58%. The number of cycles influences the generation of hybrid haplotypes but not extension or annealing time. The results demonstrate that this long sequencing reads methodology, can accurately determine the haplotypes without the need for inference. The technology proved to be robust but the success of phasing nonetheless depends on the distances and frequencies of SNPs.</p>\",\"PeriodicalId\":18865,\"journal\":{\"name\":\"Molecular Biotechnology\",\"volume\":\" \",\"pages\":\"2512-2519\"},\"PeriodicalIF\":2.4000,\"publicationDate\":\"2025-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12055866/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Molecular Biotechnology\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1007/s12033-024-01213-7\",\"RegionNum\":4,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/6/17 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q3\",\"JCRName\":\"BIOCHEMISTRY & MOLECULAR BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Molecular Biotechnology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s12033-024-01213-7","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/6/17 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0
摘要
恢复真实的单倍型会对临床产生重要影响。这一实验室过程十分困难,因此通常通过推断来完成。本文表明,利用牛津纳米孔测序技术,我们可以恢复 GSTA1 启动子区域的真实单倍型。我们使用了八种单倍型可能不明确的 LCL 细胞系来鉴定牛津纳米孔测序技术在分辨正确的 GSTA1 启动子单倍型方面的功效。测序结果与桑格测序和 1000 基因组项目中推断的单倍型进行了比较。PCR 总长度为 1336 bp,平均读取长度为 813 bp。测序的最佳覆盖率在 PCR 产物的中部,在 PCR 两端的覆盖率降至 50%。相距不到 200 bp 的 SNP 显示出大于 90% 的正确单倍型,而在相距 1089 bp 时,这一比例仍超过 58%。循环次数会影响混合单倍型的产生,但延伸或退火时间不会。结果表明,这种长测序读数方法无需推断即可准确确定单倍型。该技术被证明是稳健的,但分期的成功与否取决于 SNP 的距离和频率。
Haplotype Inference Using Long-Read Nanopore Sequencing: Application to GSTA1 Promoter.
Recovering true haplotypes can have important clinical consequences. The laboratory process is difficult and is, therefore, most often done through inference. In this paper, we show that when using the Oxford nanopore sequencing technology, we could recover the true haplotypes of the GSTA1 promoter region. Eight LCL cell lines with potentially ambiguous haplotypes were used to characterize the efficacy of Oxford nanopore sequencing to phase the correct GSTA1 promoter haplotypes. The results were compared to Sanger sequencing and inferred haplotypes in the 1000 genomes project. The average read length was 813 bp out of a total PCR length of 1336 bp. The best coverage of sequencing was in the middle of the PCR product and decreased to 50% at the PCR ends. SNPs separated by less than 200 bp showed > 90% of correct haplotypes, while at the distance of 1089 bp, this proportion still exceeded 58%. The number of cycles influences the generation of hybrid haplotypes but not extension or annealing time. The results demonstrate that this long sequencing reads methodology, can accurately determine the haplotypes without the need for inference. The technology proved to be robust but the success of phasing nonetheless depends on the distances and frequencies of SNPs.
期刊介绍:
Molecular Biotechnology publishes original research papers on the application of molecular biology to both basic and applied research in the field of biotechnology. Particular areas of interest include the following: stability and expression of cloned gene products, cell transformation, gene cloning systems and the production of recombinant proteins, protein purification and analysis, transgenic species, developmental biology, mutation analysis, the applications of DNA fingerprinting, RNA interference, and PCR technology, microarray technology, proteomics, mass spectrometry, bioinformatics, plant molecular biology, microbial genetics, gene probes and the diagnosis of disease, pharmaceutical and health care products, therapeutic agents, vaccines, gene targeting, gene therapy, stem cell technology and tissue engineering, antisense technology, protein engineering and enzyme technology, monoclonal antibodies, glycobiology and glycomics, and agricultural biotechnology.