MhGeneS: An Analytical Pipeline to Allow for Robust Microhaplotype Genotyping

IF 5.5 1区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY
Julia C. Geue, Peng Liu, Sonesinh Keobouasone, Paul Wilson, Micheline Manseau
{"title":"MhGeneS: An Analytical Pipeline to Allow for Robust Microhaplotype Genotyping","authors":"Julia C. Geue,&nbsp;Peng Liu,&nbsp;Sonesinh Keobouasone,&nbsp;Paul Wilson,&nbsp;Micheline Manseau","doi":"10.1111/1755-0998.14027","DOIUrl":null,"url":null,"abstract":"<p>Microhaplotypes are small linked genomic regions comprising two or more single-nucleotide polymorphisms (SNPs) that are being applied in forensics and are emerging in wildlife monitoring studies and genomic epidemiology. Typically, targeted in non-coding regions, microhaplotypes in exonic regions can be designed with larger amplicons to capture functional non-synonymous sites and minimise insertion/deletion (indel) polymorphisms. Quality control is an important first step for high-confidence genotyping to counteract such false-positive variants. As genetic markers with higher polymorphism compared to biallelic SNPs, it is critical to ensure sequencing errors across the microhaplotype amplicon are filtered out to avoid introducing false-haplotypes. We developed the MhGeneS pipeline which works in tandem with Seq2Sat to help validate microhaplotype genotyping of the coding region of genes, with broader applicability to any microhaplotype profiling. We genotyped microhaplotype regions of the <i>Zfx</i> (≅ 160 bp) and <i>Zfy</i> (≅ 140 bp) genes, as well as an exon of the prion protein (<i>Prnp</i>) gene (≅ 370 bp) in caribou (<i>Rangifer tarandus</i>) using paired-end Illumina technology. As important quality metrics affecting microhaplotype calling, we identified the sequencing error rate profile related to the overlap or non-overlap of paired-end reads as well as the read depth as significant. In the case of <i>Prnp</i>, we achieved confident microhaplotype calling through MhGeneS by removing small sections of the 5′ and 3′ amplicons and using a minimum read depth of 20. Read depth and sequence trimming may be locus-specific, and validation of these parameters is recommended before the high-throughput profiling of samples.</p>","PeriodicalId":211,"journal":{"name":"Molecular Ecology Resources","volume":"25 1","pages":""},"PeriodicalIF":5.5000,"publicationDate":"2024-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/1755-0998.14027","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Molecular Ecology Resources","FirstCategoryId":"99","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/1755-0998.14027","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Microhaplotypes are small linked genomic regions comprising two or more single-nucleotide polymorphisms (SNPs) that are being applied in forensics and are emerging in wildlife monitoring studies and genomic epidemiology. Typically, targeted in non-coding regions, microhaplotypes in exonic regions can be designed with larger amplicons to capture functional non-synonymous sites and minimise insertion/deletion (indel) polymorphisms. Quality control is an important first step for high-confidence genotyping to counteract such false-positive variants. As genetic markers with higher polymorphism compared to biallelic SNPs, it is critical to ensure sequencing errors across the microhaplotype amplicon are filtered out to avoid introducing false-haplotypes. We developed the MhGeneS pipeline which works in tandem with Seq2Sat to help validate microhaplotype genotyping of the coding region of genes, with broader applicability to any microhaplotype profiling. We genotyped microhaplotype regions of the Zfx (≅ 160 bp) and Zfy (≅ 140 bp) genes, as well as an exon of the prion protein (Prnp) gene (≅ 370 bp) in caribou (Rangifer tarandus) using paired-end Illumina technology. As important quality metrics affecting microhaplotype calling, we identified the sequencing error rate profile related to the overlap or non-overlap of paired-end reads as well as the read depth as significant. In the case of Prnp, we achieved confident microhaplotype calling through MhGeneS by removing small sections of the 5′ and 3′ amplicons and using a minimum read depth of 20. Read depth and sequence trimming may be locus-specific, and validation of these parameters is recommended before the high-throughput profiling of samples.

Abstract Image

MhGeneS:一种可进行强大的微单体型基因分型的分析管道。
微单型是由两个或两个以上单核苷酸多态性(SNPs)组成的小连接基因组区域,目前已被应用于法医学,并正在野生动物监测研究和基因组流行病学中兴起。通常情况下,外显子区的微单倍型以非编码区为目标,可以设计较大的扩增子来捕获功能性非同义位点,并尽量减少插入/缺失(indel)多态性。质量控制是高置信度基因分型的重要第一步,可抵消此类假阳性变异。与双链 SNP 相比,遗传标记具有更高的多态性,因此必须确保过滤掉微单体型扩增片段中的测序错误,以避免引入假单体型。我们开发了 MhGeneS 管道,它与 Seq2Sat 协同工作,帮助验证基因编码区的微单体型基因分型,并可广泛应用于任何微单体型分析。我们利用成对端 Illumina 技术对驯鹿(Rangifer tarandus)的 Zfx(≅ 160 bp)和 Zfy(≅ 140 bp)基因以及朊病毒蛋白(Prnp)基因的一个外显子(≅ 370 bp)进行了微单基因型区域的基因分型。作为影响微单型鉴定的重要质量指标,我们发现与成对端读数重叠或不重叠有关的测序错误率曲线以及读数深度都很重要。就 Prnp 而言,我们通过 MhGeneS 去掉了 5' 和 3' 扩增子的一小部分,并使用最小读数深度为 20 的方法,实现了可靠的微单体型调用。读数深度和序列修剪可能会对特定位点有影响,建议在对样本进行高通量分析前对这些参数进行验证。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Molecular Ecology Resources
Molecular Ecology Resources 生物-进化生物学
CiteScore
15.60
自引率
5.20%
发文量
170
审稿时长
3 months
期刊介绍: Molecular Ecology Resources promotes the creation of comprehensive resources for the scientific community, encompassing computer programs, statistical and molecular advancements, and a diverse array of molecular tools. Serving as a conduit for disseminating these resources, the journal targets a broad audience of researchers in the fields of evolution, ecology, and conservation. Articles in Molecular Ecology Resources are crafted to support investigations tackling significant questions within these disciplines. In addition to original resource articles, Molecular Ecology Resources features Reviews, Opinions, and Comments relevant to the field. The journal also periodically releases Special Issues focusing on resource development within specific areas.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信