低覆盖率全基因组测序有助于在复杂的小鼠杂交中精确和经济地重建单倍型。

IF 2.7 4区 生物学 Q3 BIOCHEMISTRY & MOLECULAR BIOLOGY
Samuel J Widmayer, Lydia K Wooldridge, Emily Swanzey, Mary Barter, Chrystal Snow, Michael Saul, Qingchang Meng, Beth Dumont, Laura Reinholdt, Daniel M Gatti
{"title":"低覆盖率全基因组测序有助于在复杂的小鼠杂交中精确和经济地重建单倍型。","authors":"Samuel J Widmayer, Lydia K Wooldridge, Emily Swanzey, Mary Barter, Chrystal Snow, Michael Saul, Qingchang Meng, Beth Dumont, Laura Reinholdt, Daniel M Gatti","doi":"10.1007/s00335-025-10148-6","DOIUrl":null,"url":null,"abstract":"<p><p>The search for the underlying genetic contributions to complex traits and diseases relies on accurate genetic data from populations of interest. Outbred populations, like the Diversity Outbred (DO), are commonly genotyped using commercial SNP arrays, such as the Giga Mouse Universal Genotyping Array (GigaMUGA). However, array genotypes are expensive to collect, subject to significant ascertainment bias, and too sparse to capture the genetic structure of highly recombined mouse crosses. We investigated the efficacy of sequencing-based genotyping by comparing genotyping results between the GigaMUGA, double-digest restriction-site associated DNA sequencing (ddRADseq), and low-coverage whole-genome sequencing (lcWGS). We aligned reads at ~ 1× coverage and imputed segregating SNPs from the eight DO founder strains onto 48 DO genomes and reconstructed their haplotypes using R/qtl2. Haplotype reconstructions derived from all three methods were highly concordant. However, lcWGS more faithfully recapitulated crossover counts and identified more small (< 1 Mb) haplotype blocks at as low as 0.1× coverage. Over 90% of local expression quantitative trait loci identified in a set of 183 DO-derived embryoid bodies using the GigaMUGA were recalled by lcWGS at coverages as low as 0.1×. We recommend that lcWGS be adopted as the primary method of genotyping complex crosses, and cell-based resources derived from them because they are as accurate as array-based reconstructions, robust to ultra-low sequencing depths, may more accurately model haplotypes of the mouse genome that are difficult to resolve with dense reference data, and cost-effective.</p>","PeriodicalId":18259,"journal":{"name":"Mammalian Genome","volume":" ","pages":""},"PeriodicalIF":2.7000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Low-coverage whole-genome sequencing facilitates accurate and cost-effective haplotype reconstruction in complex mouse crosses.\",\"authors\":\"Samuel J Widmayer, Lydia K Wooldridge, Emily Swanzey, Mary Barter, Chrystal Snow, Michael Saul, Qingchang Meng, Beth Dumont, Laura Reinholdt, Daniel M Gatti\",\"doi\":\"10.1007/s00335-025-10148-6\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>The search for the underlying genetic contributions to complex traits and diseases relies on accurate genetic data from populations of interest. Outbred populations, like the Diversity Outbred (DO), are commonly genotyped using commercial SNP arrays, such as the Giga Mouse Universal Genotyping Array (GigaMUGA). However, array genotypes are expensive to collect, subject to significant ascertainment bias, and too sparse to capture the genetic structure of highly recombined mouse crosses. We investigated the efficacy of sequencing-based genotyping by comparing genotyping results between the GigaMUGA, double-digest restriction-site associated DNA sequencing (ddRADseq), and low-coverage whole-genome sequencing (lcWGS). We aligned reads at ~ 1× coverage and imputed segregating SNPs from the eight DO founder strains onto 48 DO genomes and reconstructed their haplotypes using R/qtl2. Haplotype reconstructions derived from all three methods were highly concordant. However, lcWGS more faithfully recapitulated crossover counts and identified more small (< 1 Mb) haplotype blocks at as low as 0.1× coverage. Over 90% of local expression quantitative trait loci identified in a set of 183 DO-derived embryoid bodies using the GigaMUGA were recalled by lcWGS at coverages as low as 0.1×. We recommend that lcWGS be adopted as the primary method of genotyping complex crosses, and cell-based resources derived from them because they are as accurate as array-based reconstructions, robust to ultra-low sequencing depths, may more accurately model haplotypes of the mouse genome that are difficult to resolve with dense reference data, and cost-effective.</p>\",\"PeriodicalId\":18259,\"journal\":{\"name\":\"Mammalian Genome\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":2.7000,\"publicationDate\":\"2025-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Mammalian Genome\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1007/s00335-025-10148-6\",\"RegionNum\":4,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"BIOCHEMISTRY & MOLECULAR BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Mammalian Genome","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1007/s00335-025-10148-6","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

对复杂性状和疾病的潜在遗传贡献的研究依赖于来自感兴趣人群的准确遗传数据。近亲繁殖群体,如多样性近亲繁殖(DO),通常使用商业SNP阵列进行基因分型,例如千兆小鼠通用基因分型阵列(GigaMUGA)。然而,阵列基因型收集成本高,存在明显的确定偏差,并且过于稀疏,无法捕获高度重组小鼠杂交的遗传结构。通过比较GigaMUGA、双酶切限制性位点相关DNA测序(ddRADseq)和低覆盖全基因组测序(lcWGS)的基因分型结果,研究了基于测序的基因分型的有效性。我们将8个DO始祖菌株的分离snp输入到48个DO基因组上,并利用R/qtl2重构了它们的单倍型。三种方法得到的单倍型重建结果高度一致。然而,lcWGS更忠实地概括了交叉计数,并确定了更小的(
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Low-coverage whole-genome sequencing facilitates accurate and cost-effective haplotype reconstruction in complex mouse crosses.

The search for the underlying genetic contributions to complex traits and diseases relies on accurate genetic data from populations of interest. Outbred populations, like the Diversity Outbred (DO), are commonly genotyped using commercial SNP arrays, such as the Giga Mouse Universal Genotyping Array (GigaMUGA). However, array genotypes are expensive to collect, subject to significant ascertainment bias, and too sparse to capture the genetic structure of highly recombined mouse crosses. We investigated the efficacy of sequencing-based genotyping by comparing genotyping results between the GigaMUGA, double-digest restriction-site associated DNA sequencing (ddRADseq), and low-coverage whole-genome sequencing (lcWGS). We aligned reads at ~ 1× coverage and imputed segregating SNPs from the eight DO founder strains onto 48 DO genomes and reconstructed their haplotypes using R/qtl2. Haplotype reconstructions derived from all three methods were highly concordant. However, lcWGS more faithfully recapitulated crossover counts and identified more small (< 1 Mb) haplotype blocks at as low as 0.1× coverage. Over 90% of local expression quantitative trait loci identified in a set of 183 DO-derived embryoid bodies using the GigaMUGA were recalled by lcWGS at coverages as low as 0.1×. We recommend that lcWGS be adopted as the primary method of genotyping complex crosses, and cell-based resources derived from them because they are as accurate as array-based reconstructions, robust to ultra-low sequencing depths, may more accurately model haplotypes of the mouse genome that are difficult to resolve with dense reference data, and cost-effective.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Mammalian Genome
Mammalian Genome 生物-生化与分子生物学
CiteScore
4.00
自引率
0.00%
发文量
33
审稿时长
6-12 weeks
期刊介绍: Mammalian Genome focuses on the experimental, theoretical and technical aspects of genetics, genomics, epigenetics and systems biology in mouse, human and other mammalian species, with an emphasis on the relationship between genotype and phenotype, elucidation of biological and disease pathways as well as experimental aspects of interventions, therapeutics, and precision medicine. The journal aims to publish high quality original papers that present novel findings in all areas of mammalian genetic research as well as review articles on areas of topical interest. The journal will also feature commentaries and editorials to inform readers of breakthrough discoveries as well as issues of research standards, policies and ethics.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信