4只中华冠犬全基因组测序在离子质子系统变异检测中的评价。

Canine genetics and epidemiology Pub Date : 2015-10-08 eCollection Date: 2015-01-01 DOI:10.1186/s40575-015-0029-2
Agnese Viluma, Shumaila Sayyab, Sofia Mikko, Göran Andersson, Tomas F Bergström
{"title":"4只中华冠犬全基因组测序在离子质子系统变异检测中的评价。","authors":"Agnese Viluma,&nbsp;Shumaila Sayyab,&nbsp;Sofia Mikko,&nbsp;Göran Andersson,&nbsp;Tomas F Bergström","doi":"10.1186/s40575-015-0029-2","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Next generation sequencing (NGS) has traditionally been performed by large genome centers, but in recent years, the costs for whole-genome sequencing (WGS) have decreased substantially. With the introduction of smaller and less expensive \"desktop\" systems, NGS is now moving into the general laboratory. To evaluate the Ion Proton system for WGS we sequenced four Chinese Crested dogs and analyzed the data quality in terms of genome and exome coverage, the number of detected single nucleotide variants (SNVs) and insertions and deletions (INDELs) and the genotype concordance with the Illumina HD canine SNP array. For each of the four dogs, a 200 bp fragment library was constructed from genomic DNA and sequenced on two Ion PI chips per dog to reach mean coverage of 6-8x of the canine genome (genome size ≈ 2.4 Gb).</p><p><strong>Results: </strong>On average, each Ion PI chip yielded approximately 73.3 million reads with a mean read length of 130 bp (~9.5 Gb sequence data) of which 98.5 % could be aligned to the canine reference genome (CanFam3.1). By sequencing a single dog using one fragment library and two Ion PI chips, on average 80 % of the genome and 77 % exome was covered by at least four reads. After removing duplicate reads (20.7 %) the mean coverage across the whole genome was 6x. Using sequence data from all four individuals (four fragment libraries and eight Ion PI chips) the genome and exome coverage could be further increased to 97.2 and 94.3 %, respectively. We detected 4.83 million unique SNPs and 6.10 million unique INDEL positions across all individuals. A comparison between SNP genotypes detected with the WGS and the 170 K Illumina HD canine SNP array showed 90 % concordance.</p><p><strong>Conclusions: </strong>We have evaluated whole-genome sequencing on the Ion Proton system for genetic variant detection in four Chinese crested dogs. Even though INDEL calling with Ion Proton data is challenging due to specific platform errors, in case of SNP calling it can serve as an alternative to other next-generation sequencing platforms and SNP genotyping arrays, in studies aiming to identify causative mutations for rare monogenic diseases. In addition, we have identified new genetic variants of the Chinese Crested dog that will contribute to further whole-genome sequencing studies aimed to identify mutations associated with monogenic diseases with autosomal recessive inheritance.</p>","PeriodicalId":91060,"journal":{"name":"Canine genetics and epidemiology","volume":"2 ","pages":"16"},"PeriodicalIF":0.0000,"publicationDate":"2015-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/s40575-015-0029-2","citationCount":"5","resultStr":"{\"title\":\"Evaluation of whole-genome sequencing of four Chinese crested dogs for variant detection using the ion proton system.\",\"authors\":\"Agnese Viluma,&nbsp;Shumaila Sayyab,&nbsp;Sofia Mikko,&nbsp;Göran Andersson,&nbsp;Tomas F Bergström\",\"doi\":\"10.1186/s40575-015-0029-2\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Next generation sequencing (NGS) has traditionally been performed by large genome centers, but in recent years, the costs for whole-genome sequencing (WGS) have decreased substantially. With the introduction of smaller and less expensive \\\"desktop\\\" systems, NGS is now moving into the general laboratory. To evaluate the Ion Proton system for WGS we sequenced four Chinese Crested dogs and analyzed the data quality in terms of genome and exome coverage, the number of detected single nucleotide variants (SNVs) and insertions and deletions (INDELs) and the genotype concordance with the Illumina HD canine SNP array. For each of the four dogs, a 200 bp fragment library was constructed from genomic DNA and sequenced on two Ion PI chips per dog to reach mean coverage of 6-8x of the canine genome (genome size ≈ 2.4 Gb).</p><p><strong>Results: </strong>On average, each Ion PI chip yielded approximately 73.3 million reads with a mean read length of 130 bp (~9.5 Gb sequence data) of which 98.5 % could be aligned to the canine reference genome (CanFam3.1). By sequencing a single dog using one fragment library and two Ion PI chips, on average 80 % of the genome and 77 % exome was covered by at least four reads. After removing duplicate reads (20.7 %) the mean coverage across the whole genome was 6x. Using sequence data from all four individuals (four fragment libraries and eight Ion PI chips) the genome and exome coverage could be further increased to 97.2 and 94.3 %, respectively. We detected 4.83 million unique SNPs and 6.10 million unique INDEL positions across all individuals. A comparison between SNP genotypes detected with the WGS and the 170 K Illumina HD canine SNP array showed 90 % concordance.</p><p><strong>Conclusions: </strong>We have evaluated whole-genome sequencing on the Ion Proton system for genetic variant detection in four Chinese crested dogs. Even though INDEL calling with Ion Proton data is challenging due to specific platform errors, in case of SNP calling it can serve as an alternative to other next-generation sequencing platforms and SNP genotyping arrays, in studies aiming to identify causative mutations for rare monogenic diseases. In addition, we have identified new genetic variants of the Chinese Crested dog that will contribute to further whole-genome sequencing studies aimed to identify mutations associated with monogenic diseases with autosomal recessive inheritance.</p>\",\"PeriodicalId\":91060,\"journal\":{\"name\":\"Canine genetics and epidemiology\",\"volume\":\"2 \",\"pages\":\"16\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-10-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1186/s40575-015-0029-2\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Canine genetics and epidemiology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1186/s40575-015-0029-2\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2015/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Canine genetics and epidemiology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1186/s40575-015-0029-2","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2015/1/1 0:00:00","PubModel":"eCollection","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

摘要

背景:下一代测序(NGS)传统上是由大型基因组中心进行的,但近年来,全基因组测序(WGS)的成本大幅下降。随着更小、更便宜的“桌面”系统的引入,NGS现在正进入普通实验室。为了评估离子质子系统对WGS的影响,我们对4只中华冠毛犬进行了测序,并从基因组和外显子组覆盖率、检测到的单核苷酸变异(snv)和插入缺失(INDELs)数量以及与Illumina HD犬SNP阵列的基因型一致性等方面分析了数据质量。对每只狗构建200 bp的基因组DNA片段文库,并在每只狗的2个离子PI芯片上测序,平均覆盖犬基因组的6-8倍(基因组大小≈2.4 Gb)。结果:平均每个离子PI芯片产生约7330万reads,平均读取长度为130 bp(约9.5 Gb序列数据),其中98.5%可以与犬参考基因组对齐(CanFam3.1)。通过使用一个片段库和两个离子PI芯片对一只狗进行测序,平均80%的基因组和77%的外显子组被至少四次读取覆盖。去除重复reads(20.7%)后,整个基因组的平均覆盖率为6倍。利用所有4个个体(4个片段文库和8个离子PI芯片)的序列数据,基因组和外显子组的覆盖率分别可以进一步提高到97.2和94.3%。我们在所有个体中检测到483万个独特的snp和610万个独特的INDEL位置。WGS检测的SNP基因型与170 K Illumina HD犬SNP基因型的一致性为90%。结论:我们利用离子质子系统(Ion Proton system)对4只中国冠毛犬的基因变异进行了全基因组测序。尽管使用离子质子数据的INDEL调用由于特定的平台错误而具有挑战性,但在SNP调用的情况下,它可以作为其他下一代测序平台和SNP基因分型阵列的替代方案,用于旨在确定罕见单基因疾病的致病突变的研究。此外,我们已经确定了中国冠毛犬的新遗传变异,这将有助于进一步的全基因组测序研究,旨在确定与常染色体隐性遗传的单基因疾病相关的突变。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Evaluation of whole-genome sequencing of four Chinese crested dogs for variant detection using the ion proton system.

Evaluation of whole-genome sequencing of four Chinese crested dogs for variant detection using the ion proton system.

Evaluation of whole-genome sequencing of four Chinese crested dogs for variant detection using the ion proton system.

Evaluation of whole-genome sequencing of four Chinese crested dogs for variant detection using the ion proton system.

Background: Next generation sequencing (NGS) has traditionally been performed by large genome centers, but in recent years, the costs for whole-genome sequencing (WGS) have decreased substantially. With the introduction of smaller and less expensive "desktop" systems, NGS is now moving into the general laboratory. To evaluate the Ion Proton system for WGS we sequenced four Chinese Crested dogs and analyzed the data quality in terms of genome and exome coverage, the number of detected single nucleotide variants (SNVs) and insertions and deletions (INDELs) and the genotype concordance with the Illumina HD canine SNP array. For each of the four dogs, a 200 bp fragment library was constructed from genomic DNA and sequenced on two Ion PI chips per dog to reach mean coverage of 6-8x of the canine genome (genome size ≈ 2.4 Gb).

Results: On average, each Ion PI chip yielded approximately 73.3 million reads with a mean read length of 130 bp (~9.5 Gb sequence data) of which 98.5 % could be aligned to the canine reference genome (CanFam3.1). By sequencing a single dog using one fragment library and two Ion PI chips, on average 80 % of the genome and 77 % exome was covered by at least four reads. After removing duplicate reads (20.7 %) the mean coverage across the whole genome was 6x. Using sequence data from all four individuals (four fragment libraries and eight Ion PI chips) the genome and exome coverage could be further increased to 97.2 and 94.3 %, respectively. We detected 4.83 million unique SNPs and 6.10 million unique INDEL positions across all individuals. A comparison between SNP genotypes detected with the WGS and the 170 K Illumina HD canine SNP array showed 90 % concordance.

Conclusions: We have evaluated whole-genome sequencing on the Ion Proton system for genetic variant detection in four Chinese crested dogs. Even though INDEL calling with Ion Proton data is challenging due to specific platform errors, in case of SNP calling it can serve as an alternative to other next-generation sequencing platforms and SNP genotyping arrays, in studies aiming to identify causative mutations for rare monogenic diseases. In addition, we have identified new genetic variants of the Chinese Crested dog that will contribute to further whole-genome sequencing studies aimed to identify mutations associated with monogenic diseases with autosomal recessive inheritance.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
审稿时长
13 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信