Boquila: NGS读取模拟器,以消除序列分析中读取的核苷酸偏差

IF 1.1 4区 生物学 Q3 BIOLOGY
Umit Akkose, Ogun Adebali
{"title":"Boquila: NGS读取模拟器,以消除序列分析中读取的核苷酸偏差","authors":"Umit Akkose, Ogun Adebali","doi":"10.1101/2022.03.29.486262","DOIUrl":null,"url":null,"abstract":"Sequence content is heterogeneous throughout genomes. Therefore, Genome-wide NGS reads biased towards specific nucleotide profiles are affected by the genome-wide heterogeneous nucleotide distribution. Boquila generates sequences that mimic the nucleotide profile of true reads, which can be used to correct the nucleotide-based bias of genome-wide distribution of NGS reads. Boquila can be configured to generate reads from only specified regions of the reference genome. It also allows the use of input DNA sequencing to correct the bias due to the copy number variations in the genome. Boquila uses standard file formats for input and output data, and it can be easily integrated into any workflow for high-throughput sequencing applications.","PeriodicalId":23358,"journal":{"name":"Turkish Journal of Biology","volume":"47 1","pages":"158 - 163"},"PeriodicalIF":1.1000,"publicationDate":"2022-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Boquila: NGS read simulator to eliminate read nucleotide bias in sequence analysis\",\"authors\":\"Umit Akkose, Ogun Adebali\",\"doi\":\"10.1101/2022.03.29.486262\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Sequence content is heterogeneous throughout genomes. Therefore, Genome-wide NGS reads biased towards specific nucleotide profiles are affected by the genome-wide heterogeneous nucleotide distribution. Boquila generates sequences that mimic the nucleotide profile of true reads, which can be used to correct the nucleotide-based bias of genome-wide distribution of NGS reads. Boquila can be configured to generate reads from only specified regions of the reference genome. It also allows the use of input DNA sequencing to correct the bias due to the copy number variations in the genome. Boquila uses standard file formats for input and output data, and it can be easily integrated into any workflow for high-throughput sequencing applications.\",\"PeriodicalId\":23358,\"journal\":{\"name\":\"Turkish Journal of Biology\",\"volume\":\"47 1\",\"pages\":\"158 - 163\"},\"PeriodicalIF\":1.1000,\"publicationDate\":\"2022-03-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Turkish Journal of Biology\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1101/2022.03.29.486262\",\"RegionNum\":4,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Turkish Journal of Biology","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1101/2022.03.29.486262","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"BIOLOGY","Score":null,"Total":0}
引用次数: 1

摘要

序列内容在整个基因组中是异质的。因此,偏向于特定核苷酸谱的全基因组NGS读数受到全基因组异质核苷酸分布的影响。Boquila生成了模拟真实reads的核苷酸谱的序列,这可以用来纠正NGS reads全基因组分布中基于核苷酸的偏差。Boquila可以配置为仅从参考基因组的特定区域生成读取。它还允许使用输入DNA测序来纠正由于基因组拷贝数变化而产生的偏差。Boquila使用标准文件格式的输入和输出数据,它可以很容易地集成到任何工作流程的高通量测序应用程序。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Boquila: NGS read simulator to eliminate read nucleotide bias in sequence analysis
Sequence content is heterogeneous throughout genomes. Therefore, Genome-wide NGS reads biased towards specific nucleotide profiles are affected by the genome-wide heterogeneous nucleotide distribution. Boquila generates sequences that mimic the nucleotide profile of true reads, which can be used to correct the nucleotide-based bias of genome-wide distribution of NGS reads. Boquila can be configured to generate reads from only specified regions of the reference genome. It also allows the use of input DNA sequencing to correct the bias due to the copy number variations in the genome. Boquila uses standard file formats for input and output data, and it can be easily integrated into any workflow for high-throughput sequencing applications.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
4.60
自引率
0.00%
发文量
20
审稿时长
6-12 weeks
期刊介绍: The Turkish Journal of Biology is published electronically 6 times a year by the Scientific and Technological Research Council of Turkey (TÜBİTAK) and accepts English-language manuscripts concerning all kinds of biological processes including biochemistry and biosynthesis, physiology and metabolism, molecular genetics, molecular biology, genomics, proteomics, molecular farming, biotechnology/genetic transformation, nanobiotechnology, bioinformatics and systems biology, cell and developmental biology, stem cell biology, and reproductive biology. Contribution is open to researchers of all nationalities.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信