Effect of Sequence Filtering on the Genome Assembly of Bacillus altitudinis Isolated from Ilex paraguariensis

IF 0.8 · Zone 4 (Biology) · JCR Q4 (Plant Sciences)
Bothalia Pub Date : 2021-01-15 DOI:10.15446/ABC.V26N2.86406
Ileana Julieta Cortese, M. Castrillo, P. Zapata, M. E. Laczeski
Journal: Bothalia · Volume: 35 1 · Pages: 170-177 · Publication type: Journal Article
Citations: 1

Abstract

Regardless of the sequencing technology used, sequence filtering is an essential step in which low-quality reads, or low-quality portions of reads, are removed. During assembly, a genome is reconstructed by joining short reads into contigs. Some assemblers rely on the relationships between fixed-length subsequences (k-mers), which can be affected by the presence of low-quality sequences. A common approach to evaluating assemblies is based on the number of contigs, the length of the longest contig, and the N50 value, defined as the length of the shortest contig among the largest contigs that together account for at least 50 % of the assembly length. In this context, the present study aimed to evaluate the effect of using raw versus filtered reads on the quality parameters obtained when assembling the genome of Bacillus altitudinis strain 19RS3, isolated from Ilex paraguariensis. The quality of both starting files was analyzed with FastQC, and the reads were filtered with Trimmomatic. SPAdes was used for the assembly and QUAST for its evaluation. The best assembly for B. altitudinis 19RS3 was obtained from the filtered reads with a k-mer value of 79, which generated 16 contigs longer than 500 bp, with an N50 of 931 914 bp and a longest contig of 966 271 bp.
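The three evaluation metrics named in the abstract (number of contigs, longest contig, and N50) can be illustrated with a minimal Python sketch. This is not the QUAST implementation; the function names `n50` and `assembly_summary` and the toy contig lengths are hypothetical, and the 500 bp cut-off is only an assumption that mirrors the threshold quoted in the abstract.

```python
from typing import Dict, Iterable, List


def n50(lengths: Iterable[int]) -> int:
    """Return the N50 of a set of contig lengths (0 for an empty set).

    Contigs are sorted from longest to shortest and their lengths are
    accumulated; N50 is the length of the contig at which the running
    total first reaches half of the total assembly length.
    """
    ordered: List[int] = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    return 0


def assembly_summary(lengths: Iterable[int], min_length: int = 500) -> Dict[str, int]:
    """Summarise an assembly using only contigs of at least min_length bp.

    The 500 bp default is an assumption taken from the abstract's
    "contigs greater than 500 bp" wording.
    """
    kept = [n for n in lengths if n >= min_length]
    return {
        "num_contigs": len(kept),
        "largest_contig": max(kept, default=0),
        "total_length": sum(kept),
        "n50": n50(kept),
    }


# Hypothetical contig lengths (in bp), not data from the study.
toy_lengths = [400, 1000, 2000, 3000, 4000]
print(assembly_summary(toy_lengths))
```

In the toy example the 400 bp contig is discarded, the remaining contigs total 10 000 bp, and the two longest contigs (4000 + 3000 bp) are the first to reach half of that total, so the N50 is 3000 bp. Comparing such summaries for assemblies built from raw and from filtered reads is the kind of evaluation the abstract describes.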
Source journal: Bothalia (Biology: Plant Sciences)
CiteScore: 1.70
Self-citation rate: 0.00%
Articles published: 12
Journal description: Bothalia: African Biodiversity & Conservation is published by AOSIS for the South African National Biodiversity Institute (SANBI) and aims to disseminate knowledge, information and innovative approaches that promote and enhance the wise use and management of biodiversity in order to sustain the systems and species that support and benefit the people of Africa. The journal was previously published as Bothalia and had served the South African botanical community since 1921. However, the expanded mandate of SANBI necessitated a broader scope for the journal, and in 2014 the subtitle African Biodiversity & Conservation was added to reflect this change.