{"title":"Sequali: efficient and comprehensive quality control of short- and long-read sequencing data.","authors":"Ruben H P Vorderman","doi":"10.1093/bioadv/vbaf010","DOIUrl":null,"url":null,"abstract":"<p><strong>Motivation: </strong>Quality control of sequencing data is the first step in many sequencing workflows. Short- and long-read sequencing technologies have many commonalities with regard to quality control. Several quality control programs exist; however, none possess a feature set that is adequate for both technologies. Quality control programs aimed at Oxford Nanopore Technologies sequencing lack vital features, such as adapter searching, overrepresented sequence analysis, and duplication analysis.</p><p><strong>Results: </strong>Sequali was developed to provide sequencing quality control for both short- and long-read sequencing technologies. It features adapter search, overrepresented sequence analysis, and duplication analysis and supports FASTQ and uBAM inputs. It is significantly faster than comparable sequencing quality control programs for both short- and long-read sequencing technologies.</p><p><strong>Availability and implementation: </strong>Sequali is an open-source Python application using C extensions and is freely available under the AGPL-3.0 license at https://github.com/rhpvorderman/sequali. The source code for each release is archived at zenodo: https://zenodo.org/doi/10.5281/zenodo.10822485.</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"5 1","pages":"vbaf010"},"PeriodicalIF":2.4000,"publicationDate":"2025-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11802474/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioinformatics advances","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/bioadv/vbaf010","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Motivation: Quality control of sequencing data is the first step in many sequencing workflows. Short- and long-read sequencing technologies have many commonalities with regard to quality control. Several quality control programs exist; however, none possess a feature set that is adequate for both technologies. Quality control programs aimed at Oxford Nanopore Technologies sequencing lack vital features, such as adapter searching, overrepresented sequence analysis, and duplication analysis.
Results: Sequali was developed to provide sequencing quality control for both short- and long-read sequencing technologies. It features adapter search, overrepresented sequence analysis, and duplication analysis and supports FASTQ and uBAM inputs. It is significantly faster than comparable sequencing quality control programs for both short- and long-read sequencing technologies.
Availability and implementation: Sequali is an open-source Python application using C extensions and is freely available under the AGPL-3.0 license at https://github.com/rhpvorderman/sequali. The source code for each release is archived at zenodo: https://zenodo.org/doi/10.5281/zenodo.10822485.