Yan Guo, Yuwei Song, Limin Jiang, Yu Chen, Michele Ceccarelli, Min Gao, Zechen Chong
{"title":"A detailed guide to assessing genome assembly based on long-read sequencing data using Inspector.","authors":"Yan Guo, Yuwei Song, Limin Jiang, Yu Chen, Michele Ceccarelli, Min Gao, Zechen Chong","doi":"10.1038/s41596-025-01149-5","DOIUrl":null,"url":null,"abstract":"<p><p>Long-read sequencing technologies yield extended DNA sequences capable of spanning intricate, repetitive genome regions, thereby facilitating the generation of more precise and comprehensive genome assemblies. However, assembly errors are inevitable owing to inherent genomic complexity and limitations of sequencing technology and assembly algorithms, making assembly evaluation crucial. The genome assembly evaluation tool Inspector presents several advantages over existing long-read de novo assembly evaluation tools, including (1) both reference-free and reference-guided assembly evaluation; (2) the ability to detect both small- and large-scale structural errors; (3) the option of assembly error correction, which can improve the quality value of the original assembly; and (4) the ability to perform haplotype-resolved assembly evaluation. Inspector can provide not only basic contig and alignment statistics, but also the precise locations and types of the different structural errors. These advantages provide a robust framework for long-read assembly evaluation. In this Protocol, we showcase four procedures to demonstrate the different applications of Inspector for long-read assembly evaluation. Inspector software and additional guides can be found at https://github.com/ChongLab/Inspector_protocol .</p>","PeriodicalId":18901,"journal":{"name":"Nature Protocols","volume":" ","pages":""},"PeriodicalIF":13.1000,"publicationDate":"2025-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nature Protocols","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1038/s41596-025-01149-5","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
Long-read sequencing technologies yield extended DNA sequences capable of spanning intricate, repetitive genome regions, thereby facilitating the generation of more precise and comprehensive genome assemblies. However, assembly errors are inevitable owing to inherent genomic complexity and limitations of sequencing technology and assembly algorithms, making assembly evaluation crucial. The genome assembly evaluation tool Inspector presents several advantages over existing long-read de novo assembly evaluation tools, including (1) both reference-free and reference-guided assembly evaluation; (2) the ability to detect both small- and large-scale structural errors; (3) the option of assembly error correction, which can improve the quality value of the original assembly; and (4) the ability to perform haplotype-resolved assembly evaluation. Inspector can provide not only basic contig and alignment statistics, but also the precise locations and types of the different structural errors. These advantages provide a robust framework for long-read assembly evaluation. In this Protocol, we showcase four procedures to demonstrate the different applications of Inspector for long-read assembly evaluation. Inspector software and additional guides can be found at https://github.com/ChongLab/Inspector_protocol .
期刊介绍:
Nature Protocols focuses on publishing protocols used to address significant biological and biomedical science research questions, including methods grounded in physics and chemistry with practical applications to biological problems. The journal caters to a primary audience of research scientists and, as such, exclusively publishes protocols with research applications. Protocols primarily aimed at influencing patient management and treatment decisions are not featured.
The specific techniques covered encompass a wide range, including but not limited to: Biochemistry, Cell biology, Cell culture, Chemical modification, Computational biology, Developmental biology, Epigenomics, Genetic analysis, Genetic modification, Genomics, Imaging, Immunology, Isolation, purification, and separation, Lipidomics, Metabolomics, Microbiology, Model organisms, Nanotechnology, Neuroscience, Nucleic-acid-based molecular biology, Pharmacology, Plant biology, Protein analysis, Proteomics, Spectroscopy, Structural biology, Synthetic chemistry, Tissue culture, Toxicology, and Virology.