Sergey Abakumov, Elizabete Ruppeka-Rupeika, Xiong Chen, Arno Bouwens, Volker Leen, Peter Dedecker* and Johan Hofkens,
{"title":"DeepMAP: Deep CNN Classifiers Applied to Optical Mapping for Fast and Precise Species-Level Metagenomic Analysis","authors":"Sergey Abakumov, Elizabete Ruppeka-Rupeika, Xiong Chen, Arno Bouwens, Volker Leen, Peter Dedecker* and Johan Hofkens, ","doi":"10.1021/acsomega.4c0948510.1021/acsomega.4c09485","DOIUrl":null,"url":null,"abstract":"<p >DNA optical mapping is a powerful technique commonly used for structural variant calling and genome assembly verification. Despite being inherently high-throughput, the method has not yet been applied to highly complex settings such as species identification in microbiome analysis due to the lack of alignment algorithms that can both assign large numbers of reads in minutes and handle large database size. In this work, we present a novel genomic classification pipeline based on deep convolutional neural networks for optical mapping data (DeepMAP), which can perform fast and accurate assignment of individual optical maps to their respective genomes. We furthermore achieve a superior performance of DeepMAP in the presence of evolutionary divergent sequences, making it robust to the presence of unknown strains within metagenomic samples. We evaluate DeepMAP on genomic DNA extracted from bacterial mixtures, reaching species-level resolution with true positive rates of around 75% and a false positive rate of less than 1%, with measured classification speeds significantly outpacing those of previously developed approaches for high-density optical mapping data alignment.</p>","PeriodicalId":22,"journal":{"name":"ACS Omega","volume":"10 9","pages":"9224–9232 9224–9232"},"PeriodicalIF":3.7000,"publicationDate":"2025-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.acs.org/doi/epdf/10.1021/acsomega.4c09485","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Omega","FirstCategoryId":"92","ListUrlMain":"https://pubs.acs.org/doi/10.1021/acsomega.4c09485","RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
DNA optical mapping is a powerful technique commonly used for structural variant calling and genome assembly verification. Despite being inherently high-throughput, the method has not yet been applied to highly complex settings such as species identification in microbiome analysis due to the lack of alignment algorithms that can both assign large numbers of reads in minutes and handle large database size. In this work, we present a novel genomic classification pipeline based on deep convolutional neural networks for optical mapping data (DeepMAP), which can perform fast and accurate assignment of individual optical maps to their respective genomes. We furthermore achieve a superior performance of DeepMAP in the presence of evolutionary divergent sequences, making it robust to the presence of unknown strains within metagenomic samples. We evaluate DeepMAP on genomic DNA extracted from bacterial mixtures, reaching species-level resolution with true positive rates of around 75% and a false positive rate of less than 1%, with measured classification speeds significantly outpacing those of previously developed approaches for high-density optical mapping data alignment.
ACS OmegaChemical Engineering-General Chemical Engineering
CiteScore
6.60
自引率
4.90%
发文量
3945
审稿时长
2.4 months
期刊介绍:
ACS Omega is an open-access global publication for scientific articles that describe new findings in chemistry and interfacing areas of science, without any perceived evaluation of immediate impact.