An expectation-maximization algorithm for estimating proportions of deletions among bacterial populations with application to study antibiotic resistance gene transfer in Enterococcus faecalis.
Yu Zhang, Cong Zhang, Wenwen Huo, Xinlei Wang, Michael Zhang, Kelli Palmer, Min Chen
{"title":"An expectation-maximization algorithm for estimating proportions of deletions among bacterial populations with application to study antibiotic resistance gene transfer in <i>Enterococcus faecalis</i>.","authors":"Yu Zhang, Cong Zhang, Wenwen Huo, Xinlei Wang, Michael Zhang, Kelli Palmer, Min Chen","doi":"10.1007/s42995-022-00144-z","DOIUrl":null,"url":null,"abstract":"<p><p>The emergence of antibiotic resistance in bacteria limits the availability of antibiotic choices for treatment and infection control, thereby representing a major threat to human health. The de novo mutation of bacterial genomes is an essential mechanism by which bacteria acquire antibiotic resistance. Previously, deletion mutations within bacterial immune systems, ranging from dozens to thousands of base pairs (bps) in length, have been associated with the spread of antibiotic resistance. Most current methods for evaluating genomic structural variations (SVs) have concentrated on detecting them, rather than estimating the proportions of populations that carry distinct SVs. A better understanding of the distribution of mutations and subpopulations dynamics in bacterial populations is needed to appreciate antibiotic resistance evolution and movement of resistance genes through populations. Here, we propose a statistical model to estimate the proportions of genomic deletions in a mixed population based on Expectation-Maximization (EM) algorithms and next-generation sequencing (NGS) data. The method integrates both insert size and split-read mapping information to iteratively update estimated distributions. The proposed method was evaluated with three simulations that demonstrated the production of accurate estimations. The proposed method was then applied to investigate the horizontal transfers of antibiotic resistance genes in concert with changes in the CRISPR-Cas system of <i>E. faecalis</i>.</p><p><strong>Supplementary information: </strong>The online version contains supplementary material available at 10.1007/s42995-022-00144-z.</p>","PeriodicalId":53218,"journal":{"name":"Marine Life Science & Technology","volume":"5 1","pages":"28-43"},"PeriodicalIF":5.8000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9888353/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Marine Life Science & Technology","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1007/s42995-022-00144-z","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2023/1/31 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"MARINE & FRESHWATER BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
The emergence of antibiotic resistance in bacteria limits the availability of antibiotic choices for treatment and infection control, thereby representing a major threat to human health. The de novo mutation of bacterial genomes is an essential mechanism by which bacteria acquire antibiotic resistance. Previously, deletion mutations within bacterial immune systems, ranging from dozens to thousands of base pairs (bps) in length, have been associated with the spread of antibiotic resistance. Most current methods for evaluating genomic structural variations (SVs) have concentrated on detecting them, rather than estimating the proportions of populations that carry distinct SVs. A better understanding of the distribution of mutations and subpopulations dynamics in bacterial populations is needed to appreciate antibiotic resistance evolution and movement of resistance genes through populations. Here, we propose a statistical model to estimate the proportions of genomic deletions in a mixed population based on Expectation-Maximization (EM) algorithms and next-generation sequencing (NGS) data. The method integrates both insert size and split-read mapping information to iteratively update estimated distributions. The proposed method was evaluated with three simulations that demonstrated the production of accurate estimations. The proposed method was then applied to investigate the horizontal transfers of antibiotic resistance genes in concert with changes in the CRISPR-Cas system of E. faecalis.
Supplementary information: The online version contains supplementary material available at 10.1007/s42995-022-00144-z.
期刊介绍:
Marine Life Science & Technology (MLST), established in 2019, is dedicated to publishing original research papers that unveil new discoveries and theories spanning a wide spectrum of life sciences and technologies. This includes fundamental biology, fisheries science and technology, medicinal bioresources, food science, biotechnology, ecology, and environmental biology, with a particular focus on marine habitats.
The journal is committed to nurturing synergistic interactions among these diverse disciplines, striving to advance multidisciplinary approaches within the scientific field. It caters to a readership comprising biological scientists, aquaculture researchers, marine technologists, biological oceanographers, and ecologists.