The presence of multiple variants of IncF plasmid alleles in a single genome sequence can hinder accurate replicon sequence typing using in silico pMLST tools.

IF 5 2区 生物学 Q1 MICROBIOLOGY
mSystems Pub Date : 2025-04-08 DOI:10.1128/msystems.01010-24
Michaela Ruzickova, Jana Palkovicova, Ivo Papousek, Max L Cummins, Steven P Djordjevic, Monika Dolejska
{"title":"The presence of multiple variants of IncF plasmid alleles in a single genome sequence can hinder accurate replicon sequence typing using <i>in silico</i> pMLST tools.","authors":"Michaela Ruzickova, Jana Palkovicova, Ivo Papousek, Max L Cummins, Steven P Djordjevic, Monika Dolejska","doi":"10.1128/msystems.01010-24","DOIUrl":null,"url":null,"abstract":"<p><p>IncF plasmids are mobile genetic elements found in bacteria from the <i>Enterobacteriaceae</i> family and often carry critical antibiotic and virulence gene cargo. The classification of IncF plasmids using the plasmid Multi-Locus Sequence Typing (pMLST) tool from the Center for Genomic Epidemiology (CGE; https://www.genomicepidemiology.org/) compares the sequences of IncF alleles against a database to create a plasmid sequence type (ST). Accurate identification of plasmid STs is useful as it enables an assessment of IncF plasmid lineages associated with pandemic enterobacterial STs. Our initial observations showed discrepancies in IncF allele variants reported by pMLST in a collection of 898 <i>Escherichia coli</i> ST131 genomes. To evaluate the limitations of the pMLST tool, we interrogated an in-house and public repository of 70,324 <i>E. coli</i> genomes of various STs and other <i>Enterobacteriaceae</i> genomes (<i>n</i> = 1247). All short-read assemblies and representatives selected for long-read sequencing were used to assess pMLST allele variants and to compare the output of pMLST tool versions. When multiple allele variants occurred in a single bacterial genome, the Python and web versions of the tool randomly selected one allele to report, leading to limited and inaccurate ST identification. Discrepancies were detected in 5,804 of 72,469 genomes (8.01%). Long-read sequencing of 27 genomes confirmed multiple IncF allele variants on one plasmid or two separate IncF plasmids in a single bacterial cell. The pMLST tool was unable to accurately distinguish allele variants and their location on replicons using short-read genome assemblies, or long-read genome assemblies if the same allele variant was present more than once.</p><p><strong>Importance: </strong>Plasmid sequence type is crucial for describing IncF plasmids due to their capacity to carry important antibiotic and virulence gene cargo and consequently due to their association with disease-causing enterobacterial lineages exhibiting resistance to clinically relevant antibiotics in humans and food-producing animals. As a result, precise reporting of IncF allele variants in IncF plasmids is necessary. Comparison of the FAB formulae generated by the pMLST tool with annotated long-read genome assemblies identified inconsistencies, including examples where multiple IncF allele variants were present on the same plasmid but missing in the FAB formula, or in cases where two IncF plasmids were detected in one bacterial cell, and the pMLST output provided information only about one plasmid. Such inconsistencies may cloud interpretation of IncF plasmid replicon type in specific bacterial lineages or inaccurate assumptions of host strain clonality.</p>","PeriodicalId":18819,"journal":{"name":"mSystems","volume":" ","pages":"e0101024"},"PeriodicalIF":5.0000,"publicationDate":"2025-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"mSystems","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1128/msystems.01010-24","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MICROBIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

IncF plasmids are mobile genetic elements found in bacteria from the Enterobacteriaceae family and often carry critical antibiotic and virulence gene cargo. The classification of IncF plasmids using the plasmid Multi-Locus Sequence Typing (pMLST) tool from the Center for Genomic Epidemiology (CGE; https://www.genomicepidemiology.org/) compares the sequences of IncF alleles against a database to create a plasmid sequence type (ST). Accurate identification of plasmid STs is useful as it enables an assessment of IncF plasmid lineages associated with pandemic enterobacterial STs. Our initial observations showed discrepancies in IncF allele variants reported by pMLST in a collection of 898 Escherichia coli ST131 genomes. To evaluate the limitations of the pMLST tool, we interrogated an in-house and public repository of 70,324 E. coli genomes of various STs and other Enterobacteriaceae genomes (n = 1247). All short-read assemblies and representatives selected for long-read sequencing were used to assess pMLST allele variants and to compare the output of pMLST tool versions. When multiple allele variants occurred in a single bacterial genome, the Python and web versions of the tool randomly selected one allele to report, leading to limited and inaccurate ST identification. Discrepancies were detected in 5,804 of 72,469 genomes (8.01%). Long-read sequencing of 27 genomes confirmed multiple IncF allele variants on one plasmid or two separate IncF plasmids in a single bacterial cell. The pMLST tool was unable to accurately distinguish allele variants and their location on replicons using short-read genome assemblies, or long-read genome assemblies if the same allele variant was present more than once.

Importance: Plasmid sequence type is crucial for describing IncF plasmids due to their capacity to carry important antibiotic and virulence gene cargo and consequently due to their association with disease-causing enterobacterial lineages exhibiting resistance to clinically relevant antibiotics in humans and food-producing animals. As a result, precise reporting of IncF allele variants in IncF plasmids is necessary. Comparison of the FAB formulae generated by the pMLST tool with annotated long-read genome assemblies identified inconsistencies, including examples where multiple IncF allele variants were present on the same plasmid but missing in the FAB formula, or in cases where two IncF plasmids were detected in one bacterial cell, and the pMLST output provided information only about one plasmid. Such inconsistencies may cloud interpretation of IncF plasmid replicon type in specific bacterial lineages or inaccurate assumptions of host strain clonality.

在单个基因组序列中存在IncF质粒等位基因的多个变体可能会妨碍使用硅pMLST工具进行精确的复制子序列分型。
IncF 质粒是肠杆菌科细菌中发现的移动遗传元件,通常携带重要的抗生素和毒力基因。基因组流行病学中心 (CGE; https://www.genomicepidemiology.org/) 使用质粒多焦点序列分型 (pMLST) 工具对 IncF 质粒进行分类,将 IncF 等位基因的序列与数据库进行比较,以创建质粒序列类型 (ST)。准确鉴定质粒 ST 非常有用,因为这样就能评估与大流行性肠杆菌 ST 相关的 IncF 质粒系。我们的初步观察显示,在 898 个大肠杆菌 ST131 基因组中,pMLST 报告的 IncF 等位基因变异存在差异。为了评估 pMLST 工具的局限性,我们查询了一个包含 70,324 个不同 ST 的大肠杆菌基因组和其他肠杆菌科基因组(n = 1247)的内部和公共资源库。所有短线程组装和长线程测序选出的代表用于评估 pMLST 等位基因变异,并比较 pMLST 工具版本的输出结果。当单个细菌基因组中出现多个等位基因变异时,Python 和网络版工具会随机选择一个等位基因进行报告,从而导致ST鉴定的局限性和不准确性。在 72,469 个基因组中有 5,804 个(8.01%)发现了差异。对 27 个基因组进行的长读测序证实,在一个质粒上存在多个 IncF 等位基因变体,或在一个细菌细胞中存在两个独立的 IncF 质粒。如果同一等位基因变体出现不止一次,pMLST 工具无法准确区分等位基因变体及其在复制子上的位置:质粒序列类型对于描述 IncF 质粒至关重要,因为它们能够携带重要的抗生素和毒力基因,并因此与人类和食用动物中对临床相关抗生素表现出抗药性的致病肠杆菌系相关联。因此,有必要准确报告 IncF 质粒中的 IncF 等位基因变异。将 pMLST 工具生成的 FAB 公式与注释的长读程基因组组装进行比较,发现了一些不一致的地方,包括同一质粒上存在多个 IncF 等位基因变体,但 FAB 公式中却缺失;或者在一个细菌细胞中检测到两个 IncF 质粒,而 pMLST 输出只提供了其中一个质粒的信息。这种不一致性可能会影响对特定细菌系中 IncF 质粒复制子类型的解释,或影响对宿主菌株克隆性的不准确假设。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
mSystems
mSystems Biochemistry, Genetics and Molecular Biology-Biochemistry
CiteScore
10.50
自引率
3.10%
发文量
308
审稿时长
13 weeks
期刊介绍: mSystems™ will publish preeminent work that stems from applying technologies for high-throughput analyses to achieve insights into the metabolic and regulatory systems at the scale of both the single cell and microbial communities. The scope of mSystems™ encompasses all important biological and biochemical findings drawn from analyses of large data sets, as well as new computational approaches for deriving these insights. mSystems™ will welcome submissions from researchers who focus on the microbiome, genomics, metagenomics, transcriptomics, metabolomics, proteomics, glycomics, bioinformatics, and computational microbiology. mSystems™ will provide streamlined decisions, while carrying on ASM''s tradition of rigorous peer review.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信