Genomic re-sequencing reveals mutational divergence across genetically engineered strains of model archaea

bioRxiv Pub Date : 2024-08-09 DOI:10.1101/2024.08.08.607208
Andrew L. Soborowski, Rylee K Hackley, Sungmin Hwang, Guangyin Zhou, Keely A. Dulmage, Peter Schönheit, Charles Daniels, Alexandre W. Bisson-Filho, Anita Marchfelder, J. Maupin-Furlow, Thorsten Allers, Amy K. Schmid
{"title":"Genomic re-sequencing reveals mutational divergence across genetically engineered strains of model archaea","authors":"Andrew L. Soborowski, Rylee K Hackley, Sungmin Hwang, Guangyin Zhou, Keely A. Dulmage, Peter Schönheit, Charles Daniels, Alexandre W. Bisson-Filho, Anita Marchfelder, J. Maupin-Furlow, Thorsten Allers, Amy K. Schmid","doi":"10.1101/2024.08.08.607208","DOIUrl":null,"url":null,"abstract":"Because archaea are the evolutionary ancestors of eukaryotes, archaeal molecular biology has been a topic of intense recent research. The hypersaline adapted archaeal species Halobacterium salinarum and Haloferax volcanii serve as important model organisms because facile tools enable genetic manipulation. As a result, the number of strains in circulation among the haloarchaeal research community has increased over the last few decades. However, the degree of genetic divergence and effects on genetic integrity during inter-lab transfers remain unclear. To address this question, we performed whole genome re-sequencing on a cross-section of wild-type, parental, and knockout strains in both model species. Integrating these data with existing repositories of re-sequencing data, we identify mutations that have arisen in a collection of 60 strains, sampled from 2 species across 8 different labs. Independent of sequencing, we construct strain lineages, identifying branch points and significant genetic effects in strain history. Combining this with our sequencing data, we identify small clusters of mutations that definitively separate lab strains. Additionally, an analysis of gene knockout strains suggests that roughly 1 in 3 strains currently in use harbors second-site mutations of potential phenotypic impact. Overall, we find that divergence among lab strains is thus far minimal, though as the archaeal research community continues to grow, careful strain provenance and genomic re-sequencing are required to keep inter-lab divergence to a minimum, prevent the compounding of mutations into fully independent lineages, and maintain the current high degree of reproducible research between lab groups in the haloarchaeal research community. Data Summary Novel sequencing data for this project was submitted to the National Center for Biotechnology Information (NCBI) Sequence Read Archive (SRA) and can be found under bioproject accession PRJNA1120443. SRA accessions for previously published sequencing data are available in supplementary table 1. R code for performing analysis and generating figures is available at https://github.com/andrew-soborowski/halophile_genome_resequencing. Impact Statement Archaea are important due to their shared evolutionary history with eukaryotes. As the archaeal research community grows, keeping track of the genetic integrity of archaeal strains of interest is necessary. In particular, routine genetic manipulations and the common practice of sharing strains between labs allow mutations to arise in lab stocks. If these mutations affect cellular processes, they may jeopardize the reproducibility of work between research groups and confound the results of future studies. In this work, we examine DNA sequences from 60 strains across two species of archaea. We identify shared and unique mutations occurring between and within strains. Independently, we trace the lineage of each strain, identifying which genetic manipulations lead to observed off-target mutations. While overall divergence across labs is minimal so far, our work highlights the need for labs to continue proper strain husbandry.","PeriodicalId":505198,"journal":{"name":"bioRxiv","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"bioRxiv","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2024.08.08.607208","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Because archaea are the evolutionary ancestors of eukaryotes, archaeal molecular biology has been a topic of intense recent research. The hypersaline adapted archaeal species Halobacterium salinarum and Haloferax volcanii serve as important model organisms because facile tools enable genetic manipulation. As a result, the number of strains in circulation among the haloarchaeal research community has increased over the last few decades. However, the degree of genetic divergence and effects on genetic integrity during inter-lab transfers remain unclear. To address this question, we performed whole genome re-sequencing on a cross-section of wild-type, parental, and knockout strains in both model species. Integrating these data with existing repositories of re-sequencing data, we identify mutations that have arisen in a collection of 60 strains, sampled from 2 species across 8 different labs. Independent of sequencing, we construct strain lineages, identifying branch points and significant genetic effects in strain history. Combining this with our sequencing data, we identify small clusters of mutations that definitively separate lab strains. Additionally, an analysis of gene knockout strains suggests that roughly 1 in 3 strains currently in use harbors second-site mutations of potential phenotypic impact. Overall, we find that divergence among lab strains is thus far minimal, though as the archaeal research community continues to grow, careful strain provenance and genomic re-sequencing are required to keep inter-lab divergence to a minimum, prevent the compounding of mutations into fully independent lineages, and maintain the current high degree of reproducible research between lab groups in the haloarchaeal research community. Data Summary Novel sequencing data for this project was submitted to the National Center for Biotechnology Information (NCBI) Sequence Read Archive (SRA) and can be found under bioproject accession PRJNA1120443. SRA accessions for previously published sequencing data are available in supplementary table 1. R code for performing analysis and generating figures is available at https://github.com/andrew-soborowski/halophile_genome_resequencing. Impact Statement Archaea are important due to their shared evolutionary history with eukaryotes. As the archaeal research community grows, keeping track of the genetic integrity of archaeal strains of interest is necessary. In particular, routine genetic manipulations and the common practice of sharing strains between labs allow mutations to arise in lab stocks. If these mutations affect cellular processes, they may jeopardize the reproducibility of work between research groups and confound the results of future studies. In this work, we examine DNA sequences from 60 strains across two species of archaea. We identify shared and unique mutations occurring between and within strains. Independently, we trace the lineage of each strain, identifying which genetic manipulations lead to observed off-target mutations. While overall divergence across labs is minimal so far, our work highlights the need for labs to continue proper strain husbandry.
基因组重测序揭示模式古菌基因工程菌株的突变差异
由于古细菌是真核生物的进化祖先,因此古细菌分子生物学一直是近期研究的热点。适应高盐的古细菌物种盐卤杆菌(Haloobacterium salinarum)和火山卤杆菌(Haloferax volcanii)是重要的模式生物,因为它们的基因操作工具非常简单。因此,过去几十年来,在卤代古细菌研究界流通的菌株数量不断增加。然而,实验室间转移过程中的遗传差异程度和对遗传完整性的影响仍不清楚。为了解决这个问题,我们对两个模式物种的野生型、亲本和基因敲除菌株进行了全基因组重测序。通过将这些数据与现有的重测序数据资源库进行整合,我们确定了从 8 个不同实验室的 2 个物种中抽取的 60 个菌株中出现的突变。独立于测序,我们构建了菌株系谱,确定了菌株历史中的分支点和重要遗传效应。结合测序数据,我们确定了能明确区分实验室菌株的突变小簇。此外,对基因敲除菌株的分析表明,目前使用的菌株中,大约每 3 个菌株中就有 1 个存在对表型有潜在影响的第二位点突变。总之,我们发现迄今为止实验室菌株之间的差异极小,不过随着古细菌研究群体的不断扩大,需要对菌株出处和基因组重新测序进行仔细研究,以将实验室之间的差异保持在最低水平,防止突变复合成完全独立的品系,并保持目前光古细菌研究群体中实验室组之间研究的高度可重复性。数据摘要 本项目的新测序数据已提交给美国国家生物技术信息中心(NCBI)序列读取档案(SRA),可在生物项目登录号 PRJNA1120443 下找到。以前发表的测序数据的 SRA 编号见补充表 1。用于进行分析和生成图表的 R 代码可在 https://github.com/andrew-soborowski/halophile_genome_resequencing 上获取。影响声明 古细菌与真核生物有着共同的进化历史,因此非常重要。随着古细菌研究群体的不断壮大,跟踪感兴趣的古细菌菌株的遗传完整性十分必要。特别是,常规遗传操作和实验室之间共享菌株的常见做法会导致实验室菌株发生突变。如果这些突变影响了细胞过程,就可能危及研究小组之间工作的可重复性,并干扰未来研究的结果。在这项工作中,我们研究了来自两种古细菌 60 个菌株的 DNA 序列。我们确定了菌株之间和菌株内部发生的共同突变和独特突变。同时,我们追踪了每个菌株的世系,确定了哪些基因操作导致了观察到的脱靶突变。虽然到目前为止,各实验室之间的总体差异很小,但我们的工作凸显了实验室继续妥善管理菌株的必要性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信