在大型水池中发现标记

Bioinformatics Advances Pub Date : 2024-07-27 DOI:10.1093/bioadv/vbae113

Beatriz Vieira Mourato, Ivan Tsers, Svenja Denker, Fabian Klötzl, Bernhard Haubold

{"title":"在大型水池中发现标记","authors":"Beatriz Vieira Mourato, Ivan Tsers, Svenja Denker, Fabian Klötzl, Bernhard Haubold","doi":"10.1093/bioadv/vbae113","DOIUrl":null,"url":null,"abstract":"\n \n \n Markers for polymerase chain reaction are routinely constructed by taking regions common to the genomes of a target organism and subtracting the regions found in the targets’ closest relatives, their neighbors. This approach is implemented in the published package Fur, which originally required memory proportional to the number of nucleotides in the neighborhood. This does not scale well.\n \n \n \n Here we describe a new version of Fur that only requires memory proportional to the longest neighbor. In spite of its greater memory efficiency, the new Fur remains fast and is accurate. We demonstrate this through application to simulated sequences and comparison to an efficient alternative. Then we use the new Fur to extract markers from 120 reference bacteria. To make this feasible, we also introduce software for automatically finding target and neighbor genomes and for assessing markers. We pick the best primers from the ten most sequenced reference bacteria and show their excellent in silico sensitivity and specificity.\n \n \n \n Fur is available from github.com/evolbioinf/fur, in the Docker image hub.docker.com/r/beatrizvm/mapro, and in the Code Ocean capsule 10.24433/CO.7955947.v1.\n","PeriodicalId":505477,"journal":{"name":"Bioinformatics Advances","volume":"91 12","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Marker Discovery in the Large\",\"authors\":\"Beatriz Vieira Mourato, Ivan Tsers, Svenja Denker, Fabian Klötzl, Bernhard Haubold\",\"doi\":\"10.1093/bioadv/vbae113\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\n \\n \\n Markers for polymerase chain reaction are routinely constructed by taking regions common to the genomes of a target organism and subtracting the regions found in the targets’ closest relatives, their neighbors. This approach is implemented in the published package Fur, which originally required memory proportional to the number of nucleotides in the neighborhood. This does not scale well.\\n \\n \\n \\n Here we describe a new version of Fur that only requires memory proportional to the longest neighbor. In spite of its greater memory efficiency, the new Fur remains fast and is accurate. We demonstrate this through application to simulated sequences and comparison to an efficient alternative. Then we use the new Fur to extract markers from 120 reference bacteria. To make this feasible, we also introduce software for automatically finding target and neighbor genomes and for assessing markers. We pick the best primers from the ten most sequenced reference bacteria and show their excellent in silico sensitivity and specificity.\\n \\n \\n \\n Fur is available from github.com/evolbioinf/fur, in the Docker image hub.docker.com/r/beatrizvm/mapro, and in the Code Ocean capsule 10.24433/CO.7955947.v1.\\n\",\"PeriodicalId\":505477,\"journal\":{\"name\":\"Bioinformatics Advances\",\"volume\":\"91 12\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-07-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Bioinformatics Advances\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1093/bioadv/vbae113\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioinformatics Advances","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/bioadv/vbae113","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

聚合酶链式反应的标记通常是通过提取目标生物基因组中的共同区域，然后减去目标生物近亲（即邻近生物）中的区域来构建的。这种方法在已发布的 Fur 软件包中实现，该软件包最初需要的内存与邻域中核苷酸的数量成正比。这并不能很好地扩展。在这里，我们将介绍一个新版本的 Fur，它只需要与最长邻域成比例的内存。尽管新版 Fur 的内存效率更高，但速度仍然很快，而且准确度也很高。我们通过对模拟序列的应用以及与高效替代方案的比较来证明这一点。然后，我们使用新毛皮从 120 个参考细菌中提取标记。为了使这项工作切实可行，我们还引入了自动查找目标基因组和邻近基因组以及评估标记的软件。我们从十种测序最多的参考细菌中挑选出了最佳引物，并展示了它们出色的硅灵敏度和特异性。 Fur可从github.com/evolbioinf/fur、Docker镜像 hub.docker.com/r/beatrizvm/mapro和Code Ocean胶囊10.24433/CO.7955947.v1中获取。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Marker Discovery in the Large

Markers for polymerase chain reaction are routinely constructed by taking regions common to the genomes of a target organism and subtracting the regions found in the targets’ closest relatives, their neighbors. This approach is implemented in the published package Fur, which originally required memory proportional to the number of nucleotides in the neighborhood. This does not scale well. Here we describe a new version of Fur that only requires memory proportional to the longest neighbor. In spite of its greater memory efficiency, the new Fur remains fast and is accurate. We demonstrate this through application to simulated sequences and comparison to an efficient alternative. Then we use the new Fur to extract markers from 120 reference bacteria. To make this feasible, we also introduce software for automatically finding target and neighbor genomes and for assessing markers. We pick the best primers from the ten most sequenced reference bacteria and show their excellent in silico sensitivity and specificity. Fur is available from github.com/evolbioinf/fur, in the Docker image hub.docker.com/r/beatrizvm/mapro, and in the Code Ocean capsule 10.24433/CO.7955947.v1.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Bioinformatics Advances

自引率

0.00%

发文量