RhizoBindingSites v2.0 是一个从基因组位点推导出的可能参与转录调控的 DNA 元基的生物信息学数据库。

IF 2.3 Q3 BIOCHEMICAL RESEARCH METHODS
Bioinformatics and Biology Insights Pub Date : 2024-09-06 eCollection Date: 2024-01-01 DOI:10.1177/11779322241272395
Hermenegildo Taboada-Castro, Alfredo José Hernández-Álvarez, Jaime A Castro-Mondragón, Sergio Encarnación-Guevara
{"title":"RhizoBindingSites v2.0 是一个从基因组位点推导出的可能参与转录调控的 DNA 元基的生物信息学数据库。","authors":"Hermenegildo Taboada-Castro, Alfredo José Hernández-Álvarez, Jaime A Castro-Mondragón, Sergio Encarnación-Guevara","doi":"10.1177/11779322241272395","DOIUrl":null,"url":null,"abstract":"<p><p>RhizoBindingSites is a <i>de novo</i> depurified database of conserved DNA motifs potentially involved in the transcriptional regulation of the <i>Rhizobium</i>, <i>Sinorhizobium</i>, <i>Bradyrhizobium</i>, <i>Azorhizobium</i>, and <i>Mesorhizobium</i> genera covering 9 representative symbiotic species, deduced from the upstream regulatory sequences of orthologous genes (O-matrices) from the Rhizobiales taxon. The sites collected with O-matrices per gene per genome from RhizoBindingSites were used to deduce matrices using the dyad-Regulatory Sequence Analysis Tool (RSAT) method, giving rise to novel S-matrices for the construction of the RizoBindingSites v2.0 database. A comparison of the S-matrix logos showed a greater frequency and/or re-definition of specific-position nucleotides found in the O-matrices. Moreover, S-matrices were better at detecting genes in the genome, and there was a more significant number of transcription factors (TFs) in the vicinity than O-matrices, corresponding to a more significant genomic coverage for S-matrices. O-matrices of 3187 TFs and S-matrices of 2754 TFs from 9 species were deposited in RhizoBindingSites and RhizoBindingSites v2.0, respectively. The homology between the matrices of TFs from a genome showed inter-regulation between the clustered TFs. In addition, matrices of AraC, ArsR, GntR, and LysR ortholog TFs showed different motifs, suggesting distinct regulation. Benchmarking showed 72%, 68%, and 81% of common genes per regulon for O-matrices and approximately 14% less common genes with S-matrices of <i>Rhizobium etli</i> CFN42, <i>Rhizobium leguminosarum</i> bv. <i>viciae</i> 3841, and <i>Sinorhizobium meliloti</i> 1021. These data were deposited in RhizoBindingSites and the RhizoBindingSites v2.0 database (http://rhizobindingsites.ccg.unam.mx/).</p>","PeriodicalId":9065,"journal":{"name":"Bioinformatics and Biology Insights","volume":null,"pages":null},"PeriodicalIF":2.3000,"publicationDate":"2024-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11380129/pdf/","citationCount":"0","resultStr":"{\"title\":\"RhizoBindingSites v2.0 Is a Bioinformatic Database of DNA Motifs Potentially Involved in Transcriptional Regulation Deduced From Their Genomic Sites.\",\"authors\":\"Hermenegildo Taboada-Castro, Alfredo José Hernández-Álvarez, Jaime A Castro-Mondragón, Sergio Encarnación-Guevara\",\"doi\":\"10.1177/11779322241272395\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>RhizoBindingSites is a <i>de novo</i> depurified database of conserved DNA motifs potentially involved in the transcriptional regulation of the <i>Rhizobium</i>, <i>Sinorhizobium</i>, <i>Bradyrhizobium</i>, <i>Azorhizobium</i>, and <i>Mesorhizobium</i> genera covering 9 representative symbiotic species, deduced from the upstream regulatory sequences of orthologous genes (O-matrices) from the Rhizobiales taxon. The sites collected with O-matrices per gene per genome from RhizoBindingSites were used to deduce matrices using the dyad-Regulatory Sequence Analysis Tool (RSAT) method, giving rise to novel S-matrices for the construction of the RizoBindingSites v2.0 database. A comparison of the S-matrix logos showed a greater frequency and/or re-definition of specific-position nucleotides found in the O-matrices. Moreover, S-matrices were better at detecting genes in the genome, and there was a more significant number of transcription factors (TFs) in the vicinity than O-matrices, corresponding to a more significant genomic coverage for S-matrices. O-matrices of 3187 TFs and S-matrices of 2754 TFs from 9 species were deposited in RhizoBindingSites and RhizoBindingSites v2.0, respectively. The homology between the matrices of TFs from a genome showed inter-regulation between the clustered TFs. In addition, matrices of AraC, ArsR, GntR, and LysR ortholog TFs showed different motifs, suggesting distinct regulation. Benchmarking showed 72%, 68%, and 81% of common genes per regulon for O-matrices and approximately 14% less common genes with S-matrices of <i>Rhizobium etli</i> CFN42, <i>Rhizobium leguminosarum</i> bv. <i>viciae</i> 3841, and <i>Sinorhizobium meliloti</i> 1021. These data were deposited in RhizoBindingSites and the RhizoBindingSites v2.0 database (http://rhizobindingsites.ccg.unam.mx/).</p>\",\"PeriodicalId\":9065,\"journal\":{\"name\":\"Bioinformatics and Biology Insights\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":2.3000,\"publicationDate\":\"2024-09-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11380129/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Bioinformatics and Biology Insights\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1177/11779322241272395\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q3\",\"JCRName\":\"BIOCHEMICAL RESEARCH METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioinformatics and Biology Insights","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1177/11779322241272395","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q3","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0

摘要

根瘤结合位点(RhizoBindingSites)是一个从根瘤菌类(Rhizobiales)同源基因上游调控序列(O-matrices)中推导出的保守DNA基序数据库,其中包含根瘤菌属(Rhizobium)、 Sinorhizobium、 Bradyrhizobium、 Azorhizobium 和 Mesorhizobium 属(涵盖 9 个具有代表性的共生物种)中可能参与转录调控的保守DNA基序。从根瘤菌结合位点(RhizoBindingSites)中收集到的每个基因组每个基因的 O-位点,利用二元调控序列分析工具(RSAT)方法推导出了位点矩阵,从而产生了新的 S-位点矩阵,用于构建 RizoBindingSites v2.0 数据库。对 S-矩阵标识的比较显示,O-矩阵中特定位置核苷酸的出现频率更高,而且/或者对其进行了重新定义。此外,S-矩阵比 O-矩阵更善于检测基因组中的基因,附近的转录因子(TFs)数量也更多,这与 S-矩阵的基因组覆盖范围更广相对应。来自 9 个物种的 3187 个 TFs 的 O 矩阵和 2754 个 TFs 的 S 矩阵分别存入 RhizoBindingSites 和 RhizoBindingSites v2.0。来自一个基因组的 TFs 矩阵之间的同源性显示了聚类 TFs 之间的相互调控。此外,AraC、ArsR、GntR 和 LysR 直向同源基因 TF 的矩阵显示出不同的主题,表明它们之间存在不同的调控。基准分析显示,O 矩阵的每个调控子有 72%、68% 和 81% 的共同基因,而根瘤菌 etli CFN42、根瘤菌 leguminosarum bv. viciae 3841 和瓜萎镰刀菌 Sinorhizobium meliloti 1021 的 S 矩阵的共同基因则少了约 14%。这些数据已存入 RhizoBindingSites 和 RhizoBindingSites v2.0 数据库 (http://rhizobindingsites.ccg.unam.mx/)。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
RhizoBindingSites v2.0 Is a Bioinformatic Database of DNA Motifs Potentially Involved in Transcriptional Regulation Deduced From Their Genomic Sites.

RhizoBindingSites is a de novo depurified database of conserved DNA motifs potentially involved in the transcriptional regulation of the Rhizobium, Sinorhizobium, Bradyrhizobium, Azorhizobium, and Mesorhizobium genera covering 9 representative symbiotic species, deduced from the upstream regulatory sequences of orthologous genes (O-matrices) from the Rhizobiales taxon. The sites collected with O-matrices per gene per genome from RhizoBindingSites were used to deduce matrices using the dyad-Regulatory Sequence Analysis Tool (RSAT) method, giving rise to novel S-matrices for the construction of the RizoBindingSites v2.0 database. A comparison of the S-matrix logos showed a greater frequency and/or re-definition of specific-position nucleotides found in the O-matrices. Moreover, S-matrices were better at detecting genes in the genome, and there was a more significant number of transcription factors (TFs) in the vicinity than O-matrices, corresponding to a more significant genomic coverage for S-matrices. O-matrices of 3187 TFs and S-matrices of 2754 TFs from 9 species were deposited in RhizoBindingSites and RhizoBindingSites v2.0, respectively. The homology between the matrices of TFs from a genome showed inter-regulation between the clustered TFs. In addition, matrices of AraC, ArsR, GntR, and LysR ortholog TFs showed different motifs, suggesting distinct regulation. Benchmarking showed 72%, 68%, and 81% of common genes per regulon for O-matrices and approximately 14% less common genes with S-matrices of Rhizobium etli CFN42, Rhizobium leguminosarum bv. viciae 3841, and Sinorhizobium meliloti 1021. These data were deposited in RhizoBindingSites and the RhizoBindingSites v2.0 database (http://rhizobindingsites.ccg.unam.mx/).

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Bioinformatics and Biology Insights
Bioinformatics and Biology Insights BIOCHEMICAL RESEARCH METHODS-
CiteScore
6.80
自引率
1.70%
发文量
36
审稿时长
8 weeks
期刊介绍: Bioinformatics and Biology Insights is an open access, peer-reviewed journal that considers articles on bioinformatics methods and their applications which must pertain to biological insights. All papers should be easily amenable to biologists and as such help bridge the gap between theories and applications.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信