Huaming Zhong, Xuan Shao, Jing Cao, Jie Huang, Jing Wang, Nuo Yang, Baodong Yuan
{"title":"比较爬行动物基因组中微卫星的分布模式","authors":"Huaming Zhong, Xuan Shao, Jing Cao, Jie Huang, Jing Wang, Nuo Yang, Baodong Yuan","doi":"10.1002/ece3.70458","DOIUrl":null,"url":null,"abstract":"<p>Microsatellites or simple sequence repeats (SSRs) are prevalent across various organisms' genomes. However, their distribution patterns and evolutionary dynamics in reptile genomes are rarely studied systematically. We herein conducted a comprehensive analysis of SSRs in the genomes of 36 reptile species. Our findings revealed that the total number of SSRs ranged from 1,840,965 to 7,664,452, accounting for 2.16%–8.19% of the genomes analyzed. The relative density ranged from 21,567.82 to 81,889.41 bp per megabase (Mbp). The abundance of different SSR categories followed the pattern of imperfect SSR (I-SSR) > perfect SSR (P-SSR) > compound SSR (C-SSR). A significant positive correlation was observed between the number of SSRs and genome size (<i>p</i> = 0.0034), whereas SSR frequency (<i>p</i> = 0.013) or density (<i>p</i> = 0.0099) showed a negative correlation with genome size. Furthermore, no correlation was found between SSR length and genome size. Mononucleotide repeats were the most common P-SSRs in crocodilians and turtles, whereas mononucleotides, trinucleotides, or tetranucleotides were the most common P-SSRs in snakes, lizards, and tuatara. P-SSRs of varying motif sizes showed nonrandom distribution across different genic regions, with AT-rich repeats being predominant. The genomic SSR content of the squamate lineage ranked the highest in abundance and variability, whereas crocodilians and turtles showed a slowly evolving and reduced microsatellite landscape. Gene ontology enrichment and Kyoto Encyclopedia of Genes and Genomes pathway analyses indicated that genes harboring P-SSRs in the coding DNA sequence regions were primarily involved in the regulation of transcription and translation processes. The SSR dataset generated in this study provides potential candidates for functional analysis and calls for broader-scale analyses across the evolutionary spectrum.</p>","PeriodicalId":2,"journal":{"name":"ACS Applied Bio Materials","volume":null,"pages":null},"PeriodicalIF":4.6000,"publicationDate":"2024-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/ece3.70458","citationCount":"0","resultStr":"{\"title\":\"Comparison of the Distribution Patterns of Microsatellites Across the Genomes of Reptiles\",\"authors\":\"Huaming Zhong, Xuan Shao, Jing Cao, Jie Huang, Jing Wang, Nuo Yang, Baodong Yuan\",\"doi\":\"10.1002/ece3.70458\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Microsatellites or simple sequence repeats (SSRs) are prevalent across various organisms' genomes. However, their distribution patterns and evolutionary dynamics in reptile genomes are rarely studied systematically. We herein conducted a comprehensive analysis of SSRs in the genomes of 36 reptile species. Our findings revealed that the total number of SSRs ranged from 1,840,965 to 7,664,452, accounting for 2.16%–8.19% of the genomes analyzed. The relative density ranged from 21,567.82 to 81,889.41 bp per megabase (Mbp). The abundance of different SSR categories followed the pattern of imperfect SSR (I-SSR) > perfect SSR (P-SSR) > compound SSR (C-SSR). A significant positive correlation was observed between the number of SSRs and genome size (<i>p</i> = 0.0034), whereas SSR frequency (<i>p</i> = 0.013) or density (<i>p</i> = 0.0099) showed a negative correlation with genome size. Furthermore, no correlation was found between SSR length and genome size. Mononucleotide repeats were the most common P-SSRs in crocodilians and turtles, whereas mononucleotides, trinucleotides, or tetranucleotides were the most common P-SSRs in snakes, lizards, and tuatara. P-SSRs of varying motif sizes showed nonrandom distribution across different genic regions, with AT-rich repeats being predominant. The genomic SSR content of the squamate lineage ranked the highest in abundance and variability, whereas crocodilians and turtles showed a slowly evolving and reduced microsatellite landscape. Gene ontology enrichment and Kyoto Encyclopedia of Genes and Genomes pathway analyses indicated that genes harboring P-SSRs in the coding DNA sequence regions were primarily involved in the regulation of transcription and translation processes. The SSR dataset generated in this study provides potential candidates for functional analysis and calls for broader-scale analyses across the evolutionary spectrum.</p>\",\"PeriodicalId\":2,\"journal\":{\"name\":\"ACS Applied Bio Materials\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":4.6000,\"publicationDate\":\"2024-11-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1002/ece3.70458\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ACS Applied Bio Materials\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/ece3.70458\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"MATERIALS SCIENCE, BIOMATERIALS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Applied Bio Materials","FirstCategoryId":"99","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/ece3.70458","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATERIALS SCIENCE, BIOMATERIALS","Score":null,"Total":0}
引用次数: 0
摘要
微卫星或简单序列重复序列(SSR)普遍存在于各种生物的基因组中。然而,它们在爬行动物基因组中的分布模式和进化动态却很少得到系统的研究。我们在本文中对 36 种爬行动物基因组中的 SSR 进行了全面分析。我们的研究结果表明,SSR的总数从1,840,965个到7,664,452个不等,占所分析基因组的2.16%-8.19%。相对密度范围为每兆碱基(Mbp)21,567.82 至 81,889.41 bp。不同 SSR 类别的丰度遵循不完全 SSR(I-SSR)> 完全 SSR(P-SSR)> 复合 SSR(C-SSR)的模式。SSR 的数量与基因组大小呈明显的正相关(p = 0.0034),而 SSR 的频率(p = 0.013)或密度(p = 0.0099)与基因组大小呈负相关。此外,SSR 长度与基因组大小之间没有相关性。单核苷酸重复序列是鳄鱼和龟类最常见的 P-SSR,而单核苷酸、三核苷酸或四核苷酸则是蛇类、蜥蜴和图瓦人最常见的 P-SSR。不同图案大小的P-SSR在不同基因区域呈现非随机分布,其中以富含AT的重复序列为主。有鳞类的基因组SSR含量在丰度和变异性方面都名列前茅,而鳄鱼和龟类则显示出缓慢进化和减少的微卫星景观。基因本体富集和京都基因和基因组百科全书途径分析表明,在编码DNA序列区携带P-SSR的基因主要参与转录和翻译过程的调控。本研究产生的 SSR 数据集为功能分析提供了潜在的候选者,并要求在整个进化谱中进行更广泛的分析。
Comparison of the Distribution Patterns of Microsatellites Across the Genomes of Reptiles
Microsatellites or simple sequence repeats (SSRs) are prevalent across various organisms' genomes. However, their distribution patterns and evolutionary dynamics in reptile genomes are rarely studied systematically. We herein conducted a comprehensive analysis of SSRs in the genomes of 36 reptile species. Our findings revealed that the total number of SSRs ranged from 1,840,965 to 7,664,452, accounting for 2.16%–8.19% of the genomes analyzed. The relative density ranged from 21,567.82 to 81,889.41 bp per megabase (Mbp). The abundance of different SSR categories followed the pattern of imperfect SSR (I-SSR) > perfect SSR (P-SSR) > compound SSR (C-SSR). A significant positive correlation was observed between the number of SSRs and genome size (p = 0.0034), whereas SSR frequency (p = 0.013) or density (p = 0.0099) showed a negative correlation with genome size. Furthermore, no correlation was found between SSR length and genome size. Mononucleotide repeats were the most common P-SSRs in crocodilians and turtles, whereas mononucleotides, trinucleotides, or tetranucleotides were the most common P-SSRs in snakes, lizards, and tuatara. P-SSRs of varying motif sizes showed nonrandom distribution across different genic regions, with AT-rich repeats being predominant. The genomic SSR content of the squamate lineage ranked the highest in abundance and variability, whereas crocodilians and turtles showed a slowly evolving and reduced microsatellite landscape. Gene ontology enrichment and Kyoto Encyclopedia of Genes and Genomes pathway analyses indicated that genes harboring P-SSRs in the coding DNA sequence regions were primarily involved in the regulation of transcription and translation processes. The SSR dataset generated in this study provides potential candidates for functional analysis and calls for broader-scale analyses across the evolutionary spectrum.