拟南芥体细胞胚胎发生(SE)的加权基因相关网络分析(WGCNA)和关键基因模块的鉴定,以发现 SE 相关枢纽基因。

IF 2.6 4区 生物学 Q3 BIOCHEMISTRY & MOLECULAR BIOLOGY
International Journal of Genomics Pub Date : 2022-07-04 eCollection Date: 2022-01-01 DOI:10.1155/2022/7471063
Kithmee K de Silva, Jim M Dunwell, Anushka M Wickramasuriya
{"title":"拟南芥体细胞胚胎发生(SE)的加权基因相关网络分析(WGCNA)和关键基因模块的鉴定,以发现 SE 相关枢纽基因。","authors":"Kithmee K de Silva, Jim M Dunwell, Anushka M Wickramasuriya","doi":"10.1155/2022/7471063","DOIUrl":null,"url":null,"abstract":"<p><p>Somatic embryogenesis (SE), which occurs naturally in many plant species, serves as a model to elucidate cellular and molecular mechanisms of embryo patterning in plants. Decoding the regulatory landscape of SE is essential for its further application. Hence, the present study was aimed at employing Weighted Gene Correlation Network Analysis (WGCNA) to construct a gene coexpression network (GCN) for <i>Arabidopsis</i> SE and then identifying highly correlated gene modules to uncover the hub genes associated with SE that may serve as potential molecular targets. A total of 17,059 genes were filtered from a microarray dataset comprising four stages of SE, i.e., stage I (zygotic embryos), stage II (proliferating tissues at 7 days of induction), stage III (proliferating tissues at 14 days of induction), and stage IV (mature somatic embryos). This included 1,711 transcription factors and 445 <i>EMBRYO DEFECTIVE</i> genes. GCN analysis identified a total of 26 gene modules with the module size ranging from 35 to 3,418 genes using a dynamic cut tree algorithm. The module-trait analysis revealed that four, four, seven, and four modules were associated with stages I, II, III, and IV, respectively. Further, we identified a total of 260 hub genes based on the degree of intramodular connectivity. Validation of the hub genes using publicly available expression datasets demonstrated that at least 78 hub genes are potentially associated with embryogenesis; of these, many genes remain functionally uncharacterized thus far. <i>In silico</i> promoter analysis of these genes revealed the presence of <i>cis</i>-acting regulatory elements, \"soybean embryo factor 4 (SEF4) binding site,\" and \"E-box\" of the napA storage-protein gene of <i>Brassica napus</i>; this suggests that these genes may play important roles in plant embryo development. The present study successfully applied WGCNA to construct a GCN for SE in <i>Arabidopsis</i> and identified hub genes involved in the development of somatic embryos. These hub genes could be used as molecular targets to further elucidate the molecular mechanisms underlying SE in plants.</p>","PeriodicalId":13988,"journal":{"name":"International Journal of Genomics","volume":null,"pages":null},"PeriodicalIF":2.6000,"publicationDate":"2022-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9274236/pdf/","citationCount":"0","resultStr":"{\"title\":\"Weighted Gene Correlation Network Analysis (WGCNA) of <i>Arabidopsis</i> Somatic Embryogenesis (SE) and Identification of Key Gene Modules to Uncover SE-Associated Hub Genes.\",\"authors\":\"Kithmee K de Silva, Jim M Dunwell, Anushka M Wickramasuriya\",\"doi\":\"10.1155/2022/7471063\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Somatic embryogenesis (SE), which occurs naturally in many plant species, serves as a model to elucidate cellular and molecular mechanisms of embryo patterning in plants. Decoding the regulatory landscape of SE is essential for its further application. Hence, the present study was aimed at employing Weighted Gene Correlation Network Analysis (WGCNA) to construct a gene coexpression network (GCN) for <i>Arabidopsis</i> SE and then identifying highly correlated gene modules to uncover the hub genes associated with SE that may serve as potential molecular targets. A total of 17,059 genes were filtered from a microarray dataset comprising four stages of SE, i.e., stage I (zygotic embryos), stage II (proliferating tissues at 7 days of induction), stage III (proliferating tissues at 14 days of induction), and stage IV (mature somatic embryos). This included 1,711 transcription factors and 445 <i>EMBRYO DEFECTIVE</i> genes. GCN analysis identified a total of 26 gene modules with the module size ranging from 35 to 3,418 genes using a dynamic cut tree algorithm. The module-trait analysis revealed that four, four, seven, and four modules were associated with stages I, II, III, and IV, respectively. Further, we identified a total of 260 hub genes based on the degree of intramodular connectivity. Validation of the hub genes using publicly available expression datasets demonstrated that at least 78 hub genes are potentially associated with embryogenesis; of these, many genes remain functionally uncharacterized thus far. <i>In silico</i> promoter analysis of these genes revealed the presence of <i>cis</i>-acting regulatory elements, \\\"soybean embryo factor 4 (SEF4) binding site,\\\" and \\\"E-box\\\" of the napA storage-protein gene of <i>Brassica napus</i>; this suggests that these genes may play important roles in plant embryo development. The present study successfully applied WGCNA to construct a GCN for SE in <i>Arabidopsis</i> and identified hub genes involved in the development of somatic embryos. These hub genes could be used as molecular targets to further elucidate the molecular mechanisms underlying SE in plants.</p>\",\"PeriodicalId\":13988,\"journal\":{\"name\":\"International Journal of Genomics\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":2.6000,\"publicationDate\":\"2022-07-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9274236/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Genomics\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1155/2022/7471063\",\"RegionNum\":4,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2022/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q3\",\"JCRName\":\"BIOCHEMISTRY & MOLECULAR BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Genomics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1155/2022/7471063","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2022/1/1 0:00:00","PubModel":"eCollection","JCR":"Q3","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

体细胞胚胎发生(SE)在许多植物物种中自然发生,是阐明植物胚胎形态的细胞和分子机制的模型。解码体细胞胚胎发生的调控格局对其进一步应用至关重要。因此,本研究旨在利用加权基因相关网络分析(WGCNA)构建拟南芥 SE 的基因共表达网络(GCN),然后识别高度相关的基因模块,从而发现与 SE 相关的枢纽基因,这些基因可能成为潜在的分子靶标。从包括拟南芥 SE 四个阶段(即阶段 I(合子胚胎)、阶段 II(诱导 7 天时的增殖组织)、阶段 III(诱导 14 天时的增殖组织)和阶段 IV(成熟体细胞胚胎))的芯片数据集中共筛选出 17,059 个基因。其中包括 1,711 个转录因子和 445 个胚胎缺陷基因。利用动态剪切树算法,GCN 分析共确定了 26 个基因模块,模块大小从 35 个基因到 3418 个基因不等。模块-性状分析显示,分别有 4 个、4 个、7 个和 4 个模块与 I、II、III 和 IV 期相关。此外,我们还根据模块内的连通程度确定了共 260 个中心基因。利用公开的表达数据集对这些中心基因进行的验证表明,至少有 78 个中心基因可能与胚胎发生有关;其中许多基因的功能至今仍未定性。对这些基因的启动子进行的硅分析表明,这些基因存在顺式调控元件、"大豆胚胎因子 4(SEF4)结合位点 "和甘蓝型油菜的 napA 储存蛋白基因的 "E-box";这表明这些基因可能在植物胚胎发育过程中发挥重要作用。本研究成功应用 WGCNA 构建了拟南芥 SE 的 GCN,并发现了参与体细胞胚胎发育的枢纽基因。这些中心基因可作为分子靶标,进一步阐明植物 SE 的分子机制。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Weighted Gene Correlation Network Analysis (WGCNA) of <i>Arabidopsis</i> Somatic Embryogenesis (SE) and Identification of Key Gene Modules to Uncover SE-Associated Hub Genes.

Weighted Gene Correlation Network Analysis (WGCNA) of <i>Arabidopsis</i> Somatic Embryogenesis (SE) and Identification of Key Gene Modules to Uncover SE-Associated Hub Genes.

Weighted Gene Correlation Network Analysis (WGCNA) of <i>Arabidopsis</i> Somatic Embryogenesis (SE) and Identification of Key Gene Modules to Uncover SE-Associated Hub Genes.

Weighted Gene Correlation Network Analysis (WGCNA) of Arabidopsis Somatic Embryogenesis (SE) and Identification of Key Gene Modules to Uncover SE-Associated Hub Genes.

Somatic embryogenesis (SE), which occurs naturally in many plant species, serves as a model to elucidate cellular and molecular mechanisms of embryo patterning in plants. Decoding the regulatory landscape of SE is essential for its further application. Hence, the present study was aimed at employing Weighted Gene Correlation Network Analysis (WGCNA) to construct a gene coexpression network (GCN) for Arabidopsis SE and then identifying highly correlated gene modules to uncover the hub genes associated with SE that may serve as potential molecular targets. A total of 17,059 genes were filtered from a microarray dataset comprising four stages of SE, i.e., stage I (zygotic embryos), stage II (proliferating tissues at 7 days of induction), stage III (proliferating tissues at 14 days of induction), and stage IV (mature somatic embryos). This included 1,711 transcription factors and 445 EMBRYO DEFECTIVE genes. GCN analysis identified a total of 26 gene modules with the module size ranging from 35 to 3,418 genes using a dynamic cut tree algorithm. The module-trait analysis revealed that four, four, seven, and four modules were associated with stages I, II, III, and IV, respectively. Further, we identified a total of 260 hub genes based on the degree of intramodular connectivity. Validation of the hub genes using publicly available expression datasets demonstrated that at least 78 hub genes are potentially associated with embryogenesis; of these, many genes remain functionally uncharacterized thus far. In silico promoter analysis of these genes revealed the presence of cis-acting regulatory elements, "soybean embryo factor 4 (SEF4) binding site," and "E-box" of the napA storage-protein gene of Brassica napus; this suggests that these genes may play important roles in plant embryo development. The present study successfully applied WGCNA to construct a GCN for SE in Arabidopsis and identified hub genes involved in the development of somatic embryos. These hub genes could be used as molecular targets to further elucidate the molecular mechanisms underlying SE in plants.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
International Journal of Genomics
International Journal of Genomics BIOCHEMISTRY & MOLECULAR BIOLOGY-BIOTECHNOLOGY & APPLIED MICROBIOLOGY
CiteScore
5.40
自引率
0.00%
发文量
33
审稿时长
17 weeks
期刊介绍: International Journal of Genomics is a peer-reviewed, Open Access journal that publishes research articles as well as review articles in all areas of genome-scale analysis. Topics covered by the journal include, but are not limited to: bioinformatics, clinical genomics, disease genomics, epigenomics, evolutionary genomics, functional genomics, genome engineering, and synthetic genomics.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信