转录网络中大量的连接基序，一个使用随机森林回归的案例研究

EAI Endorsed Trans. Mob. Commun. Appl. Pub Date : 2016-05-24 DOI:10.4108/eai.3-12-2015.2262520

Khajamoinuddin Syed, Bhanu K. Kamapantula, Michael L. Mayo, E. Perkins, P. Ghosh

{"title":"转录网络中大量的连接基序，一个使用随机森林回归的案例研究","authors":"Khajamoinuddin Syed, Bhanu K. Kamapantula, Michael L. Mayo, E. Perkins, P. Ghosh","doi":"10.4108/eai.3-12-2015.2262520","DOIUrl":null,"url":null,"abstract":"Biological network topologies are known to be robust despite internal and external perturbances. Motifs such as feed-forward loop and bifan have been marked to contribute to structural and functional significance. While network characteristics such as network density, average shortest path, and centrality measures etc., have been well studied, modular characteristics have not been explored in similar detail. Motif connectivity might play a major role in regulation under high perturbations. Connected motif abundance can skew network robustness as well. To test this hypothesis, we study the significance of the two connected feed-forward \n \nloop motifs using random forest regression modeling. We define thirty eight network features, fifteen of which are static and dynamic features and the other twenty three are two feed-forward loop connected motif features. We identify significant features among these using random forests regression and create models that can be used to train and predict the robustness of the biological networks. The performance of these models is measured using coefficient of determination metric and the significance of the features themselves is \n \ncharacterized using feature importance. Our experiments \n \nreveal that connected feed-forward loop motifs do not contribute to the robustness of network when models are created with all 38 features. For models with only connected motif features, the performance of a specific rhombus motif under high loss stands out.","PeriodicalId":299985,"journal":{"name":"EAI Endorsed Trans. Mob. Commun. Appl.","volume":"35 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Abundance of connected motifs in transcriptional networks, a case study using random forests regression\",\"authors\":\"Khajamoinuddin Syed, Bhanu K. Kamapantula, Michael L. Mayo, E. Perkins, P. Ghosh\",\"doi\":\"10.4108/eai.3-12-2015.2262520\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Biological network topologies are known to be robust despite internal and external perturbances. Motifs such as feed-forward loop and bifan have been marked to contribute to structural and functional significance. While network characteristics such as network density, average shortest path, and centrality measures etc., have been well studied, modular characteristics have not been explored in similar detail. Motif connectivity might play a major role in regulation under high perturbations. Connected motif abundance can skew network robustness as well. To test this hypothesis, we study the significance of the two connected feed-forward \\n \\nloop motifs using random forest regression modeling. We define thirty eight network features, fifteen of which are static and dynamic features and the other twenty three are two feed-forward loop connected motif features. We identify significant features among these using random forests regression and create models that can be used to train and predict the robustness of the biological networks. The performance of these models is measured using coefficient of determination metric and the significance of the features themselves is \\n \\ncharacterized using feature importance. Our experiments \\n \\nreveal that connected feed-forward loop motifs do not contribute to the robustness of network when models are created with all 38 features. For models with only connected motif features, the performance of a specific rhombus motif under high loss stands out.\",\"PeriodicalId\":299985,\"journal\":{\"name\":\"EAI Endorsed Trans. Mob. Commun. Appl.\",\"volume\":\"35 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-05-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"EAI Endorsed Trans. Mob. Commun. Appl.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.4108/eai.3-12-2015.2262520\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"EAI Endorsed Trans. Mob. Commun. Appl.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4108/eai.3-12-2015.2262520","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

生物网络拓扑被认为是鲁棒的，尽管内部和外部的扰动。诸如前馈环和双环等基序已被标记为具有结构和功能意义。虽然网络特征，如网络密度、平均最短路径和中心性测度等，已经得到了很好的研究，但模块化特征还没有得到类似的详细探讨。Motif连通性可能在高扰动下的调控中起主要作用。连接motif的丰度也会影响网络的稳健性。为了验证这一假设，我们使用随机森林回归模型研究了两个连接的前馈环基序的意义。我们定义了38个网络特征，其中15个是静态和动态特征，另外23个是两个前馈回路连接的motif特征。我们使用随机森林回归识别其中的重要特征，并创建可用于训练和预测生物网络鲁棒性的模型。这些模型的性能用决定系数度量，特征本身的显著性用特征重要性来表征。我们的实验表明，当使用所有38个特征创建模型时，连接的前馈回路基元对网络的鲁棒性没有贡献。对于只有连通基序特征的模型，在高损耗情况下，特定菱形基序的性能表现突出。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Abundance of connected motifs in transcriptional networks, a case study using random forests regression

Biological network topologies are known to be robust despite internal and external perturbances. Motifs such as feed-forward loop and bifan have been marked to contribute to structural and functional significance. While network characteristics such as network density, average shortest path, and centrality measures etc., have been well studied, modular characteristics have not been explored in similar detail. Motif connectivity might play a major role in regulation under high perturbations. Connected motif abundance can skew network robustness as well. To test this hypothesis, we study the significance of the two connected feed-forward loop motifs using random forest regression modeling. We define thirty eight network features, fifteen of which are static and dynamic features and the other twenty three are two feed-forward loop connected motif features. We identify significant features among these using random forests regression and create models that can be used to train and predict the robustness of the biological networks. The performance of these models is measured using coefficient of determination metric and the significance of the features themselves is characterized using feature importance. Our experiments reveal that connected feed-forward loop motifs do not contribute to the robustness of network when models are created with all 38 features. For models with only connected motif features, the performance of a specific rhombus motif under high loss stands out.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

EAI Endorsed Trans. Mob. Commun. Appl.

自引率

0.00%

发文量