Reverse Engineering Gene Networks: A Comparative Study at Genome-scale

Sriram P. Chockalingam, M. Aluru, Hongqing Guo, Yanhai Yin, S. Aluru
{"title":"Reverse Engineering Gene Networks: A Comparative Study at Genome-scale","authors":"Sriram P. Chockalingam, M. Aluru, Hongqing Guo, Yanhai Yin, S. Aluru","doi":"10.1145/3107411.3107428","DOIUrl":null,"url":null,"abstract":"Motivation: Reverse engineering gene networks from expression data is a widelymstudied problem, for which numerous mathematical models have been developed. Network reconstruction methods can be used to study specific pathways, or can be applied at the whole-genome scale to analyze large compendiums of expression datasets to uncover genome-wide interactions. However, few methods can scale to such large number of genes and experiments, and to date, genome-scale comparative assessment of network reconstruction methods has largely been limited to simpler organisms such as E. coli. Results: In this paper, we analyze 11,760 microarray experiments on the model plant Arabidopsis thaliana drawn from public repositories. We generate genome scale networks of Arabidopsis using three different methods -- Pearson correlation, mutual information, and graphical Gaussian modeling -- and analyze and compare these networks to test for their robustness in successfully recovering relationships between functionally related genes. We demonstrate that functional grouping of microarray experiments into different tissue types and experimental conditions is important to discover context-specific interactions. Our comparisons include benchmarking against experimentally confirmed interactions, the Arabidopsis network resource AraNet, and study of specific pathways. Our results show that networks generated by the mutual information based method have better characteristics in terms of functional modularity as measured by both connected component and sub-network extraction analysis with respect to gene sets selected from brassinosteroid and stress regulation pathways. Availability: The classification datasets and constructed genome-scale networks are publicly available at the location http://alurulab.cc.gatech.edu/arabidopsis-networks","PeriodicalId":246388,"journal":{"name":"Proceedings of the 8th ACM International Conference on Bioinformatics, Computational Biology,and Health Informatics","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 8th ACM International Conference on Bioinformatics, Computational Biology,and Health Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3107411.3107428","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

Motivation: Reverse engineering gene networks from expression data is a widelymstudied problem, for which numerous mathematical models have been developed. Network reconstruction methods can be used to study specific pathways, or can be applied at the whole-genome scale to analyze large compendiums of expression datasets to uncover genome-wide interactions. However, few methods can scale to such large number of genes and experiments, and to date, genome-scale comparative assessment of network reconstruction methods has largely been limited to simpler organisms such as E. coli. Results: In this paper, we analyze 11,760 microarray experiments on the model plant Arabidopsis thaliana drawn from public repositories. We generate genome scale networks of Arabidopsis using three different methods -- Pearson correlation, mutual information, and graphical Gaussian modeling -- and analyze and compare these networks to test for their robustness in successfully recovering relationships between functionally related genes. We demonstrate that functional grouping of microarray experiments into different tissue types and experimental conditions is important to discover context-specific interactions. Our comparisons include benchmarking against experimentally confirmed interactions, the Arabidopsis network resource AraNet, and study of specific pathways. Our results show that networks generated by the mutual information based method have better characteristics in terms of functional modularity as measured by both connected component and sub-network extraction analysis with respect to gene sets selected from brassinosteroid and stress regulation pathways. Availability: The classification datasets and constructed genome-scale networks are publicly available at the location http://alurulab.cc.gatech.edu/arabidopsis-networks
逆向工程基因网络:基因组尺度的比较研究
动机:从表达数据中反向工程基因网络是一个被广泛研究的问题,为此已经开发了许多数学模型。网络重建方法可用于研究特定途径,或可在全基因组尺度上应用于分析表达数据集的大型概要,以揭示全基因组的相互作用。然而,很少有方法可以扩展到如此大量的基因和实验,并且到目前为止,基因组规模的网络重建方法的比较评估在很大程度上仅限于更简单的生物体,如大肠杆菌。结果:本文分析了从公共数据库中提取的模式植物拟南芥的11,760个微阵列实验。我们使用三种不同的方法——Pearson相关、互信息和图形高斯建模——生成拟南芥基因组规模网络,并分析和比较这些网络,以测试它们在成功恢复功能相关基因之间关系方面的稳健性。我们证明,将微阵列实验功能分组为不同的组织类型和实验条件对于发现上下文特定的相互作用是重要的。我们的比较包括对实验证实的相互作用的基准,拟南芥网络资源AraNet,以及特定途径的研究。我们的研究结果表明,基于互信息的方法生成的网络在功能模块化方面具有更好的特征,这是通过连接组件和子网络提取分析来测量的,相对于从油菜素内酯和应激调节途径中选择的基因集。可用性:分类数据集和构建的基因组规模网络可在http://alurulab.cc.gatech.edu/arabidopsis-networks公开获取
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信