阈值对基因共表达网络拓扑结构的影响

IF 3.743 Q2 Biochemistry, Genetics and Molecular Biology
Cynthia Martins Villar Couto, César Henrique Comin and Luciano da Fontoura Costa
{"title":"阈值对基因共表达网络拓扑结构的影响","authors":"Cynthia Martins Villar Couto, César Henrique Comin and Luciano da Fontoura Costa","doi":"10.1039/C7MB00101K","DOIUrl":null,"url":null,"abstract":"<p >Several developments regarding the analysis of gene co-expression profiles using complex network theory have been reported recently. Such approaches usually start with the construction of an unweighted gene co-expression network, therefore requiring the selection of a suitable threshold defining which pairs of vertices will be connected. We aimed at addressing such an important problem by suggesting and comparing five different approaches for threshold selection. Each of the methods considers a respective biologically-motivated criterion for electing a potentially suitable threshold. A set of 21 microarray experiments from different biological groups was used to investigate the effect of applying the five proposed criteria to several biological situations. For each experiment, we used the Pearson correlation coefficient to measure the relationship between each gene pair, and the resulting weight matrices were thresholded considering several values, generating respective adjacency matrices (co-expression networks). Each of the five proposed criteria was then applied in order to select the respective threshold value. The effects of these thresholding approaches on the topology of the resulting networks were compared by using several measurements, and we verified that, depending on the database, the impact on the topological properties can be large. However, a group of databases was verified to be similarly affected by most of the considered criteria. Based on such results, it can be suggested that when the generated networks present similar measurements, the thresholding method can be chosen with greater freedom. If the generated networks are markedly different, the thresholding method that better suits the interests of each specific research study represents a reasonable choice.</p>","PeriodicalId":90,"journal":{"name":"Molecular BioSystems","volume":null,"pages":null},"PeriodicalIF":3.7430,"publicationDate":"2017-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1039/C7MB00101K","citationCount":"10","resultStr":"{\"title\":\"Effects of threshold on the topology of gene co-expression networks†\",\"authors\":\"Cynthia Martins Villar Couto, César Henrique Comin and Luciano da Fontoura Costa\",\"doi\":\"10.1039/C7MB00101K\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p >Several developments regarding the analysis of gene co-expression profiles using complex network theory have been reported recently. Such approaches usually start with the construction of an unweighted gene co-expression network, therefore requiring the selection of a suitable threshold defining which pairs of vertices will be connected. We aimed at addressing such an important problem by suggesting and comparing five different approaches for threshold selection. Each of the methods considers a respective biologically-motivated criterion for electing a potentially suitable threshold. A set of 21 microarray experiments from different biological groups was used to investigate the effect of applying the five proposed criteria to several biological situations. For each experiment, we used the Pearson correlation coefficient to measure the relationship between each gene pair, and the resulting weight matrices were thresholded considering several values, generating respective adjacency matrices (co-expression networks). Each of the five proposed criteria was then applied in order to select the respective threshold value. The effects of these thresholding approaches on the topology of the resulting networks were compared by using several measurements, and we verified that, depending on the database, the impact on the topological properties can be large. However, a group of databases was verified to be similarly affected by most of the considered criteria. Based on such results, it can be suggested that when the generated networks present similar measurements, the thresholding method can be chosen with greater freedom. If the generated networks are markedly different, the thresholding method that better suits the interests of each specific research study represents a reasonable choice.</p>\",\"PeriodicalId\":90,\"journal\":{\"name\":\"Molecular BioSystems\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":3.7430,\"publicationDate\":\"2017-07-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1039/C7MB00101K\",\"citationCount\":\"10\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Molecular BioSystems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://pubs.rsc.org/en/content/articlelanding/2017/mb/c7mb00101k\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"Biochemistry, Genetics and Molecular Biology\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Molecular BioSystems","FirstCategoryId":"1085","ListUrlMain":"https://pubs.rsc.org/en/content/articlelanding/2017/mb/c7mb00101k","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Biochemistry, Genetics and Molecular Biology","Score":null,"Total":0}
引用次数: 10

摘要

最近报道了一些关于使用复杂网络理论分析基因共表达谱的进展。这种方法通常从构建一个未加权的基因共表达网络开始,因此需要选择一个合适的阈值来定义哪些顶点对将被连接。我们旨在通过建议和比较阈值选择的五种不同方法来解决这一重要问题。每种方法都考虑了各自的生物学动机标准,以选择潜在的合适阈值。采用来自不同生物群体的21个微阵列实验来研究将这五个标准应用于几种生物情况的效果。对于每个实验,我们使用Pearson相关系数来衡量每个基因对之间的关系,并根据几个值对得到的权重矩阵进行阈值化,生成相应的邻接矩阵(共表达网络)。然后应用五个建议标准中的每一个,以选择各自的阈值。通过使用几种测量来比较这些阈值方法对所得到的网络拓扑的影响,并且我们验证了,根据数据库的不同,对拓扑属性的影响可能很大。然而,一组数据库被证实同样受到大多数考虑的标准的影响。基于这些结果,可以建议,当生成的网络具有相似的测量值时,可以更自由地选择阈值方法。如果生成的网络明显不同,则更适合每个特定研究兴趣的阈值方法代表了合理的选择。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Effects of threshold on the topology of gene co-expression networks†

Effects of threshold on the topology of gene co-expression networks†

Several developments regarding the analysis of gene co-expression profiles using complex network theory have been reported recently. Such approaches usually start with the construction of an unweighted gene co-expression network, therefore requiring the selection of a suitable threshold defining which pairs of vertices will be connected. We aimed at addressing such an important problem by suggesting and comparing five different approaches for threshold selection. Each of the methods considers a respective biologically-motivated criterion for electing a potentially suitable threshold. A set of 21 microarray experiments from different biological groups was used to investigate the effect of applying the five proposed criteria to several biological situations. For each experiment, we used the Pearson correlation coefficient to measure the relationship between each gene pair, and the resulting weight matrices were thresholded considering several values, generating respective adjacency matrices (co-expression networks). Each of the five proposed criteria was then applied in order to select the respective threshold value. The effects of these thresholding approaches on the topology of the resulting networks were compared by using several measurements, and we verified that, depending on the database, the impact on the topological properties can be large. However, a group of databases was verified to be similarly affected by most of the considered criteria. Based on such results, it can be suggested that when the generated networks present similar measurements, the thresholding method can be chosen with greater freedom. If the generated networks are markedly different, the thresholding method that better suits the interests of each specific research study represents a reasonable choice.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Molecular BioSystems
Molecular BioSystems 生物-生化与分子生物学
CiteScore
2.94
自引率
0.00%
发文量
0
审稿时长
2.6 months
期刊介绍: Molecular Omics publishes molecular level experimental and bioinformatics research in the -omics sciences, including genomics, proteomics, transcriptomics and metabolomics. We will also welcome multidisciplinary papers presenting studies combining different types of omics, or the interface of omics and other fields such as systems biology or chemical biology.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信