Generating weighted and thresholded gene coexpression networks using signed distance correlation.

IF 1.4 Q2 SOCIAL SCIENCES, INTERDISCIPLINARY
Network Science Pub Date : 2022-06-01 Epub Date: 2022-06-16 DOI:10.1017/nws.2022.13
Javier Pardo-Diaz, Philip S Poole, Mariano Beguerisse-Díaz, Charlotte M Deane, Gesine Reinert
{"title":"Generating weighted and thresholded gene coexpression networks using signed distance correlation.","authors":"Javier Pardo-Diaz,&nbsp;Philip S Poole,&nbsp;Mariano Beguerisse-Díaz,&nbsp;Charlotte M Deane,&nbsp;Gesine Reinert","doi":"10.1017/nws.2022.13","DOIUrl":null,"url":null,"abstract":"<p><p>Even within well-studied organisms, many genes lack useful functional annotations. One way to generate such functional information is to infer biological relationships between genes or proteins, using a network of gene coexpression data that includes functional annotations. Signed distance correlation has proved useful for the construction of unweighted gene coexpression networks. However, transforming correlation values into unweighted networks may lead to a loss of important biological information related to the intensity of the correlation. Here we introduce a principled method to construct <i>weighted</i> gene coexpression networks using signed distance correlation. These networks contain weighted edges only between those pairs of genes whose correlation value is higher than a given threshold. We analyse data from different organisms and find that networks generated with our method based on signed distance correlation are more stable and capture more biological information compared to networks obtained from Pearson correlation. Moreover, we show that signed distance correlation networks capture more biological information than unweighted networks based on the same metric. While we use biological data sets to illustrate the method, the approach is general and can be used to construct networks in other domains. Code and data are available on https://github.com/javier-pardodiaz/sdcorGCN.</p>","PeriodicalId":51827,"journal":{"name":"Network Science","volume":null,"pages":null},"PeriodicalIF":1.4000,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7613200/pdf/EMS145046.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Network Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1017/nws.2022.13","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2022/6/16 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"SOCIAL SCIENCES, INTERDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

Even within well-studied organisms, many genes lack useful functional annotations. One way to generate such functional information is to infer biological relationships between genes or proteins, using a network of gene coexpression data that includes functional annotations. Signed distance correlation has proved useful for the construction of unweighted gene coexpression networks. However, transforming correlation values into unweighted networks may lead to a loss of important biological information related to the intensity of the correlation. Here we introduce a principled method to construct weighted gene coexpression networks using signed distance correlation. These networks contain weighted edges only between those pairs of genes whose correlation value is higher than a given threshold. We analyse data from different organisms and find that networks generated with our method based on signed distance correlation are more stable and capture more biological information compared to networks obtained from Pearson correlation. Moreover, we show that signed distance correlation networks capture more biological information than unweighted networks based on the same metric. While we use biological data sets to illustrate the method, the approach is general and can be used to construct networks in other domains. Code and data are available on https://github.com/javier-pardodiaz/sdcorGCN.

Abstract Image

使用符号距离相关生成加权和阈值基因共表达网络。
即使在经过充分研究的生物体中,许多基因也缺乏有用的功能注释。生成这种功能信息的一种方法是使用包含功能注释的基因共表达数据网络来推断基因或蛋白质之间的生物学关系。符号距离相关已被证明对构建非加权基因共表达网络是有用的。然而,将相关值转换为未加权的网络可能会导致丢失与相关强度相关的重要生物信息。本文介绍了一种利用符号距离相关构建加权基因共表达网络的基本方法。这些网络只包含那些相关值高于给定阈值的基因对之间的加权边。我们分析了来自不同生物的数据,发现与Pearson相关获得的网络相比,基于符号距离相关的方法生成的网络更稳定,捕获了更多的生物信息。此外,我们证明了基于相同度量的带符号距离相关网络比未加权网络捕获更多的生物信息。虽然我们使用生物数据集来说明该方法,但该方法是通用的,可以用于构建其他领域的网络。代码和数据可在https://github.com/javier-pardodiaz/sdcorGCN上获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Network Science
Network Science SOCIAL SCIENCES, INTERDISCIPLINARY-
CiteScore
3.50
自引率
5.90%
发文量
24
期刊介绍: Network Science is an important journal for an important discipline - one using the network paradigm, focusing on actors and relational linkages, to inform research, methodology, and applications from many fields across the natural, social, engineering and informational sciences. Given growing understanding of the interconnectedness and globalization of the world, network methods are an increasingly recognized way to research aspects of modern society along with the individuals, organizations, and other actors within it. The discipline is ready for a comprehensive journal, open to papers from all relevant areas. Network Science is a defining work, shaping this discipline. The journal welcomes contributions from researchers in all areas working on network theory, methods, and data.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信