蛋白质相互作用网络中重叠功能模块的快速检测算法

P. Sun, Lin Gao
{"title":"蛋白质相互作用网络中重叠功能模块的快速检测算法","authors":"P. Sun, Lin Gao","doi":"10.1109/CIBCB.2009.4925735","DOIUrl":null,"url":null,"abstract":"Accumulating evidence suggests that biological systems are composed of interacting, separable, functional modules which is that groups of vertices within which connections are dense but between which they are sparse. Identifying these modules is likely to capture the biologically meaningful interactions. In recent years, many algorithms have been developed for detecting such structures. These algorithms however are computationally demanding, which limits their application. The existing deterministic methods used for large networks find separated modules, whereas most of the actual networks are made of highly overlapping cohesive groups of vertices. In this paper, we propose an iterative-clique percolation method (ICPM) for identifying overlapping modules in PPI (protein-protein interaction) networks. Our method is based on clique percolation method (CPM) which not only considers the degree of nodes to minimize the search space (The vertices in k-cliques must have the degree of k-1 at least), but also converts k-cliques to (k-1)-cliques. It uses (k-1)-cliques by appending one node to (k-1)-cliques for finding k-cliques. Furthermore, since the PPI network is noisy and still incomplete, some methods treat the PPI networks as weighted graphs in which each edge (e.g., interaction) is associated with a weight representing the probability or reliability of that interaction for preprocessing and purifying PPI data. Thus, we extend the ICPM into weighted networks which takes into account the link weights in a more delicate way by incorporating the subgraph intensity. We test our method on both computer-generated and PPI networks. Our analysis of the yeast PPI network suggests that most of these modules have well-supported biological significance in the context of protein localization, function annotation, protein complexes.","PeriodicalId":162052,"journal":{"name":"2009 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology","volume":"82 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Fast algorithms for detecting overlapping functional modules in protein-protein interaction networks\",\"authors\":\"P. Sun, Lin Gao\",\"doi\":\"10.1109/CIBCB.2009.4925735\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Accumulating evidence suggests that biological systems are composed of interacting, separable, functional modules which is that groups of vertices within which connections are dense but between which they are sparse. Identifying these modules is likely to capture the biologically meaningful interactions. In recent years, many algorithms have been developed for detecting such structures. These algorithms however are computationally demanding, which limits their application. The existing deterministic methods used for large networks find separated modules, whereas most of the actual networks are made of highly overlapping cohesive groups of vertices. In this paper, we propose an iterative-clique percolation method (ICPM) for identifying overlapping modules in PPI (protein-protein interaction) networks. Our method is based on clique percolation method (CPM) which not only considers the degree of nodes to minimize the search space (The vertices in k-cliques must have the degree of k-1 at least), but also converts k-cliques to (k-1)-cliques. It uses (k-1)-cliques by appending one node to (k-1)-cliques for finding k-cliques. Furthermore, since the PPI network is noisy and still incomplete, some methods treat the PPI networks as weighted graphs in which each edge (e.g., interaction) is associated with a weight representing the probability or reliability of that interaction for preprocessing and purifying PPI data. Thus, we extend the ICPM into weighted networks which takes into account the link weights in a more delicate way by incorporating the subgraph intensity. We test our method on both computer-generated and PPI networks. Our analysis of the yeast PPI network suggests that most of these modules have well-supported biological significance in the context of protein localization, function annotation, protein complexes.\",\"PeriodicalId\":162052,\"journal\":{\"name\":\"2009 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology\",\"volume\":\"82 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2009-03-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2009 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CIBCB.2009.4925735\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CIBCB.2009.4925735","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

越来越多的证据表明,生物系统是由相互作用的、可分离的、功能模块组成的,也就是说,一组顶点之间的连接是密集的,而另一组顶点之间的连接是稀疏的。识别这些模块可能会捕捉到生物学上有意义的相互作用。近年来,已经开发了许多算法来检测这种结构。然而,这些算法的计算要求很高,这限制了它们的应用。现有的用于大型网络的确定性方法寻找分离的模块,而大多数实际网络是由高度重叠的内聚点组组成的。在本文中,我们提出了一种迭代-团渗透方法(ICPM)来识别蛋白质-蛋白质相互作用网络中的重叠模块。我们的方法基于团团渗透法(clique percolation method, CPM),该方法不仅考虑节点的度以最小化搜索空间(k-cliques中的顶点必须至少具有k-1度),而且将k-cliques转换为(k-1)-cliques。它通过在(k-1)-cliques上附加一个节点来使用(k-1)-cliques来查找k-cliques。此外,由于PPI网络是有噪声的,并且仍然不完整,一些方法将PPI网络视为加权图,其中每个边(例如,相互作用)与表示该相互作用的概率或可靠性的权重相关联,用于预处理和纯化PPI数据。因此,我们将ICPM扩展到加权网络,该网络通过结合子图强度以更微妙的方式考虑链路权重。我们在计算机生成的网络和PPI网络上测试了我们的方法。我们对酵母PPI网络的分析表明,这些模块中的大多数在蛋白质定位、功能注释和蛋白质复合物方面具有很好的生物学意义。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Fast algorithms for detecting overlapping functional modules in protein-protein interaction networks
Accumulating evidence suggests that biological systems are composed of interacting, separable, functional modules which is that groups of vertices within which connections are dense but between which they are sparse. Identifying these modules is likely to capture the biologically meaningful interactions. In recent years, many algorithms have been developed for detecting such structures. These algorithms however are computationally demanding, which limits their application. The existing deterministic methods used for large networks find separated modules, whereas most of the actual networks are made of highly overlapping cohesive groups of vertices. In this paper, we propose an iterative-clique percolation method (ICPM) for identifying overlapping modules in PPI (protein-protein interaction) networks. Our method is based on clique percolation method (CPM) which not only considers the degree of nodes to minimize the search space (The vertices in k-cliques must have the degree of k-1 at least), but also converts k-cliques to (k-1)-cliques. It uses (k-1)-cliques by appending one node to (k-1)-cliques for finding k-cliques. Furthermore, since the PPI network is noisy and still incomplete, some methods treat the PPI networks as weighted graphs in which each edge (e.g., interaction) is associated with a weight representing the probability or reliability of that interaction for preprocessing and purifying PPI data. Thus, we extend the ICPM into weighted networks which takes into account the link weights in a more delicate way by incorporating the subgraph intensity. We test our method on both computer-generated and PPI networks. Our analysis of the yeast PPI network suggests that most of these modules have well-supported biological significance in the context of protein localization, function annotation, protein complexes.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信