Mehdi Joodaki, Nasser Ghadiri, Amir Hossein Atashkar
{"title":"在Apache Spark上从PPI网络中检测蛋白质复合物","authors":"Mehdi Joodaki, Nasser Ghadiri, Amir Hossein Atashkar","doi":"10.1109/IKT.2017.8258627","DOIUrl":null,"url":null,"abstract":"Protein-Protein Interaction (PPI) network is a network of biomolecular interactions which plays a major role in modeling and analyzing biological activities. Studies of functional modules from PPI networks provide a better understanding of biological mechanisms. Recent advances in both biological and computer sciences demands for the vast amount of PPI networks data to be processed by experimental and computational methods. This could be a great challenge to find functional modules within these large networks. Existing methods are used to identify the functional modules, but some of them do not consider overlapping between functional module clusters. Moreover, most of the methods run on a single machine. Also, many existing algorithms only focus on topological features of PPI networks. In this paper, we introduce a new way for detecting the functional modules. It considers overlapping between clusters and runs on Apache Spark — a distributed processing platform. Our algorithm also considers both topological and biological features of PPI networks. The evaluation results show improved execution speed as well as more accurate results compared to classic methods.","PeriodicalId":338914,"journal":{"name":"2017 9th International Conference on Information and Knowledge Technology (IKT)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Protein complex detection from PPI networks on Apache Spark\",\"authors\":\"Mehdi Joodaki, Nasser Ghadiri, Amir Hossein Atashkar\",\"doi\":\"10.1109/IKT.2017.8258627\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Protein-Protein Interaction (PPI) network is a network of biomolecular interactions which plays a major role in modeling and analyzing biological activities. Studies of functional modules from PPI networks provide a better understanding of biological mechanisms. Recent advances in both biological and computer sciences demands for the vast amount of PPI networks data to be processed by experimental and computational methods. This could be a great challenge to find functional modules within these large networks. Existing methods are used to identify the functional modules, but some of them do not consider overlapping between functional module clusters. Moreover, most of the methods run on a single machine. Also, many existing algorithms only focus on topological features of PPI networks. In this paper, we introduce a new way for detecting the functional modules. It considers overlapping between clusters and runs on Apache Spark — a distributed processing platform. Our algorithm also considers both topological and biological features of PPI networks. The evaluation results show improved execution speed as well as more accurate results compared to classic methods.\",\"PeriodicalId\":338914,\"journal\":{\"name\":\"2017 9th International Conference on Information and Knowledge Technology (IKT)\",\"volume\":\"11 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 9th International Conference on Information and Knowledge Technology (IKT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IKT.2017.8258627\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 9th International Conference on Information and Knowledge Technology (IKT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IKT.2017.8258627","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Protein complex detection from PPI networks on Apache Spark
Protein-Protein Interaction (PPI) network is a network of biomolecular interactions which plays a major role in modeling and analyzing biological activities. Studies of functional modules from PPI networks provide a better understanding of biological mechanisms. Recent advances in both biological and computer sciences demands for the vast amount of PPI networks data to be processed by experimental and computational methods. This could be a great challenge to find functional modules within these large networks. Existing methods are used to identify the functional modules, but some of them do not consider overlapping between functional module clusters. Moreover, most of the methods run on a single machine. Also, many existing algorithms only focus on topological features of PPI networks. In this paper, we introduce a new way for detecting the functional modules. It considers overlapping between clusters and runs on Apache Spark — a distributed processing platform. Our algorithm also considers both topological and biological features of PPI networks. The evaluation results show improved execution speed as well as more accurate results compared to classic methods.