Lei Qiu, Yuan Li, Jing Sun, Jinsheng Liu, Yuhai Zhao
{"title":"Finding Global Contrast Core Subgraphs in Large-Scale Genetic Association Study","authors":"Lei Qiu, Yuan Li, Jing Sun, Jinsheng Liu, Yuhai Zhao","doi":"10.1109/IUCC/DSCI/SmartCNS.2019.00069","DOIUrl":null,"url":null,"abstract":"Genetic association study (GAS) is crucial to reveal the underlying principles of complex diseases. This is the first work that introduces contrast subgraph mining in two networks with significantly different edges or edge weights but the same vertices to solve GAS problem. It can not only detect the genetic loci highly associated with certain diseases, but discriminate between disease causing and disease preventing genetic loci, which captures more comprehensive and informative value for biologists. Inspired by the concept of r-core and minimum vertex weight, we propose to identify the novel global contrast r-core subgraphs r-GCCSs, which is more robust to outliers and redundancy. Further, we formulate the top-k r-GCCSs detection problem based on global contrast measure. In particular, (1) a linear time search algorithm is carefully developed to find the top-k r-GCCSs; (2) To further reduce the high computational cost, a linear space index is devised to support the top-k search. Comprehensive experiments on four large-scale real datasets demonstrate the efficiency and effectiveness of our approaches.","PeriodicalId":410905,"journal":{"name":"2019 IEEE International Conferences on Ubiquitous Computing & Communications (IUCC) and Data Science and Computational Intelligence (DSCI) and Smart Computing, Networking and Services (SmartCNS)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE International Conferences on Ubiquitous Computing & Communications (IUCC) and Data Science and Computational Intelligence (DSCI) and Smart Computing, Networking and Services (SmartCNS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IUCC/DSCI/SmartCNS.2019.00069","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Genetic association study (GAS) is crucial to reveal the underlying principles of complex diseases. This is the first work that introduces contrast subgraph mining in two networks with significantly different edges or edge weights but the same vertices to solve GAS problem. It can not only detect the genetic loci highly associated with certain diseases, but discriminate between disease causing and disease preventing genetic loci, which captures more comprehensive and informative value for biologists. Inspired by the concept of r-core and minimum vertex weight, we propose to identify the novel global contrast r-core subgraphs r-GCCSs, which is more robust to outliers and redundancy. Further, we formulate the top-k r-GCCSs detection problem based on global contrast measure. In particular, (1) a linear time search algorithm is carefully developed to find the top-k r-GCCSs; (2) To further reduce the high computational cost, a linear space index is devised to support the top-k search. Comprehensive experiments on four large-scale real datasets demonstrate the efficiency and effectiveness of our approaches.