Kai Che, Xiaoyan Liu, Maozu Guo, Junwei Zhang, Lei Wang, Yin Zhang
{"title":"Epistasis detection using a permutation-based Gradient Boosting Machine","authors":"Kai Che, Xiaoyan Liu, Maozu Guo, Junwei Zhang, Lei Wang, Yin Zhang","doi":"10.1109/BIBM.2016.7822697","DOIUrl":null,"url":null,"abstract":"Detecting single nucleotide polymorphism (SNP) epistasis contributes to understand disease susceptibility and discover disease pathogenesis underlying complex disease. In this paper, we propose an approach called permutation-based Gradient Boosting Machine (pGBM) to detect pure epistasis by estimating the power of a GBM classifier which is influenced by permuting SNP pairs. pGBM is based on two permutation strategies and gradient boosting machine model. To extend pGBM to detect pure epistasis well on unbalanced dataset, average AUC difference value is chosen as the metric that quantifies the SNP interactions intensity. The experiment results demonstrate that our method has a high success rate with both balanced/unbalanced simulation and real dataset. In addition, pGBM shows great potential to detect pure SNP epistasis to uncover more complex disease pathogenesis.","PeriodicalId":345384,"journal":{"name":"2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BIBM.2016.7822697","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
Detecting single nucleotide polymorphism (SNP) epistasis contributes to understand disease susceptibility and discover disease pathogenesis underlying complex disease. In this paper, we propose an approach called permutation-based Gradient Boosting Machine (pGBM) to detect pure epistasis by estimating the power of a GBM classifier which is influenced by permuting SNP pairs. pGBM is based on two permutation strategies and gradient boosting machine model. To extend pGBM to detect pure epistasis well on unbalanced dataset, average AUC difference value is chosen as the metric that quantifies the SNP interactions intensity. The experiment results demonstrate that our method has a high success rate with both balanced/unbalanced simulation and real dataset. In addition, pGBM shows great potential to detect pure SNP epistasis to uncover more complex disease pathogenesis.