Ronald D. Hagan, Brett D. Hagan, C. Phillips, B. Rhodes, M. Langston
{"title":"Compound Analytics using Combinatorics for Feature Selection: A Case Study in Biomarker Detection","authors":"Ronald D. Hagan, Brett D. Hagan, C. Phillips, B. Rhodes, M. Langston","doi":"10.1109/IPDPSW.2019.00050","DOIUrl":null,"url":null,"abstract":"Computer and data scientists are increasingly tasked with analyzing data growing at unprecedented rates. These data frequently involve a high level of dimensionality. In this work, we present a novel method for dimension reduction that combines statistical scoring with graph theoretical filtering to distill salient features for machine learning. We apply this method to the timely problem of detecting epigenetic biomarkers in DNA methylation data.","PeriodicalId":292054,"journal":{"name":"2019 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IPDPSW.2019.00050","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Computer and data scientists are increasingly tasked with analyzing data growing at unprecedented rates. These data frequently involve a high level of dimensionality. In this work, we present a novel method for dimension reduction that combines statistical scoring with graph theoretical filtering to distill salient features for machine learning. We apply this method to the timely problem of detecting epigenetic biomarkers in DNA methylation data.