{"title":"Causal Structure Learning with Reduced Partial Correlation Thresholding","authors":"A. Sondhi, A. Shojaie","doi":"10.1109/DSAA.2016.68","DOIUrl":null,"url":null,"abstract":"Directed acyclic graphs (DAGs) are commonly used to represent causal relations within a large number of random variables. Estimating DAGs from observational data is a difficult task, it is often impossible to uniquely determine edge direction. The skeleton of the graph, where directions are removed from edges, is often estimated instead. We consider the task of estimating the skeleton of a potentially high-dimensional DAG consisting of Gaussian random variables. A drawback of existing methods is that a prohibitively large number of conditional independence relations need to be tested for. By exploiting properties of common random graph families, we develop a new algorithm that requires conditioning only on small sets of variables. By extending previous theoretical results for undirected graphs to the setting of directed graphs, we prove the consistency of our algorithm, and demonstrate improvements over the state-of-the-art alternative in low and high-dimensional simulation settings. We conclude by applying our proposed algorithm on a real gene expression data set.","PeriodicalId":193885,"journal":{"name":"2016 IEEE International Conference on Data Science and Advanced Analytics (DSAA)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE International Conference on Data Science and Advanced Analytics (DSAA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DSAA.2016.68","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Directed acyclic graphs (DAGs) are commonly used to represent causal relations within a large number of random variables. Estimating DAGs from observational data is a difficult task, it is often impossible to uniquely determine edge direction. The skeleton of the graph, where directions are removed from edges, is often estimated instead. We consider the task of estimating the skeleton of a potentially high-dimensional DAG consisting of Gaussian random variables. A drawback of existing methods is that a prohibitively large number of conditional independence relations need to be tested for. By exploiting properties of common random graph families, we develop a new algorithm that requires conditioning only on small sets of variables. By extending previous theoretical results for undirected graphs to the setting of directed graphs, we prove the consistency of our algorithm, and demonstrate improvements over the state-of-the-art alternative in low and high-dimensional simulation settings. We conclude by applying our proposed algorithm on a real gene expression data set.