Probabilistic Graphical Models and Deep Belief Networks for Prognosis of Breast Cancer

2015 IEEE 14th International Conference on Machine Learning and Applications (ICMLA) Pub Date : 2015-12-01 DOI:10.1109/ICMLA.2015.196

M. Khademi, N. Nedialkov

{"title":"Probabilistic Graphical Models and Deep Belief Networks for Prognosis of Breast Cancer","authors":"M. Khademi, N. Nedialkov","doi":"10.1109/ICMLA.2015.196","DOIUrl":null,"url":null,"abstract":"We propose a probabilistic graphical model (PGM) for prognosis and diagnosis of breast cancer. PGMs are suitable for building predictive models in medical applications, as they are powerful tools for making decisions under uncertainty from big data with missing attributes and noisy evidence. Previous work relied mostly on clinical data to create a predictive model. Moreover, practical knowledge of an expert was needed to build the structure of a model, which may not be accurate. In our opinion, since cancer is basically a genetic disease, the integration of microarray and clinical data can improve the accuracy of a predictive model. However, since microarray data is high-dimensional, including genomic variables may lead to poor results for structure and parameter learning due to the curse of dimensionality and small sample size problems. We address these problems by applying manifold learning and a deep belief network (DBN) to microarray data. First, we construct a PGM and a DBN using clinical and microarray data, and extract the structure of the clinical model automatically by applying a structure learning algorithm to the clinical data. Then, we integrate these two models using softmax nodes. Extensive experiments using real-world databases, such as METABRIC and NKI, show promising results in comparison to Support Vector Machines (SVMs) and k-Nearest Neighbors (k-NN) classifiers, for classifying tumors and predicting events like recurrence and metastasis.","PeriodicalId":288427,"journal":{"name":"2015 IEEE 14th International Conference on Machine Learning and Applications (ICMLA)","volume":"69 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"47","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 IEEE 14th International Conference on Machine Learning and Applications (ICMLA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMLA.2015.196","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 47

Abstract

We propose a probabilistic graphical model (PGM) for prognosis and diagnosis of breast cancer. PGMs are suitable for building predictive models in medical applications, as they are powerful tools for making decisions under uncertainty from big data with missing attributes and noisy evidence. Previous work relied mostly on clinical data to create a predictive model. Moreover, practical knowledge of an expert was needed to build the structure of a model, which may not be accurate. In our opinion, since cancer is basically a genetic disease, the integration of microarray and clinical data can improve the accuracy of a predictive model. However, since microarray data is high-dimensional, including genomic variables may lead to poor results for structure and parameter learning due to the curse of dimensionality and small sample size problems. We address these problems by applying manifold learning and a deep belief network (DBN) to microarray data. First, we construct a PGM and a DBN using clinical and microarray data, and extract the structure of the clinical model automatically by applying a structure learning algorithm to the clinical data. Then, we integrate these two models using softmax nodes. Extensive experiments using real-world databases, such as METABRIC and NKI, show promising results in comparison to Support Vector Machines (SVMs) and k-Nearest Neighbors (k-NN) classifiers, for classifying tumors and predicting events like recurrence and metastasis.

查看原文本刊更多论文

乳腺癌预后的概率图模型和深度信念网络

我们提出了一种乳腺癌预后和诊断的概率图模型(PGM)。pgm适用于在医疗应用中建立预测模型，因为它们是在属性缺失和证据嘈杂的大数据不确定情况下做出决策的强大工具。以前的工作主要依赖于临床数据来创建预测模型。此外，需要专家的实践知识来构建模型的结构，这可能不准确。我们认为，由于癌症基本上是一种遗传性疾病，将微阵列与临床数据相结合可以提高预测模型的准确性。然而，由于微阵列数据是高维的，包含基因组变量可能会由于维数的诅咒和小样本量问题导致结构和参数学习的结果不佳。我们通过将流形学习和深度信念网络(DBN)应用于微阵列数据来解决这些问题。首先，利用临床数据和微阵列数据构建PGM和DBN，并对临床数据应用结构学习算法自动提取临床模型的结构。然后，我们使用softmax节点对这两个模型进行整合。与支持向量机(svm)和k-近邻(k-NN)分类器相比，使用现实世界数据库(如METABRIC和NKI)进行的大量实验显示，在对肿瘤进行分类和预测复发和转移等事件方面，有希望的结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2015 IEEE 14th International Conference on Machine Learning and Applications (ICMLA)

自引率

0.00%

发文量