{"title":"基于集成的人工神经网络DNA芯片缺失值估计","authors":"Sujay Saha, Saikat Bandopadhyay, A. Ghosh, K. Dey","doi":"10.1109/ICRCICN.2016.7813671","DOIUrl":null,"url":null,"abstract":"DNA microarrays are normally used to measure the expression values of thousands of several genes simultaneously in the form of large matrices. This raw gene expression data may contain some missing cells. These missing values may affect the analysis performed subsequently on these gene expression data. Several imputation methods, like K-Nearest Neighbor Imputation (KNNImpute), Singular Value Decomposition Imputation (SVDImpute), Local Least Square Imputation (LLSImpute), Bayesian Principal Component Analysis (BPCAImpute) etc. have already been proposed to impute those missing values. In this work we have proposed an ensemble classifier based Artificial Neural Network implementation, ANNImpute, to enhance the accuracy of the missing value imputation technique by applying Two Layer Perceptron Learning algorithm. Ensemble classification is done on the parameters such as learning rate a, weight vector & bias. We have applied our algorithm on two benchmark datasets like SPELLMAN and Tumour (GDS2932) and the results show that this approach performs well compared to the other existing methods as far as RMSE measures are concerned.","PeriodicalId":254393,"journal":{"name":"2016 Second International Conference on Research in Computational Intelligence and Communication Networks (ICRCICN)","volume":"111 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"An ensemble based missing value estimation in DNA microarray using artificial neural network\",\"authors\":\"Sujay Saha, Saikat Bandopadhyay, A. Ghosh, K. Dey\",\"doi\":\"10.1109/ICRCICN.2016.7813671\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"DNA microarrays are normally used to measure the expression values of thousands of several genes simultaneously in the form of large matrices. This raw gene expression data may contain some missing cells. These missing values may affect the analysis performed subsequently on these gene expression data. Several imputation methods, like K-Nearest Neighbor Imputation (KNNImpute), Singular Value Decomposition Imputation (SVDImpute), Local Least Square Imputation (LLSImpute), Bayesian Principal Component Analysis (BPCAImpute) etc. have already been proposed to impute those missing values. In this work we have proposed an ensemble classifier based Artificial Neural Network implementation, ANNImpute, to enhance the accuracy of the missing value imputation technique by applying Two Layer Perceptron Learning algorithm. Ensemble classification is done on the parameters such as learning rate a, weight vector & bias. We have applied our algorithm on two benchmark datasets like SPELLMAN and Tumour (GDS2932) and the results show that this approach performs well compared to the other existing methods as far as RMSE measures are concerned.\",\"PeriodicalId\":254393,\"journal\":{\"name\":\"2016 Second International Conference on Research in Computational Intelligence and Communication Networks (ICRCICN)\",\"volume\":\"111 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 Second International Conference on Research in Computational Intelligence and Communication Networks (ICRCICN)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICRCICN.2016.7813671\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 Second International Conference on Research in Computational Intelligence and Communication Networks (ICRCICN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICRCICN.2016.7813671","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
An ensemble based missing value estimation in DNA microarray using artificial neural network
DNA microarrays are normally used to measure the expression values of thousands of several genes simultaneously in the form of large matrices. This raw gene expression data may contain some missing cells. These missing values may affect the analysis performed subsequently on these gene expression data. Several imputation methods, like K-Nearest Neighbor Imputation (KNNImpute), Singular Value Decomposition Imputation (SVDImpute), Local Least Square Imputation (LLSImpute), Bayesian Principal Component Analysis (BPCAImpute) etc. have already been proposed to impute those missing values. In this work we have proposed an ensemble classifier based Artificial Neural Network implementation, ANNImpute, to enhance the accuracy of the missing value imputation technique by applying Two Layer Perceptron Learning algorithm. Ensemble classification is done on the parameters such as learning rate a, weight vector & bias. We have applied our algorithm on two benchmark datasets like SPELLMAN and Tumour (GDS2932) and the results show that this approach performs well compared to the other existing methods as far as RMSE measures are concerned.