{"title":"基于微阵列基因表达数据的半监督模糊K-NN癌症分类","authors":"A. Halder, S. Misra","doi":"10.1109/ACES.2014.6808013","DOIUrl":null,"url":null,"abstract":"Cancer classification from microarray gene expression data is a challenging task in computational biology and bioinformatics as the sufficient number of labeled samples (required to train the traditional classifiers) are very expensive and difficult to collect. Therefore, the predication accuracies of the classifiers trained with limited training samples are often very low. Although, the unlabeled samples are relatively inexpensive and readily available, traditional classifiers not generally utilize the distribution of those unlabeled samples. In this context, this article presents a novel `self-training' based semi-supervised classification method using fuzzy K-Nearest Neighbour algorithm which utilizes the unlabeled samples along with the labeled samples to improve the prediction accuracy of the cancer classification. The proposed method is evaluated with a number of microarray gene expression cancer data sets. Experimental results justify the potentiality of the proposed semi-supervised method for cancer classification using microarray gene expression data in comparison to its other supervised counterparts.","PeriodicalId":353124,"journal":{"name":"2014 First International Conference on Automation, Control, Energy and Systems (ACES)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":"{\"title\":\"Semi-supervised fuzzy K-NN for cancer classification from microarray gene expression data\",\"authors\":\"A. Halder, S. Misra\",\"doi\":\"10.1109/ACES.2014.6808013\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Cancer classification from microarray gene expression data is a challenging task in computational biology and bioinformatics as the sufficient number of labeled samples (required to train the traditional classifiers) are very expensive and difficult to collect. Therefore, the predication accuracies of the classifiers trained with limited training samples are often very low. Although, the unlabeled samples are relatively inexpensive and readily available, traditional classifiers not generally utilize the distribution of those unlabeled samples. In this context, this article presents a novel `self-training' based semi-supervised classification method using fuzzy K-Nearest Neighbour algorithm which utilizes the unlabeled samples along with the labeled samples to improve the prediction accuracy of the cancer classification. The proposed method is evaluated with a number of microarray gene expression cancer data sets. Experimental results justify the potentiality of the proposed semi-supervised method for cancer classification using microarray gene expression data in comparison to its other supervised counterparts.\",\"PeriodicalId\":353124,\"journal\":{\"name\":\"2014 First International Conference on Automation, Control, Energy and Systems (ACES)\",\"volume\":\"26 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"11\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2014 First International Conference on Automation, Control, Energy and Systems (ACES)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ACES.2014.6808013\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 First International Conference on Automation, Control, Energy and Systems (ACES)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ACES.2014.6808013","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Semi-supervised fuzzy K-NN for cancer classification from microarray gene expression data
Cancer classification from microarray gene expression data is a challenging task in computational biology and bioinformatics as the sufficient number of labeled samples (required to train the traditional classifiers) are very expensive and difficult to collect. Therefore, the predication accuracies of the classifiers trained with limited training samples are often very low. Although, the unlabeled samples are relatively inexpensive and readily available, traditional classifiers not generally utilize the distribution of those unlabeled samples. In this context, this article presents a novel `self-training' based semi-supervised classification method using fuzzy K-Nearest Neighbour algorithm which utilizes the unlabeled samples along with the labeled samples to improve the prediction accuracy of the cancer classification. The proposed method is evaluated with a number of microarray gene expression cancer data sets. Experimental results justify the potentiality of the proposed semi-supervised method for cancer classification using microarray gene expression data in comparison to its other supervised counterparts.