基于微阵列基因表达数据的半监督模糊K-NN癌症分类

A. Halder, S. Misra
{"title":"基于微阵列基因表达数据的半监督模糊K-NN癌症分类","authors":"A. Halder, S. Misra","doi":"10.1109/ACES.2014.6808013","DOIUrl":null,"url":null,"abstract":"Cancer classification from microarray gene expression data is a challenging task in computational biology and bioinformatics as the sufficient number of labeled samples (required to train the traditional classifiers) are very expensive and difficult to collect. Therefore, the predication accuracies of the classifiers trained with limited training samples are often very low. Although, the unlabeled samples are relatively inexpensive and readily available, traditional classifiers not generally utilize the distribution of those unlabeled samples. In this context, this article presents a novel `self-training' based semi-supervised classification method using fuzzy K-Nearest Neighbour algorithm which utilizes the unlabeled samples along with the labeled samples to improve the prediction accuracy of the cancer classification. The proposed method is evaluated with a number of microarray gene expression cancer data sets. Experimental results justify the potentiality of the proposed semi-supervised method for cancer classification using microarray gene expression data in comparison to its other supervised counterparts.","PeriodicalId":353124,"journal":{"name":"2014 First International Conference on Automation, Control, Energy and Systems (ACES)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":"{\"title\":\"Semi-supervised fuzzy K-NN for cancer classification from microarray gene expression data\",\"authors\":\"A. Halder, S. Misra\",\"doi\":\"10.1109/ACES.2014.6808013\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Cancer classification from microarray gene expression data is a challenging task in computational biology and bioinformatics as the sufficient number of labeled samples (required to train the traditional classifiers) are very expensive and difficult to collect. Therefore, the predication accuracies of the classifiers trained with limited training samples are often very low. Although, the unlabeled samples are relatively inexpensive and readily available, traditional classifiers not generally utilize the distribution of those unlabeled samples. In this context, this article presents a novel `self-training' based semi-supervised classification method using fuzzy K-Nearest Neighbour algorithm which utilizes the unlabeled samples along with the labeled samples to improve the prediction accuracy of the cancer classification. The proposed method is evaluated with a number of microarray gene expression cancer data sets. Experimental results justify the potentiality of the proposed semi-supervised method for cancer classification using microarray gene expression data in comparison to its other supervised counterparts.\",\"PeriodicalId\":353124,\"journal\":{\"name\":\"2014 First International Conference on Automation, Control, Energy and Systems (ACES)\",\"volume\":\"26 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"11\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2014 First International Conference on Automation, Control, Energy and Systems (ACES)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ACES.2014.6808013\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 First International Conference on Automation, Control, Energy and Systems (ACES)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ACES.2014.6808013","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 11

摘要

从微阵列基因表达数据中进行癌症分类在计算生物学和生物信息学中是一项具有挑战性的任务,因为足够数量的标记样本(训练传统分类器所需)非常昂贵且难以收集。因此,用有限的训练样本训练的分类器的预测精度往往很低。尽管未标记的样本相对便宜且容易获得,但传统的分类器通常不利用这些未标记样本的分布。在此背景下,本文提出了一种新的基于“自我训练”的半监督分类方法,该方法使用模糊k近邻算法,将未标记样本与标记样本一起使用,以提高癌症分类的预测精度。所提出的方法是用一些微阵列基因表达癌症数据集进行评估。实验结果证明,与其他有监督的方法相比,利用微阵列基因表达数据进行癌症分类的半监督方法具有潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Semi-supervised fuzzy K-NN for cancer classification from microarray gene expression data
Cancer classification from microarray gene expression data is a challenging task in computational biology and bioinformatics as the sufficient number of labeled samples (required to train the traditional classifiers) are very expensive and difficult to collect. Therefore, the predication accuracies of the classifiers trained with limited training samples are often very low. Although, the unlabeled samples are relatively inexpensive and readily available, traditional classifiers not generally utilize the distribution of those unlabeled samples. In this context, this article presents a novel `self-training' based semi-supervised classification method using fuzzy K-Nearest Neighbour algorithm which utilizes the unlabeled samples along with the labeled samples to improve the prediction accuracy of the cancer classification. The proposed method is evaluated with a number of microarray gene expression cancer data sets. Experimental results justify the potentiality of the proposed semi-supervised method for cancer classification using microarray gene expression data in comparison to its other supervised counterparts.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信