不同分类方法预测乳腺癌亚型的比较

Jingru Xu, Peng Wu, Yuehui Chen, Li Zhang
{"title":"不同分类方法预测乳腺癌亚型的比较","authors":"Jingru Xu, Peng Wu, Yuehui Chen, Li Zhang","doi":"10.1109/SPAC46244.2018.8965553","DOIUrl":null,"url":null,"abstract":"Breast cancer is one of the most common cancers among women. Due to heterogeneity of cancers, breast cancer is divided into different subtypes. Different subtypes have different molecular genesis, so the corresponding target cells and treatment plans are different. Identifying the correct cancer subtypes is important for cancer diagnosis and prognosis. Breast cancer subtypes can be divided into four types: Basal, Her2, Luminal A, and Luminal B. Many machine learning approaches have been applied to cancer subtypes classification in the past few years, we present a comparison of different classifiers K-Nearest Neighbor (KNN), Support Vector Machine (SVM), Multi-Layer Perception (MLP), and Multi-Grained Cascade Forest (gcForest) on The Cancer Genome Atlas (TCGA) databases of breast cancer. As we all know, biological data are high-dimensional and have small sample size, so before classification, we use subtype dependent feature selection method to reduce dimensionality of RNA-Seq gene expression data. Experimental results show that gcForest has a higher accuracy rate for breast cancer subtypes prediction compared with other classifiers.","PeriodicalId":360369,"journal":{"name":"2018 International Conference on Security, Pattern Analysis, and Cybernetics (SPAC)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Comparison of Different Classification Methods for Breast Cancer Subtypes Prediction\",\"authors\":\"Jingru Xu, Peng Wu, Yuehui Chen, Li Zhang\",\"doi\":\"10.1109/SPAC46244.2018.8965553\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Breast cancer is one of the most common cancers among women. Due to heterogeneity of cancers, breast cancer is divided into different subtypes. Different subtypes have different molecular genesis, so the corresponding target cells and treatment plans are different. Identifying the correct cancer subtypes is important for cancer diagnosis and prognosis. Breast cancer subtypes can be divided into four types: Basal, Her2, Luminal A, and Luminal B. Many machine learning approaches have been applied to cancer subtypes classification in the past few years, we present a comparison of different classifiers K-Nearest Neighbor (KNN), Support Vector Machine (SVM), Multi-Layer Perception (MLP), and Multi-Grained Cascade Forest (gcForest) on The Cancer Genome Atlas (TCGA) databases of breast cancer. As we all know, biological data are high-dimensional and have small sample size, so before classification, we use subtype dependent feature selection method to reduce dimensionality of RNA-Seq gene expression data. Experimental results show that gcForest has a higher accuracy rate for breast cancer subtypes prediction compared with other classifiers.\",\"PeriodicalId\":360369,\"journal\":{\"name\":\"2018 International Conference on Security, Pattern Analysis, and Cybernetics (SPAC)\",\"volume\":\"25 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 International Conference on Security, Pattern Analysis, and Cybernetics (SPAC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SPAC46244.2018.8965553\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 International Conference on Security, Pattern Analysis, and Cybernetics (SPAC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SPAC46244.2018.8965553","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

乳腺癌是女性中最常见的癌症之一。由于癌症的异质性,乳腺癌被分为不同的亚型。不同亚型的分子发生机制不同,相应的靶细胞和治疗方案也不同。确定正确的癌症亚型对癌症的诊断和预后非常重要。乳腺癌亚型可分为基底、Her2、Luminal A和Luminal b四种类型。在过去的几年中,许多机器学习方法已经应用于癌症亚型分类,我们在乳腺癌癌症基因组图集(TCGA)数据库上比较了k -最近邻(KNN)、支持向量机(SVM)、多层感知(MLP)和多粒度级联森林(gcForest)的不同分类器。众所周知,生物数据是高维的,样本量小,因此在分类之前,我们使用亚型依赖特征选择方法对RNA-Seq基因表达数据进行降维处理。实验结果表明,与其他分类器相比,gcForest对乳腺癌亚型的预测准确率更高。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Comparison of Different Classification Methods for Breast Cancer Subtypes Prediction
Breast cancer is one of the most common cancers among women. Due to heterogeneity of cancers, breast cancer is divided into different subtypes. Different subtypes have different molecular genesis, so the corresponding target cells and treatment plans are different. Identifying the correct cancer subtypes is important for cancer diagnosis and prognosis. Breast cancer subtypes can be divided into four types: Basal, Her2, Luminal A, and Luminal B. Many machine learning approaches have been applied to cancer subtypes classification in the past few years, we present a comparison of different classifiers K-Nearest Neighbor (KNN), Support Vector Machine (SVM), Multi-Layer Perception (MLP), and Multi-Grained Cascade Forest (gcForest) on The Cancer Genome Atlas (TCGA) databases of breast cancer. As we all know, biological data are high-dimensional and have small sample size, so before classification, we use subtype dependent feature selection method to reduce dimensionality of RNA-Seq gene expression data. Experimental results show that gcForest has a higher accuracy rate for breast cancer subtypes prediction compared with other classifiers.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信