{"title":"不同分类方法预测乳腺癌亚型的比较","authors":"Jingru Xu, Peng Wu, Yuehui Chen, Li Zhang","doi":"10.1109/SPAC46244.2018.8965553","DOIUrl":null,"url":null,"abstract":"Breast cancer is one of the most common cancers among women. Due to heterogeneity of cancers, breast cancer is divided into different subtypes. Different subtypes have different molecular genesis, so the corresponding target cells and treatment plans are different. Identifying the correct cancer subtypes is important for cancer diagnosis and prognosis. Breast cancer subtypes can be divided into four types: Basal, Her2, Luminal A, and Luminal B. Many machine learning approaches have been applied to cancer subtypes classification in the past few years, we present a comparison of different classifiers K-Nearest Neighbor (KNN), Support Vector Machine (SVM), Multi-Layer Perception (MLP), and Multi-Grained Cascade Forest (gcForest) on The Cancer Genome Atlas (TCGA) databases of breast cancer. As we all know, biological data are high-dimensional and have small sample size, so before classification, we use subtype dependent feature selection method to reduce dimensionality of RNA-Seq gene expression data. Experimental results show that gcForest has a higher accuracy rate for breast cancer subtypes prediction compared with other classifiers.","PeriodicalId":360369,"journal":{"name":"2018 International Conference on Security, Pattern Analysis, and Cybernetics (SPAC)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Comparison of Different Classification Methods for Breast Cancer Subtypes Prediction\",\"authors\":\"Jingru Xu, Peng Wu, Yuehui Chen, Li Zhang\",\"doi\":\"10.1109/SPAC46244.2018.8965553\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Breast cancer is one of the most common cancers among women. Due to heterogeneity of cancers, breast cancer is divided into different subtypes. Different subtypes have different molecular genesis, so the corresponding target cells and treatment plans are different. Identifying the correct cancer subtypes is important for cancer diagnosis and prognosis. Breast cancer subtypes can be divided into four types: Basal, Her2, Luminal A, and Luminal B. Many machine learning approaches have been applied to cancer subtypes classification in the past few years, we present a comparison of different classifiers K-Nearest Neighbor (KNN), Support Vector Machine (SVM), Multi-Layer Perception (MLP), and Multi-Grained Cascade Forest (gcForest) on The Cancer Genome Atlas (TCGA) databases of breast cancer. As we all know, biological data are high-dimensional and have small sample size, so before classification, we use subtype dependent feature selection method to reduce dimensionality of RNA-Seq gene expression data. Experimental results show that gcForest has a higher accuracy rate for breast cancer subtypes prediction compared with other classifiers.\",\"PeriodicalId\":360369,\"journal\":{\"name\":\"2018 International Conference on Security, Pattern Analysis, and Cybernetics (SPAC)\",\"volume\":\"25 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 International Conference on Security, Pattern Analysis, and Cybernetics (SPAC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SPAC46244.2018.8965553\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 International Conference on Security, Pattern Analysis, and Cybernetics (SPAC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SPAC46244.2018.8965553","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Comparison of Different Classification Methods for Breast Cancer Subtypes Prediction
Breast cancer is one of the most common cancers among women. Due to heterogeneity of cancers, breast cancer is divided into different subtypes. Different subtypes have different molecular genesis, so the corresponding target cells and treatment plans are different. Identifying the correct cancer subtypes is important for cancer diagnosis and prognosis. Breast cancer subtypes can be divided into four types: Basal, Her2, Luminal A, and Luminal B. Many machine learning approaches have been applied to cancer subtypes classification in the past few years, we present a comparison of different classifiers K-Nearest Neighbor (KNN), Support Vector Machine (SVM), Multi-Layer Perception (MLP), and Multi-Grained Cascade Forest (gcForest) on The Cancer Genome Atlas (TCGA) databases of breast cancer. As we all know, biological data are high-dimensional and have small sample size, so before classification, we use subtype dependent feature selection method to reduce dimensionality of RNA-Seq gene expression data. Experimental results show that gcForest has a higher accuracy rate for breast cancer subtypes prediction compared with other classifiers.