{"title":"基于微阵列基因表达谱的癌症分类深度学习模型","authors":"Aiguo Wang, Qi Hu","doi":"10.1109/CCAI57533.2023.10201310","DOIUrl":null,"url":null,"abstract":"Gene expression profiles measured by microarray technology enables accurate identification of disease genes, prediction of cancers, and distinguishing tumor subtypes at the molecular level. However, these profiles are characterized by a small sample size and high dimensionality, which would inevitably degrade the performance of analysis models. In this study, we proposed a deep learning-based model to improve the prediction accuracy. Specifically, we first use the minimum redundancy maximum relevancy feature selector to discard irrelevant and noisy features. Then, we utilize a deep autoencoder to learn complex and nonlinear relationships among data. Finally, a predictor is trained on the latent representation to classify cancer. We conduct extensive experiments on four publicly available microarray datasets and compare the proposed model with six commonly used feature selectors using naïve bayes and decision tree in terms of accuracy and F1. Results demonstrate the superiority of the proposed model over its competitors.","PeriodicalId":285760,"journal":{"name":"2023 IEEE 3rd International Conference on Computer Communication and Artificial Intelligence (CCAI)","volume":"10 7","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Deep Learning Models for Cancer Classification from Microarray Gene Expression Profiles\",\"authors\":\"Aiguo Wang, Qi Hu\",\"doi\":\"10.1109/CCAI57533.2023.10201310\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Gene expression profiles measured by microarray technology enables accurate identification of disease genes, prediction of cancers, and distinguishing tumor subtypes at the molecular level. However, these profiles are characterized by a small sample size and high dimensionality, which would inevitably degrade the performance of analysis models. In this study, we proposed a deep learning-based model to improve the prediction accuracy. Specifically, we first use the minimum redundancy maximum relevancy feature selector to discard irrelevant and noisy features. Then, we utilize a deep autoencoder to learn complex and nonlinear relationships among data. Finally, a predictor is trained on the latent representation to classify cancer. We conduct extensive experiments on four publicly available microarray datasets and compare the proposed model with six commonly used feature selectors using naïve bayes and decision tree in terms of accuracy and F1. Results demonstrate the superiority of the proposed model over its competitors.\",\"PeriodicalId\":285760,\"journal\":{\"name\":\"2023 IEEE 3rd International Conference on Computer Communication and Artificial Intelligence (CCAI)\",\"volume\":\"10 7\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-05-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 IEEE 3rd International Conference on Computer Communication and Artificial Intelligence (CCAI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CCAI57533.2023.10201310\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE 3rd International Conference on Computer Communication and Artificial Intelligence (CCAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCAI57533.2023.10201310","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Deep Learning Models for Cancer Classification from Microarray Gene Expression Profiles
Gene expression profiles measured by microarray technology enables accurate identification of disease genes, prediction of cancers, and distinguishing tumor subtypes at the molecular level. However, these profiles are characterized by a small sample size and high dimensionality, which would inevitably degrade the performance of analysis models. In this study, we proposed a deep learning-based model to improve the prediction accuracy. Specifically, we first use the minimum redundancy maximum relevancy feature selector to discard irrelevant and noisy features. Then, we utilize a deep autoencoder to learn complex and nonlinear relationships among data. Finally, a predictor is trained on the latent representation to classify cancer. We conduct extensive experiments on four publicly available microarray datasets and compare the proposed model with six commonly used feature selectors using naïve bayes and decision tree in terms of accuracy and F1. Results demonstrate the superiority of the proposed model over its competitors.