Kaouthar Touchanti, Imad Ezzazi, M. Bekkali, Said Maser
{"title":"基于支持向量机的两阶段结肠癌分类特征选择框架","authors":"Kaouthar Touchanti, Imad Ezzazi, M. Bekkali, Said Maser","doi":"10.1109/ISCV54655.2022.9806115","DOIUrl":null,"url":null,"abstract":"As the colon cancer gene expression dataset is of high dimension, many irrelevant, redundant and noisy features might be included which may cause unprecedented challenges for data mining and machine learning algorithms. In this paper, we have proposed a new feature selection based method for colon cancer classification. First, we have used the ReliefF filter technique to provide a ranking in terms of the discriminatory ability of each feature. Second, since ReliefF cannot handle feature redundancy as well as feature interaction, another step is performed to select the best subset of gene expression profiles from the available 2K subsets. The proposed method has efficiently reduced the dimensionality of the colon dataset and increased the classification accuracy. The results from the Colon Cancer Gene Expression Data Set confirmed the effectiveness of the proposed method compared to advanced techniques.","PeriodicalId":426665,"journal":{"name":"2022 International Conference on Intelligent Systems and Computer Vision (ISCV)","volume":"39 22","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A 2-stages feature selection framework for colon cancer classification using SVM\",\"authors\":\"Kaouthar Touchanti, Imad Ezzazi, M. Bekkali, Said Maser\",\"doi\":\"10.1109/ISCV54655.2022.9806115\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"As the colon cancer gene expression dataset is of high dimension, many irrelevant, redundant and noisy features might be included which may cause unprecedented challenges for data mining and machine learning algorithms. In this paper, we have proposed a new feature selection based method for colon cancer classification. First, we have used the ReliefF filter technique to provide a ranking in terms of the discriminatory ability of each feature. Second, since ReliefF cannot handle feature redundancy as well as feature interaction, another step is performed to select the best subset of gene expression profiles from the available 2K subsets. The proposed method has efficiently reduced the dimensionality of the colon dataset and increased the classification accuracy. The results from the Colon Cancer Gene Expression Data Set confirmed the effectiveness of the proposed method compared to advanced techniques.\",\"PeriodicalId\":426665,\"journal\":{\"name\":\"2022 International Conference on Intelligent Systems and Computer Vision (ISCV)\",\"volume\":\"39 22\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-05-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 International Conference on Intelligent Systems and Computer Vision (ISCV)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISCV54655.2022.9806115\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Intelligent Systems and Computer Vision (ISCV)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISCV54655.2022.9806115","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A 2-stages feature selection framework for colon cancer classification using SVM
As the colon cancer gene expression dataset is of high dimension, many irrelevant, redundant and noisy features might be included which may cause unprecedented challenges for data mining and machine learning algorithms. In this paper, we have proposed a new feature selection based method for colon cancer classification. First, we have used the ReliefF filter technique to provide a ranking in terms of the discriminatory ability of each feature. Second, since ReliefF cannot handle feature redundancy as well as feature interaction, another step is performed to select the best subset of gene expression profiles from the available 2K subsets. The proposed method has efficiently reduced the dimensionality of the colon dataset and increased the classification accuracy. The results from the Colon Cancer Gene Expression Data Set confirmed the effectiveness of the proposed method compared to advanced techniques.