{"title":"使用Ml算法的乳腺癌分类","authors":"Sara Shehab, A. Keshk","doi":"10.21608/kjis.2022.159008.1010","DOIUrl":null,"url":null,"abstract":": One of the most top diseases nowadays is breast cancer that causes death for many women over the world. Breast cancer is a heterogeneous disease defined by molecular types and subtypes. Artificial intelligence has an effect role in detecting and classification the breast cancer. In this work 13 classification method are used like Support Vector Machine, AdaBoost, MLP classifier and others. This work is evaluated using three keys accuracy, cross validation score and execution time. The results detect that Linear SVC Support Vector Machine achieved high accuracy (98.25%) and Random Forest and AdaBoost achieved high cross validation score (97.01%) when compared with other classification methods. Whereas Gaussian NB classifier achieved minimum execution time (0.01 seconds). A data set with 31 feature and 570 records are used for testing the algorithms. 20% of data set will be used in testing and 80% for training. The proposed work achieves high accuracy when compared with the previous works.","PeriodicalId":115907,"journal":{"name":"Kafrelsheikh Journal of Information Sciences","volume":"92 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Breast Cancer Classification Using Ml Algorithms\",\"authors\":\"Sara Shehab, A. Keshk\",\"doi\":\"10.21608/kjis.2022.159008.1010\",\"DOIUrl\":null,\"url\":null,\"abstract\":\": One of the most top diseases nowadays is breast cancer that causes death for many women over the world. Breast cancer is a heterogeneous disease defined by molecular types and subtypes. Artificial intelligence has an effect role in detecting and classification the breast cancer. In this work 13 classification method are used like Support Vector Machine, AdaBoost, MLP classifier and others. This work is evaluated using three keys accuracy, cross validation score and execution time. The results detect that Linear SVC Support Vector Machine achieved high accuracy (98.25%) and Random Forest and AdaBoost achieved high cross validation score (97.01%) when compared with other classification methods. Whereas Gaussian NB classifier achieved minimum execution time (0.01 seconds). A data set with 31 feature and 570 records are used for testing the algorithms. 20% of data set will be used in testing and 80% for training. The proposed work achieves high accuracy when compared with the previous works.\",\"PeriodicalId\":115907,\"journal\":{\"name\":\"Kafrelsheikh Journal of Information Sciences\",\"volume\":\"92 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-06-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Kafrelsheikh Journal of Information Sciences\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.21608/kjis.2022.159008.1010\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Kafrelsheikh Journal of Information Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.21608/kjis.2022.159008.1010","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
: One of the most top diseases nowadays is breast cancer that causes death for many women over the world. Breast cancer is a heterogeneous disease defined by molecular types and subtypes. Artificial intelligence has an effect role in detecting and classification the breast cancer. In this work 13 classification method are used like Support Vector Machine, AdaBoost, MLP classifier and others. This work is evaluated using three keys accuracy, cross validation score and execution time. The results detect that Linear SVC Support Vector Machine achieved high accuracy (98.25%) and Random Forest and AdaBoost achieved high cross validation score (97.01%) when compared with other classification methods. Whereas Gaussian NB classifier achieved minimum execution time (0.01 seconds). A data set with 31 feature and 570 records are used for testing the algorithms. 20% of data set will be used in testing and 80% for training. The proposed work achieves high accuracy when compared with the previous works.