{"title":"Breast cancer risk prediction model based on C5.0 algorithm for postmenopausal women","authors":"Xia Zhang, Yingming Sun","doi":"10.1109/SPAC46244.2018.8965528","DOIUrl":null,"url":null,"abstract":"Breast cancer is one of the most common malignant tumors of women in the world, and it most happen in the elderly women, but in recent years the age of onset has become younger. As we know that, postmenopausal women are the groups with less research on breast cancer, and the characteristics of breast cancer are still to be explored. In this paper, based on the characteristic of 1031 postmenopausal women (⩾43 years old) breast cancer data, a breast cancer risk prediction model based on C5.0 algorithm was constructed and the model was optimized. The experimental results show that: a) Compared with machine learning methods such as neural network and support vector machine, C5.0 algorithm has better performance in constructing breast cancer risk prediction model; b) Costmatrix_C5.0 Model with cost matrix is better than adaptiveboosting_c5.0 model with Adaptive Enhancement algorithm; c) The risk of breast cancer is strongly correlated with post-menopausal hormones, age, age of menopause, history of benign breast disease and age of the first childbearing. This research is a practical application of data mining in the medical field and has certain reference value for the clinical diagnosis of breast cancer.","PeriodicalId":360369,"journal":{"name":"2018 International Conference on Security, Pattern Analysis, and Cybernetics (SPAC)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 International Conference on Security, Pattern Analysis, and Cybernetics (SPAC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SPAC46244.2018.8965528","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Breast cancer is one of the most common malignant tumors of women in the world, and it most happen in the elderly women, but in recent years the age of onset has become younger. As we know that, postmenopausal women are the groups with less research on breast cancer, and the characteristics of breast cancer are still to be explored. In this paper, based on the characteristic of 1031 postmenopausal women (⩾43 years old) breast cancer data, a breast cancer risk prediction model based on C5.0 algorithm was constructed and the model was optimized. The experimental results show that: a) Compared with machine learning methods such as neural network and support vector machine, C5.0 algorithm has better performance in constructing breast cancer risk prediction model; b) Costmatrix_C5.0 Model with cost matrix is better than adaptiveboosting_c5.0 model with Adaptive Enhancement algorithm; c) The risk of breast cancer is strongly correlated with post-menopausal hormones, age, age of menopause, history of benign breast disease and age of the first childbearing. This research is a practical application of data mining in the medical field and has certain reference value for the clinical diagnosis of breast cancer.