{"title":"威斯康星诊断乳腺癌数据分类算法分析","authors":"Rhamadina Fitrah Umami, R. Sarno","doi":"10.1109/iSemantic50169.2020.9234295","DOIUrl":null,"url":null,"abstract":"Breast cancer is a disease that causes excessive fear in women around the world. The number of high death rates by breast cancer can be reduced by early detection. This can make breast cancer a disease that is easy to cure. A collection of datasets about breast cancer is used in the process of early detection. Early detection is carried out to analyze the state of the early stages of breast cancer patients. This research paper proposes machine learning methods, namely Generalized Linear Model, Logistic Regression, and Gradient Boosted Decision Tree to enhance the classification performance of Wisconsin Diagnostic Breast Cancer Data. The diagnosis results in two classes of cancer decisions which are malignant and benign by looking at evaluating the accuracy of the data classification test. The result shows that the Generalized Linear Model achieves the accuracy of 99.4%, which is higher than the accuracies of the previous studies for classifying the Wisconsin Diagnostic Breast Cancer dataset.","PeriodicalId":345558,"journal":{"name":"2020 International Seminar on Application for Technology of Information and Communication (iSemantic)","volume":"118 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Analysis of Classification Algorithm for Wisconsin Diagnosis Breast Cancer Data Study\",\"authors\":\"Rhamadina Fitrah Umami, R. Sarno\",\"doi\":\"10.1109/iSemantic50169.2020.9234295\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Breast cancer is a disease that causes excessive fear in women around the world. The number of high death rates by breast cancer can be reduced by early detection. This can make breast cancer a disease that is easy to cure. A collection of datasets about breast cancer is used in the process of early detection. Early detection is carried out to analyze the state of the early stages of breast cancer patients. This research paper proposes machine learning methods, namely Generalized Linear Model, Logistic Regression, and Gradient Boosted Decision Tree to enhance the classification performance of Wisconsin Diagnostic Breast Cancer Data. The diagnosis results in two classes of cancer decisions which are malignant and benign by looking at evaluating the accuracy of the data classification test. The result shows that the Generalized Linear Model achieves the accuracy of 99.4%, which is higher than the accuracies of the previous studies for classifying the Wisconsin Diagnostic Breast Cancer dataset.\",\"PeriodicalId\":345558,\"journal\":{\"name\":\"2020 International Seminar on Application for Technology of Information and Communication (iSemantic)\",\"volume\":\"118 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-09-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 International Seminar on Application for Technology of Information and Communication (iSemantic)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/iSemantic50169.2020.9234295\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Seminar on Application for Technology of Information and Communication (iSemantic)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/iSemantic50169.2020.9234295","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
摘要
乳腺癌是一种让世界各地的女性感到极度恐惧的疾病。乳腺癌的高死亡率可以通过早期发现而减少。这使得乳腺癌成为一种容易治愈的疾病。在早期检测过程中使用了一系列关于乳腺癌的数据集。开展早期检测,分析早期乳腺癌患者的状态。本文提出了机器学习方法,即广义线性模型、逻辑回归和梯度增强决策树来提高威斯康星乳腺癌诊断数据的分类性能。通过评估数据分类测试的准确性,诊断结果为恶性和良性两类癌症决策。结果表明,广义线性模型(Generalized Linear Model)的准确率达到99.4%,高于以往研究对Wisconsin Diagnostic Breast Cancer数据集进行分类的准确率。
Analysis of Classification Algorithm for Wisconsin Diagnosis Breast Cancer Data Study
Breast cancer is a disease that causes excessive fear in women around the world. The number of high death rates by breast cancer can be reduced by early detection. This can make breast cancer a disease that is easy to cure. A collection of datasets about breast cancer is used in the process of early detection. Early detection is carried out to analyze the state of the early stages of breast cancer patients. This research paper proposes machine learning methods, namely Generalized Linear Model, Logistic Regression, and Gradient Boosted Decision Tree to enhance the classification performance of Wisconsin Diagnostic Breast Cancer Data. The diagnosis results in two classes of cancer decisions which are malignant and benign by looking at evaluating the accuracy of the data classification test. The result shows that the Generalized Linear Model achieves the accuracy of 99.4%, which is higher than the accuracies of the previous studies for classifying the Wisconsin Diagnostic Breast Cancer dataset.