Lung Cancer Disease Prediction and Classification based on Feature Selection method using Bayesian Network, Logistic Regression, J48, Random Forest, and Naïve Bayes Algorithms

J. Viji Cripsy, T. Divya
Published in: 2023 3rd International Conference on Smart Data Intelligence (ICSMDI), 2023-03-01
DOI: https://doi.org/10.1109/ICSMDI57622.2023.00066

Abstract

People who have never smoked can get lung cancer, but smokers have a higher risk than non-smokers. Lung cancer can start anywhere in the lungs and can affect any part of the respiratory system. Different classification methods are used for lung cancer prediction. This article uses five classification algorithms to predict lung cancer in patients using a Kaggle dataset: Bayesian Network, Logistic Regression, J48, Random Forest, and Naive Bayes. The quality of each result was measured in the WEKA tool from the counts of correctly and incorrectly classified cases. The experimental results showed that Logistic Regression performed best (91.90%), followed by Random Forest (90.93%), Naive Bayes (90.29%), Bayesian Network (88.34%), and J48 (86.08%).
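
The evaluation described above can be illustrated with a minimal sketch. It assumes the reported percentages are the share of correctly classified cases; the abstract does not name the metric explicitly. The function names and the toy labels are hypothetical and are not taken from the paper or its Kaggle dataset. Only the five scores in `reported` come from the abstract.

```python
from collections import Counter


def percent_correct(actual, predicted):
    """Share of cases whose predicted label equals the actual label, in percent."""
    if len(actual) != len(predicted):
        raise ValueError("actual and predicted must have the same length")
    if not actual:
        raise ValueError("need at least one case")
    correct = sum(a == p for a, p in zip(actual, predicted))
    return 100.0 * correct / len(actual)


def confusion_counts(actual, predicted):
    """Count each (actual, predicted) label pair, i.e. a confusion matrix as a Counter."""
    return Counter(zip(actual, predicted))


# Hypothetical toy labels for illustration only (not the paper's data).
actual = ["YES", "YES", "NO", "YES", "NO", "NO", "YES", "NO"]
predicted = ["YES", "NO", "NO", "YES", "NO", "YES", "YES", "NO"]

toy_score = percent_correct(actual, predicted)
toy_matrix = confusion_counts(actual, predicted)

# Scores as reported in the abstract, in percent.
reported = {
    "Bayesian Network": 88.34,
    "Logistic Regression": 91.90,
    "J48": 86.08,
    "Random Forest": 90.93,
    "Naive Bayes": 90.29,
}

# Rank the classifiers from best to worst reported score.
ranking = sorted(reported.items(), key=lambda item: item[1], reverse=True)

if __name__ == "__main__":
    print(f"toy share correct: {toy_score:.2f}%")
    for name, score in ranking:
        print(f"{name}: {score:.2f}%")
```

Sorting the reported scores this way gives the order Logistic Regression, Random Forest, Naive Bayes, Bayesian Network, J48.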