{"title":"基于贝叶斯网络、逻辑回归、J48、随机森林和Naïve贝叶斯算法的特征选择方法的肺癌疾病预测与分类","authors":"J. Viji Cripsy, T. Divya","doi":"10.1109/ICSMDI57622.2023.00066","DOIUrl":null,"url":null,"abstract":"People who have never smoked can get lung cancer, but smokers have a higher risk than non-smokers. Any aspect of the respiratory system can be affected by lung cancer, which can start anywhere in the lungs, Different classification methods are used for lung cancer prediction. This article uses five different classification algorithms to predict lung cancer in patients using Kaggle dataset. Bayesian Network, Logistic Regression, J48, Random Forest and Naive Bayes methods are used, Based on the carefully identified correct and incorrect cases, the quality of the result was measured using the evaluation technique and the WEKA tool. The experimental results showed that Logistic Regression performed best (91.90%), followed by Naive Bayes (90.29%), Bayesian Network (88.34%), j48 (86.08%) and Random Forest (90.93%).","PeriodicalId":373017,"journal":{"name":"2023 3rd International Conference on Smart Data Intelligence (ICSMDI)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Lung Cancer Disease Prediction and Classification based on Feature Selection method using Bayesian Network, Logistic Regression, J48, Random Forest, and Naïve Bayes Algorithms\",\"authors\":\"J. Viji Cripsy, T. Divya\",\"doi\":\"10.1109/ICSMDI57622.2023.00066\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"People who have never smoked can get lung cancer, but smokers have a higher risk than non-smokers. Any aspect of the respiratory system can be affected by lung cancer, which can start anywhere in the lungs, Different classification methods are used for lung cancer prediction. This article uses five different classification algorithms to predict lung cancer in patients using Kaggle dataset. Bayesian Network, Logistic Regression, J48, Random Forest and Naive Bayes methods are used, Based on the carefully identified correct and incorrect cases, the quality of the result was measured using the evaluation technique and the WEKA tool. The experimental results showed that Logistic Regression performed best (91.90%), followed by Naive Bayes (90.29%), Bayesian Network (88.34%), j48 (86.08%) and Random Forest (90.93%).\",\"PeriodicalId\":373017,\"journal\":{\"name\":\"2023 3rd International Conference on Smart Data Intelligence (ICSMDI)\",\"volume\":\"2 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 3rd International Conference on Smart Data Intelligence (ICSMDI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICSMDI57622.2023.00066\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 3rd International Conference on Smart Data Intelligence (ICSMDI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSMDI57622.2023.00066","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Lung Cancer Disease Prediction and Classification based on Feature Selection method using Bayesian Network, Logistic Regression, J48, Random Forest, and Naïve Bayes Algorithms
People who have never smoked can get lung cancer, but smokers have a higher risk than non-smokers. Any aspect of the respiratory system can be affected by lung cancer, which can start anywhere in the lungs, Different classification methods are used for lung cancer prediction. This article uses five different classification algorithms to predict lung cancer in patients using Kaggle dataset. Bayesian Network, Logistic Regression, J48, Random Forest and Naive Bayes methods are used, Based on the carefully identified correct and incorrect cases, the quality of the result was measured using the evaluation technique and the WEKA tool. The experimental results showed that Logistic Regression performed best (91.90%), followed by Naive Bayes (90.29%), Bayesian Network (88.34%), j48 (86.08%) and Random Forest (90.93%).