Prediction of post-covid-19 using supervised machine learning techniques

World Journal of Advanced Engineering Technology and Sciences Pub Date : 2024-07-30 DOI:10.30574/wjaets.2024.12.2.0297

Sunday Akinwamide, Rashidat Idris-Tajudeen, Titilope Helen Akin-Olayemi

{"title":"Prediction of post-covid-19 using supervised machine learning techniques","authors":"Sunday Akinwamide, Rashidat Idris-Tajudeen, Titilope Helen Akin-Olayemi","doi":"10.30574/wjaets.2024.12.2.0297","DOIUrl":null,"url":null,"abstract":"The COVID-19 pandemic has had a profound impact on global health, necessitating the development of predictive models to manage and mitigate its effects. Early diagnosis is crucial for preventing the progression of diseases that can significantly endanger human life. This study explores the application of supervised machine learning techniques to predict Post-COVID-19 outcomes, including long-term health complications and recovery trajectories. In this study, we utilized 10 advanced supervised machine learning algorithms, including both stand-alone models (Decision Tree, Random Forest, Logistic Regression, K-Nearest Neighbors, Support Vector Machine, and Gaussian Naive Bayes) and ensemble learning techniques (Bagging Decision Tree Ensemble, Boosting Decision Tree Ensemble, Voting Ensemble, and Stacked Generalization – Stacking Ensemble). These models were applied to analyze and predict the presence of COVID-19 using the COVID-19 Symptoms and Presence dataset from Kaggle. The performance of each model was evaluated using an 80:20 train-test split as well as 5, 10, 15, 20, and 25-fold cross-validation. Evaluation metrics included accuracy, precision, recall, F1-score, and the confusion matrix. The results indicate that the Decision Tree algorithm outperformed the other models, achieving an accuracy of 98.81%, a precision of 1.00, a recall of 0.98, and an F1-score of 0.99. Our results indicate that machine learning models can effectively predict Post-COVID-19 conditions, providing valuable insights for healthcare providers and policymakers.","PeriodicalId":275182,"journal":{"name":"World Journal of Advanced Engineering Technology and Sciences","volume":"3 3","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"World Journal of Advanced Engineering Technology and Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.30574/wjaets.2024.12.2.0297","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

The COVID-19 pandemic has had a profound impact on global health, necessitating the development of predictive models to manage and mitigate its effects. Early diagnosis is crucial for preventing the progression of diseases that can significantly endanger human life. This study explores the application of supervised machine learning techniques to predict Post-COVID-19 outcomes, including long-term health complications and recovery trajectories. In this study, we utilized 10 advanced supervised machine learning algorithms, including both stand-alone models (Decision Tree, Random Forest, Logistic Regression, K-Nearest Neighbors, Support Vector Machine, and Gaussian Naive Bayes) and ensemble learning techniques (Bagging Decision Tree Ensemble, Boosting Decision Tree Ensemble, Voting Ensemble, and Stacked Generalization – Stacking Ensemble). These models were applied to analyze and predict the presence of COVID-19 using the COVID-19 Symptoms and Presence dataset from Kaggle. The performance of each model was evaluated using an 80:20 train-test split as well as 5, 10, 15, 20, and 25-fold cross-validation. Evaluation metrics included accuracy, precision, recall, F1-score, and the confusion matrix. The results indicate that the Decision Tree algorithm outperformed the other models, achieving an accuracy of 98.81%, a precision of 1.00, a recall of 0.98, and an F1-score of 0.99. Our results indicate that machine learning models can effectively predict Post-COVID-19 conditions, providing valuable insights for healthcare providers and policymakers.

查看原文本刊更多论文

利用监督机器学习技术预测后同卵双生-19

COVID-19 大流行对全球健康产生了深远影响，因此有必要开发预测模型来管理和减轻其影响。早期诊断对于防止严重危害人类生命的疾病恶化至关重要。本研究探讨了如何应用有监督的机器学习技术来预测后 COVID-19 的结果，包括长期健康并发症和康复轨迹。在这项研究中，我们采用了 10 种先进的监督机器学习算法，包括独立模型（决策树、随机森林、逻辑回归、K-最近邻、支持向量机和高斯直觉贝叶斯）和集合学习技术（袋装决策树集合、提升决策树集合、投票集合和堆叠泛化-堆叠集合）。这些模型被用于使用 Kaggle 的 COVID-19 症状和存在数据集分析和预测 COVID-19 的存在。使用 80:20 的训练-测试比例以及 5、10、15、20 和 25 倍交叉验证对每个模型的性能进行了评估。评估指标包括准确度、精确度、召回率、F1 分数和混淆矩阵。结果表明，决策树算法优于其他模型，准确率达到 98.81%，精确度达到 1.00，召回率达到 0.98，F1 分数达到 0.99。我们的研究结果表明，机器学习模型可以有效预测 COVID-19 后的情况，为医疗服务提供者和政策制定者提供有价值的见解。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

World Journal of Advanced Engineering Technology and Sciences

自引率

0.00%

发文量