Bilal Abdualgalil, Sajimon Abraham, Waleed M. Ismael
{"title":"基于临床数据的高效机器学习技术用于登革热疾病预测的早期诊断","authors":"Bilal Abdualgalil, Sajimon Abraham, Waleed M. Ismael","doi":"10.18196/jrc.v3i3.14387","DOIUrl":null,"url":null,"abstract":"Dengue fever is a worldwide issue, especially in Yemen. Although early detection is critical to reducing dengue disease deaths, accurate dengue diagnosis requires a long time due to the numerous clinical examinations. Thus, this issue necessitates the development of a new diagnostic schema. The objective of this work is to develop a diagnostic model for the earlier diagnosis of dengue disease using Efficient Machine Learning Techniques (EMLT). This paper proposed prediction models for dengue disease based on EMLT. Five different efficient machine learning models, including K-Nearest Neighbor (KNN), Gradient Boosting Classifier (GBC), Extra Tree Classifier (ETC), eXtreme Gradient Boosting (XGB), and Light Gradient Boosting Machine (LightGBM). All classifiers are trained and tested on the dataset using 10-Fold Cross-Validation and Holdout Cross-Validation approaches. On a test set, all models were evaluated using different metrics: accuracy, F1-sore, Recall, Precision, AUC, and operating time. Based on the findings, the ETC model achieved the highest accuracy in Hold-out and 10-fold cross-validation, with 99.12 % and 99.03 %, respectively. In the Holdout cross-validation approach, we conclude that the best classifier with high accuracy is ETC, which achieved 99.12 %. Finally, the experimental results indicate that classifier performance in holdout cross-validation outperforms 10-fold cross-validation. Accordingly, the proposed dengue prediction system demonstrates its efficacy and effectiveness in assisting doctors in accurately predicting dengue disease.","PeriodicalId":443428,"journal":{"name":"Journal of Robotics and Control (JRC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Early Diagnosis for Dengue Disease Prediction Using Efficient Machine Learning Techniques Based on Clinical Data\",\"authors\":\"Bilal Abdualgalil, Sajimon Abraham, Waleed M. Ismael\",\"doi\":\"10.18196/jrc.v3i3.14387\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Dengue fever is a worldwide issue, especially in Yemen. Although early detection is critical to reducing dengue disease deaths, accurate dengue diagnosis requires a long time due to the numerous clinical examinations. Thus, this issue necessitates the development of a new diagnostic schema. The objective of this work is to develop a diagnostic model for the earlier diagnosis of dengue disease using Efficient Machine Learning Techniques (EMLT). This paper proposed prediction models for dengue disease based on EMLT. Five different efficient machine learning models, including K-Nearest Neighbor (KNN), Gradient Boosting Classifier (GBC), Extra Tree Classifier (ETC), eXtreme Gradient Boosting (XGB), and Light Gradient Boosting Machine (LightGBM). All classifiers are trained and tested on the dataset using 10-Fold Cross-Validation and Holdout Cross-Validation approaches. On a test set, all models were evaluated using different metrics: accuracy, F1-sore, Recall, Precision, AUC, and operating time. Based on the findings, the ETC model achieved the highest accuracy in Hold-out and 10-fold cross-validation, with 99.12 % and 99.03 %, respectively. In the Holdout cross-validation approach, we conclude that the best classifier with high accuracy is ETC, which achieved 99.12 %. Finally, the experimental results indicate that classifier performance in holdout cross-validation outperforms 10-fold cross-validation. Accordingly, the proposed dengue prediction system demonstrates its efficacy and effectiveness in assisting doctors in accurately predicting dengue disease.\",\"PeriodicalId\":443428,\"journal\":{\"name\":\"Journal of Robotics and Control (JRC)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Robotics and Control (JRC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.18196/jrc.v3i3.14387\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Robotics and Control (JRC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.18196/jrc.v3i3.14387","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Early Diagnosis for Dengue Disease Prediction Using Efficient Machine Learning Techniques Based on Clinical Data
Dengue fever is a worldwide issue, especially in Yemen. Although early detection is critical to reducing dengue disease deaths, accurate dengue diagnosis requires a long time due to the numerous clinical examinations. Thus, this issue necessitates the development of a new diagnostic schema. The objective of this work is to develop a diagnostic model for the earlier diagnosis of dengue disease using Efficient Machine Learning Techniques (EMLT). This paper proposed prediction models for dengue disease based on EMLT. Five different efficient machine learning models, including K-Nearest Neighbor (KNN), Gradient Boosting Classifier (GBC), Extra Tree Classifier (ETC), eXtreme Gradient Boosting (XGB), and Light Gradient Boosting Machine (LightGBM). All classifiers are trained and tested on the dataset using 10-Fold Cross-Validation and Holdout Cross-Validation approaches. On a test set, all models were evaluated using different metrics: accuracy, F1-sore, Recall, Precision, AUC, and operating time. Based on the findings, the ETC model achieved the highest accuracy in Hold-out and 10-fold cross-validation, with 99.12 % and 99.03 %, respectively. In the Holdout cross-validation approach, we conclude that the best classifier with high accuracy is ETC, which achieved 99.12 %. Finally, the experimental results indicate that classifier performance in holdout cross-validation outperforms 10-fold cross-validation. Accordingly, the proposed dengue prediction system demonstrates its efficacy and effectiveness in assisting doctors in accurately predicting dengue disease.