D. Andrade-Girón, Edgardo Carreño-Cisneros, Cecilia Mejía-Dominguez, W. Marín-Rodriguez, Henry Villarreal-Torres
{"title":"预测疑似COVID-19患者的机器学习算法比较","authors":"D. Andrade-Girón, Edgardo Carreño-Cisneros, Cecilia Mejía-Dominguez, W. Marín-Rodriguez, Henry Villarreal-Torres","doi":"10.56294/saludcyt2023336","DOIUrl":null,"url":null,"abstract":"The coronavirus disease (COVID-19) outbreak has infected millions of people, causing a high death rate worldwide. Patients suspected of having COVID-19 are transferred to different health facilities, which has caused a saturation in care, for which it is necessary to have a prediction model to classify patients at high risk of clinical deterioration. The objective of the research was to compare classification algorithms based on automatic learning machines, for the prediction of clinical diagnosis in patients with COVID-19. 1000 records of patients with suspected SARS-CoV-2 infection who were admitted by the emergency service in health establishments in Peru were collected. After pre-processing the data and engineering the attributes, a sample of 700 records was determined. Models were designed and algorithms were compared: Logistic Regression, Support Vector Machine, Nearest Neighbors, Decision Tree, Random Forest, and Navie Bayes. The evaluation of the results of each algorithm was carried out using Accuracy, precision, sensitivity and Chohen's Kappa to know the degree of agreement between the prediction by the learning machine and the results of reality, that is, to what extent both results agree in their measurement. The algorithm that presented the best results was the Support Vector Machine and Random Forest, which predicted the patients with an accuracy of 97%, and Cohen's Kappa of 0.95, with figures higher than the other models evaluated.","PeriodicalId":184806,"journal":{"name":"Salud Ciencia y Tecnología","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":"{\"title\":\"Comparison of Machine Learning Algorithms for Predicting Patients with Suspected COVID-19\",\"authors\":\"D. Andrade-Girón, Edgardo Carreño-Cisneros, Cecilia Mejía-Dominguez, W. Marín-Rodriguez, Henry Villarreal-Torres\",\"doi\":\"10.56294/saludcyt2023336\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The coronavirus disease (COVID-19) outbreak has infected millions of people, causing a high death rate worldwide. Patients suspected of having COVID-19 are transferred to different health facilities, which has caused a saturation in care, for which it is necessary to have a prediction model to classify patients at high risk of clinical deterioration. The objective of the research was to compare classification algorithms based on automatic learning machines, for the prediction of clinical diagnosis in patients with COVID-19. 1000 records of patients with suspected SARS-CoV-2 infection who were admitted by the emergency service in health establishments in Peru were collected. After pre-processing the data and engineering the attributes, a sample of 700 records was determined. Models were designed and algorithms were compared: Logistic Regression, Support Vector Machine, Nearest Neighbors, Decision Tree, Random Forest, and Navie Bayes. The evaluation of the results of each algorithm was carried out using Accuracy, precision, sensitivity and Chohen's Kappa to know the degree of agreement between the prediction by the learning machine and the results of reality, that is, to what extent both results agree in their measurement. The algorithm that presented the best results was the Support Vector Machine and Random Forest, which predicted the patients with an accuracy of 97%, and Cohen's Kappa of 0.95, with figures higher than the other models evaluated.\",\"PeriodicalId\":184806,\"journal\":{\"name\":\"Salud Ciencia y Tecnología\",\"volume\":\"4 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-03-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"7\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Salud Ciencia y Tecnología\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.56294/saludcyt2023336\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Salud Ciencia y Tecnología","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.56294/saludcyt2023336","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Comparison of Machine Learning Algorithms for Predicting Patients with Suspected COVID-19
The coronavirus disease (COVID-19) outbreak has infected millions of people, causing a high death rate worldwide. Patients suspected of having COVID-19 are transferred to different health facilities, which has caused a saturation in care, for which it is necessary to have a prediction model to classify patients at high risk of clinical deterioration. The objective of the research was to compare classification algorithms based on automatic learning machines, for the prediction of clinical diagnosis in patients with COVID-19. 1000 records of patients with suspected SARS-CoV-2 infection who were admitted by the emergency service in health establishments in Peru were collected. After pre-processing the data and engineering the attributes, a sample of 700 records was determined. Models were designed and algorithms were compared: Logistic Regression, Support Vector Machine, Nearest Neighbors, Decision Tree, Random Forest, and Navie Bayes. The evaluation of the results of each algorithm was carried out using Accuracy, precision, sensitivity and Chohen's Kappa to know the degree of agreement between the prediction by the learning machine and the results of reality, that is, to what extent both results agree in their measurement. The algorithm that presented the best results was the Support Vector Machine and Random Forest, which predicted the patients with an accuracy of 97%, and Cohen's Kappa of 0.95, with figures higher than the other models evaluated.