Machine Learning Classifier Model for Prediction of COVID-19
Jhimli Adhikari. Int. J. Next Gener. Comput. Published 2021-04-03. DOI: https://doi.org/10.47164/IJNGC.V12I1.186
The COVID-19 pandemic has become a major threat to the world. This study designs a model that predicts, as accurately as possible, the likelihood that a patient has COVID-19. To that end, three machine learning classifiers, namely Decision Tree, Naive Bayes, and Logistic Regression, are used to detect COVID-19 at an early stage. The models are trained on 75% of the samples and tested on the remaining 25%. Because the dataset is imbalanced, the performance of all three algorithms is evaluated on several measures: F-Measure, Accuracy, and the Matthews Correlation Coefficient. Accuracy is measured over correctly and incorrectly classified instances. All analyses were performed in Python, version 3.8.2. Receiver Operating Characteristic (ROC) curves are used to verify the results systematically. This framework can be used, among other considerations, to prioritize testing for COVID-19 when testing resources are limited.
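The abstract's reason for reporting F-Measure and the Matthews Correlation Coefficient alongside Accuracy can be illustrated with a small sketch. The code below is not the paper's implementation and uses none of its data: the label vectors are hypothetical (10 positives out of 100 samples), and the helper names `confusion_counts`, `accuracy`, `f_measure`, and `mcc` are introduced here for illustration only. It shows that a classifier that always predicts the majority class can reach high accuracy on an imbalanced set while scoring zero on the other two measures.

```python
import math

def confusion_counts(y_true, y_pred):
    """Return (tp, fp, fn, tn) for binary labels where 1 is the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    return tp, fp, fn, tn

def accuracy(tp, fp, fn, tn):
    # Fraction of all instances that were classified correctly.
    return (tp + tn) / (tp + fp + fn + tn)

def f_measure(tp, fp, fn, tn):
    # F1: harmonic mean of precision and recall, written in count form.
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0

def mcc(tp, fp, fn, tn):
    # Matthews Correlation Coefficient; by convention 0 when undefined.
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return (tp * tn - fp * fn) / denom if denom else 0.0

# Hypothetical imbalanced test set: 10 positives, 90 negatives.
y_true = [1] * 10 + [0] * 90

# Assumed baseline that always predicts the majority (negative) class.
y_pred_majority = [0] * 100

# Assumed model output: finds 6 of 10 positives, raises 5 false alarms.
y_pred_model = [1] * 6 + [0] * 4 + [1] * 5 + [0] * 85

for name, y_pred in [("majority", y_pred_majority), ("model", y_pred_model)]:
    counts = confusion_counts(y_true, y_pred)
    print(name,
          "accuracy=%.3f" % accuracy(*counts),
          "f_measure=%.3f" % f_measure(*counts),
          "mcc=%.3f" % mcc(*counts))
```

On these made-up labels the two predictors differ by only one percentage point in accuracy, while F-Measure and MCC separate them clearly, which is the motivation the abstract gives for reporting several measures.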