Utkrisht Singh, Mahendra Kumar Gourisaria, B. K. Mishra
{"title":"A Dual Dataset approach for the diagnosis of Hepatitis C Virus using Machine Learning","authors":"Utkrisht Singh, Mahendra Kumar Gourisaria, B. K. Mishra","doi":"10.1109/CONECCT55679.2022.9865758","DOIUrl":null,"url":null,"abstract":"Hepatitis C (HCV) is a micro-contagion that leads to liver inflammation, sometimes affecting the liver to a serious extent. In any medical therapy, proper diagnosis of treatment response is critical for decreasing the effects of the disease. It is assessed that three to four million new cases come every year for Hepatitis C, which is a public health issue that should be solved with treatment policies and recognition. The principal motive of this paper is to implement a twofold dataset approach for the finding of Hepatitis C Virus in the general population. Popular supervised learning models like Decision tree (DT), Logistic regression (LR), K-Nearest Neighbor (KNN), Extreme gradient boosting (XGB), Ada boost (AB), Gradient Boosting Machine, Gaussian Naive Bayes, Random Forest (RF), Gradient Boosting (GB), Support Vector Machine and its variations were instigated on the classification dataset, furthermore, some unsupervised learning models like K-means, Hierarchical clustering, DBMSCN, and Gaussian Mixture algorithms were applied on the HCV clustering dataset. It was concluded that Logistic Regression and K-Means were the superlative models","PeriodicalId":380005,"journal":{"name":"2022 IEEE International Conference on Electronics, Computing and Communication Technologies (CONECCT)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Electronics, Computing and Communication Technologies (CONECCT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CONECCT55679.2022.9865758","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Hepatitis C (HCV) is a micro-contagion that leads to liver inflammation, sometimes affecting the liver to a serious extent. In any medical therapy, proper diagnosis of treatment response is critical for decreasing the effects of the disease. It is assessed that three to four million new cases come every year for Hepatitis C, which is a public health issue that should be solved with treatment policies and recognition. The principal motive of this paper is to implement a twofold dataset approach for the finding of Hepatitis C Virus in the general population. Popular supervised learning models like Decision tree (DT), Logistic regression (LR), K-Nearest Neighbor (KNN), Extreme gradient boosting (XGB), Ada boost (AB), Gradient Boosting Machine, Gaussian Naive Bayes, Random Forest (RF), Gradient Boosting (GB), Support Vector Machine and its variations were instigated on the classification dataset, furthermore, some unsupervised learning models like K-means, Hierarchical clustering, DBMSCN, and Gaussian Mixture algorithms were applied on the HCV clustering dataset. It was concluded that Logistic Regression and K-Means were the superlative models