肝硬化、纤维化和肝炎分类的集成机器学习框架

2022 Third International Conference on Latest trends in Electrical Engineering and Computing Technologies (INTELLECT) Pub Date : 2022-11-16 DOI:10.1109/INTELLECT55495.2022.9969404

Sibgha Islam, A. Rehman, Sabeen Javaid, Tahir Muhammad Ali, Ali Nawaz

{"title":"肝硬化、纤维化和肝炎分类的集成机器学习框架","authors":"Sibgha Islam, A. Rehman, Sabeen Javaid, Tahir Muhammad Ali, Ali Nawaz","doi":"10.1109/INTELLECT55495.2022.9969404","DOIUrl":null,"url":null,"abstract":"Hepatitis C is an ailment that causes inflammation of the liver and leads to serious liver damage. In previous research, the accuracy of the model wasn't that accurate but the differences this paper made model worked well for the prediction of Hepatitis C disease. In the dataset, there are mainly four categories (Blood Donor, Suspected Blood Donor, Fibrosis, and Cirrhosis) used that are labeled. Its data type is polynomial with 0 missing values. The minimum value in the category is 7 for suspect blood donors and the most value is 533 for a blood donor. The machine learning algorithms used in medical approaches are increasing day by day for prediction tools, diagnosis tools, and detection of diseases such as the hepatitis C virus. We used the rapid miner Software for the application of machine learning algorithms. Firstly, took the dataset of the hepatitis C virus from the UCI machine learning site and then applied the five Machine Learning Algorithms, which include Naive Bayes, Random Forest, KNN, Decision Tree & Deep Learning (ANN). On applying feature selection, the attributes Age, ALB, ALP, AST, CHE, GGT, and PROT were selected. After applying different algorithms, the best results are shown by deep learning (ANN) with an accuracy of 95.50%. Rest all algorithms showed minimum accuracy as a Decision tree with 93.09%, Naïve Bayes with 91.89%, KNN with 93.09%, and Random Forest showed 94.29% of high accuracy.","PeriodicalId":219188,"journal":{"name":"2022 Third International Conference on Latest trends in Electrical Engineering and Computing Technologies (INTELLECT)","volume":"162 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"An Integrated Machine Learning Framework for Classification of Cirrhosis, Fibrosis, and Hepatitis\",\"authors\":\"Sibgha Islam, A. Rehman, Sabeen Javaid, Tahir Muhammad Ali, Ali Nawaz\",\"doi\":\"10.1109/INTELLECT55495.2022.9969404\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Hepatitis C is an ailment that causes inflammation of the liver and leads to serious liver damage. In previous research, the accuracy of the model wasn't that accurate but the differences this paper made model worked well for the prediction of Hepatitis C disease. In the dataset, there are mainly four categories (Blood Donor, Suspected Blood Donor, Fibrosis, and Cirrhosis) used that are labeled. Its data type is polynomial with 0 missing values. The minimum value in the category is 7 for suspect blood donors and the most value is 533 for a blood donor. The machine learning algorithms used in medical approaches are increasing day by day for prediction tools, diagnosis tools, and detection of diseases such as the hepatitis C virus. We used the rapid miner Software for the application of machine learning algorithms. Firstly, took the dataset of the hepatitis C virus from the UCI machine learning site and then applied the five Machine Learning Algorithms, which include Naive Bayes, Random Forest, KNN, Decision Tree & Deep Learning (ANN). On applying feature selection, the attributes Age, ALB, ALP, AST, CHE, GGT, and PROT were selected. After applying different algorithms, the best results are shown by deep learning (ANN) with an accuracy of 95.50%. Rest all algorithms showed minimum accuracy as a Decision tree with 93.09%, Naïve Bayes with 91.89%, KNN with 93.09%, and Random Forest showed 94.29% of high accuracy.\",\"PeriodicalId\":219188,\"journal\":{\"name\":\"2022 Third International Conference on Latest trends in Electrical Engineering and Computing Technologies (INTELLECT)\",\"volume\":\"162 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 Third International Conference on Latest trends in Electrical Engineering and Computing Technologies (INTELLECT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/INTELLECT55495.2022.9969404\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 Third International Conference on Latest trends in Electrical Engineering and Computing Technologies (INTELLECT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INTELLECT55495.2022.9969404","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

丙型肝炎是一种引起肝脏炎症并导致严重肝损伤的疾病。在以往的研究中，该模型的准确性不是很准确，但本文提出的模型的差异对于丙型肝炎的预测效果很好。在数据集中，主要有四个类别(献血者、疑似献血者、纤维化和肝硬化)被标记使用。它的数据类型是零缺失值的多项式。对于疑似献血者，该类别的最小值为7，对于献血者，最大值为533。医学方法中使用的机器学习算法日益增加，用于预测工具，诊断工具和检测疾病(如丙型肝炎病毒)。我们使用快速挖矿软件进行机器学习算法的应用。首先，从UCI机器学习站点获取丙型肝炎病毒数据集，然后应用五种机器学习算法，包括朴素贝叶斯、随机森林、KNN、决策树和深度学习(ANN)。在应用特征选择时，选择属性Age、ALB、ALP、AST、CHE、GGT和PROT。应用不同算法后，深度学习(ANN)的准确率达到95.50%，效果最好。其余算法的准确率最低，决策树为93.09%，Naïve贝叶斯为91.89%，KNN为93.09%，随机森林为94.29%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

An Integrated Machine Learning Framework for Classification of Cirrhosis, Fibrosis, and Hepatitis

Hepatitis C is an ailment that causes inflammation of the liver and leads to serious liver damage. In previous research, the accuracy of the model wasn't that accurate but the differences this paper made model worked well for the prediction of Hepatitis C disease. In the dataset, there are mainly four categories (Blood Donor, Suspected Blood Donor, Fibrosis, and Cirrhosis) used that are labeled. Its data type is polynomial with 0 missing values. The minimum value in the category is 7 for suspect blood donors and the most value is 533 for a blood donor. The machine learning algorithms used in medical approaches are increasing day by day for prediction tools, diagnosis tools, and detection of diseases such as the hepatitis C virus. We used the rapid miner Software for the application of machine learning algorithms. Firstly, took the dataset of the hepatitis C virus from the UCI machine learning site and then applied the five Machine Learning Algorithms, which include Naive Bayes, Random Forest, KNN, Decision Tree & Deep Learning (ANN). On applying feature selection, the attributes Age, ALB, ALP, AST, CHE, GGT, and PROT were selected. After applying different algorithms, the best results are shown by deep learning (ANN) with an accuracy of 95.50%. Rest all algorithms showed minimum accuracy as a Decision tree with 93.09%, Naïve Bayes with 91.89%, KNN with 93.09%, and Random Forest showed 94.29% of high accuracy.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 Third International Conference on Latest trends in Electrical Engineering and Computing Technologies (INTELLECT)

自引率

0.00%

发文量