{"title":"基于机器学习概念的疾病诊断路径确定:丙型肝炎案例研究","authors":"Jimmy Tjen, V. Pratama","doi":"10.52158/jacost.v4i2.556","DOIUrl":null,"url":null,"abstract":"Hepatitis is considered to be one of the most dangerous diseases, which often leads to death if not handled properly. Thus, early detection via precise diagnosis is needed in order to prevent the unfortunate event. This research aims to provide a novel hepatitis C diagnosis based on the machine learning algorithm, which is the classification tree from the decision tree learning and the distance correlation, which measures the Euclidean distance between 2 vectors. In particular, the goal is to develop a low computational cost yet precise algorithm for diagnosing the possibility of whether a person is being infected with Hepatitis C or not. Based on the experiment, the distance correlation-based classification tree algorithm outperforms the classical classification tree algorithm by around 3% while using only 7 features instead of 12 as in the classical algorithm. Furthermore, the algorithm identified albumin (ALB), Creatinine (CREA), Bilirubin (BIL), Aspartate Transaminase (AST) and Cholesterol (CHOL) as significant risk factors in determining whether someone is potentially infected with hepatitis C or not, with Creatinine is identified as the most important parameter among all 5 parameters mentioned above.","PeriodicalId":151855,"journal":{"name":"Journal of Applied Computer Science and Technology","volume":"1 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Penentuan Jalur Diagnostik Penyakit Berbasis Konsep Pembelajaran Mesin: Studi kasus Penyakit Hepatitis C\",\"authors\":\"Jimmy Tjen, V. Pratama\",\"doi\":\"10.52158/jacost.v4i2.556\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Hepatitis is considered to be one of the most dangerous diseases, which often leads to death if not handled properly. Thus, early detection via precise diagnosis is needed in order to prevent the unfortunate event. This research aims to provide a novel hepatitis C diagnosis based on the machine learning algorithm, which is the classification tree from the decision tree learning and the distance correlation, which measures the Euclidean distance between 2 vectors. In particular, the goal is to develop a low computational cost yet precise algorithm for diagnosing the possibility of whether a person is being infected with Hepatitis C or not. Based on the experiment, the distance correlation-based classification tree algorithm outperforms the classical classification tree algorithm by around 3% while using only 7 features instead of 12 as in the classical algorithm. Furthermore, the algorithm identified albumin (ALB), Creatinine (CREA), Bilirubin (BIL), Aspartate Transaminase (AST) and Cholesterol (CHOL) as significant risk factors in determining whether someone is potentially infected with hepatitis C or not, with Creatinine is identified as the most important parameter among all 5 parameters mentioned above.\",\"PeriodicalId\":151855,\"journal\":{\"name\":\"Journal of Applied Computer Science and Technology\",\"volume\":\"1 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-11-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Applied Computer Science and Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.52158/jacost.v4i2.556\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Applied Computer Science and Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.52158/jacost.v4i2.556","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Penentuan Jalur Diagnostik Penyakit Berbasis Konsep Pembelajaran Mesin: Studi kasus Penyakit Hepatitis C
Hepatitis is considered to be one of the most dangerous diseases, which often leads to death if not handled properly. Thus, early detection via precise diagnosis is needed in order to prevent the unfortunate event. This research aims to provide a novel hepatitis C diagnosis based on the machine learning algorithm, which is the classification tree from the decision tree learning and the distance correlation, which measures the Euclidean distance between 2 vectors. In particular, the goal is to develop a low computational cost yet precise algorithm for diagnosing the possibility of whether a person is being infected with Hepatitis C or not. Based on the experiment, the distance correlation-based classification tree algorithm outperforms the classical classification tree algorithm by around 3% while using only 7 features instead of 12 as in the classical algorithm. Furthermore, the algorithm identified albumin (ALB), Creatinine (CREA), Bilirubin (BIL), Aspartate Transaminase (AST) and Cholesterol (CHOL) as significant risk factors in determining whether someone is potentially infected with hepatitis C or not, with Creatinine is identified as the most important parameter among all 5 parameters mentioned above.