Optimization of the C4.5 Algorithm by Using a Genetic Algorithm for the Diagnosis of Life Expectancy for Hepatitis Patients

Margareta Ayu Riantik, R. Arifudin
Journal: Journal of Advances in Information Systems and Technology
Publication date: 2021-04-14
DOI: 10.15294/jaist.v3i1.49014 (https://doi.org/10.15294/jaist.v3i1.49014)
Citations: 1

Abstract

As technology develops rapidly, the volume of generated data, including medical data, is growing quickly. Data mining methods can use such data to help diagnose the life expectancy of people with diseases such as hepatitis. This study applies a classification technique, the C4.5 algorithm, to a dataset from the UCI Machine Learning Repository. The dataset has 19 attributes, 1 class, and 155 samples. The C4.5 algorithm is optimized by using a Genetic Algorithm for feature selection, and the study compares the accuracy of C4.5 before and after this optimization. On its own, C4.5 reaches a highest accuracy of 96.23%. After optimization with the Genetic Algorithm, which selects 15 features, its highest accuracy is 98.11%. Applying the Genetic Algorithm to C4.5 is thus shown to improve the accuracy of diagnosing the life expectancy of people with hepatitis by 1.88 percentage points.
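The abstract does not describe how the Genetic Algorithm encodes or evolves feature subsets, so the following is only a minimal sketch of one common approach: each candidate is a 0/1 mask over the 19 attributes, evolved with tournament selection, single-point crossover, bit-flip mutation, and elitism. Every name and parameter here (`select_features`, `toy_fitness`, the population size, the mutation rate) is an assumption made for illustration, not the authors' setup. In particular, `toy_fitness` is a made-up stand-in; in a real experiment the fitness would be the validation accuracy of a C4.5-style decision tree trained on the selected columns.

```python
import random

N_FEATURES = 19  # attribute count reported in the abstract


def toy_fitness(mask):
    """Hypothetical stand-in for classifier accuracy on the selected features.

    A real run would train a C4.5-style tree on the masked columns and
    return its validation accuracy instead.
    """
    informative = {0, 3, 5, 8, 13}  # made-up "useful" feature indices
    hits = sum(1 for i in informative if mask[i])
    return hits - 0.05 * sum(mask)  # small penalty per selected feature


def tournament(population, scores, rng, k=3):
    """Return the fittest of k randomly drawn individuals."""
    picks = rng.sample(range(len(population)), k)
    return population[max(picks, key=lambda i: scores[i])]


def crossover(a, b, rng):
    """Single-point crossover of two bit masks."""
    point = rng.randrange(1, len(a))
    return a[:point] + b[point:]


def mutate(mask, rng, rate=0.05):
    """Flip each bit independently with probability `rate`."""
    return [1 - bit if rng.random() < rate else bit for bit in mask]


def select_features(fitness, n_features=N_FEATURES, pop_size=30,
                    generations=40, seed=0):
    """Evolve a 0/1 feature mask that maximizes `fitness`."""
    rng = random.Random(seed)
    population = [[rng.randint(0, 1) for _ in range(n_features)]
                  for _ in range(pop_size)]
    best = max(population, key=fitness)
    for _ in range(generations):
        scores = [fitness(ind) for ind in population]
        children = [best[:]]  # elitism: carry the best mask forward
        while len(children) < pop_size:
            parent_a = tournament(population, scores, rng)
            parent_b = tournament(population, scores, rng)
            children.append(mutate(crossover(parent_a, parent_b, rng), rng))
        population = children
        best = max(population, key=fitness)
    return best


if __name__ == "__main__":
    mask = select_features(toy_fitness)
    print("selected feature indices:", [i for i, bit in enumerate(mask) if bit])
```

Because the best mask is copied into every new generation, its fitness never decreases from one generation to the next. Swapping `toy_fitness` for a function that trains and scores a decision tree on the masked attributes would turn this into a wrapper-style feature selector of the kind the abstract describes.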