Toddler Nutritional Status Classification Using C4.5 and Particle Swarm Optimization

Alwis Nazir, Amany Akhyar, Y. Yusra, Elvia Budianita
{"title":"Toddler Nutritional Status Classification Using C4.5 and Particle Swarm Optimization","authors":"Alwis Nazir, Amany Akhyar, Y. Yusra, Elvia Budianita","doi":"10.15294/sji.v9i1.33158","DOIUrl":null,"url":null,"abstract":"Abstract. Purpose: This research was conducted to create a classification model in the form of the most optimal decision tree. Optimal in this case is the combination of parameters used that will produce the highest accuracy compared to other parameter combinations. From this best model, it will be used to predict the nutritional status class for the new data.Methods/Study design/approach: The dataset used is from Nutritional Status Monitoring in 2017 in Riau Province, Indonesia. From the dataset, the Knowledge Discovery in Database (KDD) stages were carried out to build several classification models in the form of decision trees. The decision tree that has the highest accuracy will then be selected to predict the class for the new data. Predictions for new data (unclassified data) will be made in a web-based system.Result/Findings: Particle Swarm Optimization is used to find optimal parameters. Before PSO is used, there are 213 parameters in the dataset that can be used to do classification. However, using many such parameters is time-consuming. After PSO is used, the optimal parameters found are the combination of 4 parameters, which can produce the most optimal decision tree. The 4 chosen parameters are gender, age (in months), height, and the way to measure the height (either stand up or lie down). The most optimal decision tree has an accuracy of 94.49%. From the most optimal decision tree, a web-based system was built to predict the class for new data (unclassified data).Novelty/Originality/Value: Particle Swarm Optimization (PSO) is a method that can help to select the most optimal parameters, or in other words produce the highest classification accuracy. The combination of parameters selected has also been confirmed by the nutritionist. The prediction system has been declared feasible to be used by nutritionists through the User Acceptance Test (UAT).","PeriodicalId":30781,"journal":{"name":"Scientific Journal of Informatics","volume":"107 ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2022-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Scientific Journal of Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.15294/sji.v9i1.33158","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Abstract. Purpose: This research was conducted to create a classification model in the form of the most optimal decision tree. Optimal in this case is the combination of parameters used that will produce the highest accuracy compared to other parameter combinations. From this best model, it will be used to predict the nutritional status class for the new data.Methods/Study design/approach: The dataset used is from Nutritional Status Monitoring in 2017 in Riau Province, Indonesia. From the dataset, the Knowledge Discovery in Database (KDD) stages were carried out to build several classification models in the form of decision trees. The decision tree that has the highest accuracy will then be selected to predict the class for the new data. Predictions for new data (unclassified data) will be made in a web-based system.Result/Findings: Particle Swarm Optimization is used to find optimal parameters. Before PSO is used, there are 213 parameters in the dataset that can be used to do classification. However, using many such parameters is time-consuming. After PSO is used, the optimal parameters found are the combination of 4 parameters, which can produce the most optimal decision tree. The 4 chosen parameters are gender, age (in months), height, and the way to measure the height (either stand up or lie down). The most optimal decision tree has an accuracy of 94.49%. From the most optimal decision tree, a web-based system was built to predict the class for new data (unclassified data).Novelty/Originality/Value: Particle Swarm Optimization (PSO) is a method that can help to select the most optimal parameters, or in other words produce the highest classification accuracy. The combination of parameters selected has also been confirmed by the nutritionist. The prediction system has been declared feasible to be used by nutritionists through the User Acceptance Test (UAT).
基于C4.5和粒子群优化的幼儿营养状况分类
摘要目的:本研究旨在以最优决策树的形式创建一个分类模型。在这种情况下,最佳的是所使用的参数组合,与其他参数组合相比,该组合将产生最高的精度。根据这个最佳模型,它将用于预测新数据的营养状况类别。方法/研究设计/方法:使用的数据集来自印度尼西亚廖内省2017年的营养状况监测。根据数据集,进行了数据库中的知识发现(KDD)阶段,以建立决策树形式的几个分类模型。然后将选择具有最高精度的决策树来预测新数据的类别。新数据(未分类数据)的预测将在基于网络的系统中进行。结果/发现:粒子群优化用于寻找最佳参数。在使用PSO之前,数据集中有213个参数可以用于进行分类。然而,使用许多这样的参数是耗时的。使用PSO后,找到的最优参数是4个参数的组合,可以生成最优化的决策树。选择的4个参数是性别、年龄(以月为单位)、身高和测量身高的方式(站起来或躺着)。最优决策树的准确率为94.49%。根据最优决策树,建立了一个基于网络的系统来预测新数据(未分类数据)的类别。新颖性/独创性/价值:粒子群优化(PSO)是一种可以帮助选择最优参数的方法,换句话说,可以产生最高的分类准确率。营养学家还确认了所选参数的组合。该预测系统已被宣布可供营养学家通过用户接受测试(UAT)使用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
13
审稿时长
24 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信