Machine Learning Model For Stunting Prediction

Jurnal Health Sains Pub Date : 2023-09-25 DOI:10.46799/jhs.v4i9.1073

Sutarmi Sutarmi, Warijan Warijan, Tavip Indrayana, Dwi P. Putro B, Indra Gunawan

{"title":"Machine Learning Model For Stunting Prediction","authors":"Sutarmi Sutarmi, Warijan Warijan, Tavip Indrayana, Dwi P. Putro B, Indra Gunawan","doi":"10.46799/jhs.v4i9.1073","DOIUrl":null,"url":null,"abstract":"This study aims to find the best Supervised Machine Learning (SML) model for stunting prediction. This research was conducted using an experimental approach using 192 infant data with a composition of 183 normal infant data and 9 stunted infant data using a custom dataset. The conclusion obtained from this study can be concluded that the combination of the Random Forest classification algorithm with Support Vector Machine Weighting and the Genetic Algorithm Feature Selection has the best performance. The parameters with the best performance are: The training and testing data distribution is 90% of the training data and 10% of the testing data. The number of trees in the random forest algorithm is 100, and the Gain Ratio criterion and max_depth is 10. In the Genetic Algorithm, the best parameters are: The Roulette Wheel selection method, the population is 20, the mutation value is 0.03, and the crossover value is 0.9. The validation method uses k-fold cross validation with a value of k = 10. Another conclusion is that there are 44 supporting factors for stunting, which, if we take a ranking of 10 in order of magnitude from largest to smallest, the supporting factors for stunting are 1.Baby's weight at birth. 2.Baby’s Height at Birth. 3.Number of meal per day. 4.Breast Milk. 5.Diarrhe times per 3 month. 6.Child development examination during covid by Health Worker at home. 7.Mother's age at birth. 8.Mother height at birth. 9.Number of sibling. 10.Age when the first food was given. This research has the disadvantage of no test on other datasets. So researchers do not know the reliability of findings is on different datasets","PeriodicalId":431769,"journal":{"name":"Jurnal Health Sains","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-09-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Jurnal Health Sains","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.46799/jhs.v4i9.1073","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

This study aims to find the best Supervised Machine Learning (SML) model for stunting prediction. This research was conducted using an experimental approach using 192 infant data with a composition of 183 normal infant data and 9 stunted infant data using a custom dataset. The conclusion obtained from this study can be concluded that the combination of the Random Forest classification algorithm with Support Vector Machine Weighting and the Genetic Algorithm Feature Selection has the best performance. The parameters with the best performance are: The training and testing data distribution is 90% of the training data and 10% of the testing data. The number of trees in the random forest algorithm is 100, and the Gain Ratio criterion and max_depth is 10. In the Genetic Algorithm, the best parameters are: The Roulette Wheel selection method, the population is 20, the mutation value is 0.03, and the crossover value is 0.9. The validation method uses k-fold cross validation with a value of k = 10. Another conclusion is that there are 44 supporting factors for stunting, which, if we take a ranking of 10 in order of magnitude from largest to smallest, the supporting factors for stunting are 1.Baby's weight at birth. 2.Baby’s Height at Birth. 3.Number of meal per day. 4.Breast Milk. 5.Diarrhe times per 3 month. 6.Child development examination during covid by Health Worker at home. 7.Mother's age at birth. 8.Mother height at birth. 9.Number of sibling. 10.Age when the first food was given. This research has the disadvantage of no test on other datasets. So researchers do not know the reliability of findings is on different datasets

查看原文本刊更多论文

发育迟缓预测的机器学习模型

本研究旨在寻找用于发育迟缓预测的最佳监督机器学习(SML)模型。本研究采用实验方法，使用自定义数据集，使用192个婴儿数据，其中183个正常婴儿数据和9个发育不良婴儿数据组成。从本研究得出的结论可以得出，支持向量机加权的随机森林分类算法与遗传算法特征选择相结合的方法具有最佳的性能。性能最好的参数有:训练和测试数据分布分别为训练数据的90%和测试数据的10%。随机森林算法中的树数为100，增益比准则和max_depth为10。在遗传算法中，最佳参数为:轮盘选择法，种群为20，突变值为0.03，交叉值为0.9。验证方法使用k-fold交叉验证，值k = 10。另一个结论是发育不良的支持因素有44个，如果按照从大到小的10个数量级排序，发育不良的支持因素为1。婴儿出生时的体重。2.婴儿出生时的身高。每天吃多少顿饭。4.母乳。每3个月腹泻数次。6.卫生工作者在家中对covid期间的儿童进行发育检查。7.母亲出生时的年龄。8.母亲出生时的身高。9.兄弟姐妹数目。10.第一次吃东西时的年龄。本研究的缺点是没有对其他数据集进行测试。因此，研究人员不知道研究结果的可靠性取决于不同的数据集

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Jurnal Health Sains

自引率

0.00%

发文量