特征选择和关联规则学习确定埃塞俄比亚学童营养不良的风险因素

Frontiers in epidemiology Pub Date : 2023-07-06 eCollection Date: 2023-01-01 DOI:10.3389/fepid.2023.1150619
William A Russel, Jim Perry, Claire Bonzani, Amanda Dontino, Zeleke Mekonnen, Ahmet Ay, Bineyam Taye
{"title":"特征选择和关联规则学习确定埃塞俄比亚学童营养不良的风险因素","authors":"William A Russel, Jim Perry, Claire Bonzani, Amanda Dontino, Zeleke Mekonnen, Ahmet Ay, Bineyam Taye","doi":"10.3389/fepid.2023.1150619","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>Previous studies have sought to identify risk factors for malnutrition in populations of schoolchildren, depending on traditional logistic regression methods. However, holistic machine learning (ML) approaches are emerging that may provide a more comprehensive analysis of risk factors.</p><p><strong>Methods: </strong>This study employed feature selection and association rule learning ML methods in conjunction with logistic regression on epidemiological survey data from 1,036 Ethiopian school children. Our first analysis used the entire dataset and then we reran this analysis on age, residence, and sex population subsets.</p><p><strong>Results: </strong>Both logistic regression and ML methods identified older childhood age as a significant risk factor, while females and vaccinated individuals showed reduced odds of stunting. Our machine learning analyses provided additional insights into the data, as feature selection identified that age, school latrine cleanliness, large family size, and nail trimming habits were significant risk factors for stunting, underweight, and thinness. Association rule learning revealed an association between co-occurring hygiene and socio-economical variables with malnutrition that was otherwise missed using traditional statistical methods.</p><p><strong>Discussion: </strong>Our analysis supports the benefit of integrating feature selection methods, association rules learning techniques, and logistic regression to identify comprehensive risk factors associated with malnutrition in young children.</p>","PeriodicalId":73083,"journal":{"name":"Frontiers in epidemiology","volume":" ","pages":"1150619"},"PeriodicalIF":0.0000,"publicationDate":"2023-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10910994/pdf/","citationCount":"0","resultStr":"{\"title\":\"Feature selection and association rule learning identify risk factors of malnutrition among Ethiopian schoolchildren.\",\"authors\":\"William A Russel, Jim Perry, Claire Bonzani, Amanda Dontino, Zeleke Mekonnen, Ahmet Ay, Bineyam Taye\",\"doi\":\"10.3389/fepid.2023.1150619\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Introduction: </strong>Previous studies have sought to identify risk factors for malnutrition in populations of schoolchildren, depending on traditional logistic regression methods. However, holistic machine learning (ML) approaches are emerging that may provide a more comprehensive analysis of risk factors.</p><p><strong>Methods: </strong>This study employed feature selection and association rule learning ML methods in conjunction with logistic regression on epidemiological survey data from 1,036 Ethiopian school children. Our first analysis used the entire dataset and then we reran this analysis on age, residence, and sex population subsets.</p><p><strong>Results: </strong>Both logistic regression and ML methods identified older childhood age as a significant risk factor, while females and vaccinated individuals showed reduced odds of stunting. Our machine learning analyses provided additional insights into the data, as feature selection identified that age, school latrine cleanliness, large family size, and nail trimming habits were significant risk factors for stunting, underweight, and thinness. Association rule learning revealed an association between co-occurring hygiene and socio-economical variables with malnutrition that was otherwise missed using traditional statistical methods.</p><p><strong>Discussion: </strong>Our analysis supports the benefit of integrating feature selection methods, association rules learning techniques, and logistic regression to identify comprehensive risk factors associated with malnutrition in young children.</p>\",\"PeriodicalId\":73083,\"journal\":{\"name\":\"Frontiers in epidemiology\",\"volume\":\" \",\"pages\":\"1150619\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-07-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10910994/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Frontiers in epidemiology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3389/fepid.2023.1150619\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2023/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in epidemiology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/fepid.2023.1150619","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2023/1/1 0:00:00","PubModel":"eCollection","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

以往的研究都是依靠传统的逻辑回归方法,试图确定学龄儿童营养不良的危险因素。然而,整体机器学习(ML)方法正在出现,可以提供更全面的风险因素分析。本研究采用特征选择和关联规则学习ML方法,并结合逻辑回归对来自1,036名埃塞俄比亚学龄儿童的流行病学调查数据进行分析。我们的第一次分析使用了整个数据集,然后我们对年龄、居住地和性别人口子集进行了重新分析。逻辑回归和ML方法都确定儿童年龄较大是重要的危险因素,而女性和接种疫苗的个体发育迟缓的几率降低。我们的机器学习分析为数据提供了额外的见解,因为特征选择确定了年龄、学校厕所清洁度、大家庭规模和修剪指甲的习惯是发育迟缓、体重不足和消瘦的重要风险因素。关联规则学习揭示了共同发生的卫生和社会经济变量与营养不良之间的关联,否则使用传统的统计方法会错过。我们的分析支持整合特征选择方法、关联规则学习技术和逻辑回归的好处,以确定与幼儿营养不良相关的综合危险因素。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Feature selection and association rule learning identify risk factors of malnutrition among Ethiopian schoolchildren.

Introduction: Previous studies have sought to identify risk factors for malnutrition in populations of schoolchildren, depending on traditional logistic regression methods. However, holistic machine learning (ML) approaches are emerging that may provide a more comprehensive analysis of risk factors.

Methods: This study employed feature selection and association rule learning ML methods in conjunction with logistic regression on epidemiological survey data from 1,036 Ethiopian school children. Our first analysis used the entire dataset and then we reran this analysis on age, residence, and sex population subsets.

Results: Both logistic regression and ML methods identified older childhood age as a significant risk factor, while females and vaccinated individuals showed reduced odds of stunting. Our machine learning analyses provided additional insights into the data, as feature selection identified that age, school latrine cleanliness, large family size, and nail trimming habits were significant risk factors for stunting, underweight, and thinness. Association rule learning revealed an association between co-occurring hygiene and socio-economical variables with malnutrition that was otherwise missed using traditional statistical methods.

Discussion: Our analysis supports the benefit of integrating feature selection methods, association rules learning techniques, and logistic regression to identify comprehensive risk factors associated with malnutrition in young children.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信