利用机器学习识别在以人口为基础的水、卫生、洗手和营养干预中预期收益最高的子群体。

Caitlin Hemlock, Laura H Kwong, Lia C H Fernald, Alan E Hubbard, John M Colford, Fahmida Tofail, Md Mahbubur Rahman, Sarker Parvez, Stephen P Luby, Andrew N Mertens
{"title":"利用机器学习识别在以人口为基础的水、卫生、洗手和营养干预中预期收益最高的子群体。","authors":"Caitlin Hemlock, Laura H Kwong, Lia C H Fernald, Alan E Hubbard, John M Colford, Fahmida Tofail, Md Mahbubur Rahman, Sarker Parvez, Stephen P Luby, Andrew N Mertens","doi":"10.1101/2025.06.17.25329796","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Understanding who benefits most from investments in water, sanitation, and hygiene (WaSH) interventions can elucidate causal pathways, uncover complex interactions between population characteristics and interventions, and inform targeted implementation. We applied machine learning to identify and describe households of children that benefited most from WaSH and nutrition interventions.</p><p><strong>Methods: </strong>We used causal forests and baseline characteristics of pregnant women enroled in a trial in Bangladesh (2013-2015) to test for heterogenous treatment effects of the primary trial outcomes at two years (length-for-age Z-score [LAZ-score] and diarrhoea prevalence) and one secondary outcome (child development [EASQ Z-score]) for each treatment-outcome combination. We split households into three groups based on predicted treatment effect magnitude and compared characteristics of those that benefitted the most (Tercile 3) versus the least (Tercile 1).</p><p><strong>Results: </strong>Heterogeneity was detected in the effect of Sanitation on EASQ Z-score, compared to Control; children in Tercile 3 were estimated to gain 0.51 SD (95% CI: 0.35, 0.67) whereas children in Tercile 1 were estimated to have no benefit. At baseline, households of children in Tercile 3 were more likely to report that chickens always entered the house (85% vs. 4%) and had animal feces observed in the child's play area (84% vs. 18%) when compared with Tercile 1. Tercile 3 households also owned less land and assets and lived further from Dhaka, any population center, or a market. We did not detect heterogeneity for any other treatment-outcome comparison.</p><p><strong>Conclusions: </strong>We did not detect heterogeneity in any treatment arms for the outcomes of diarrhoea or LAZ-score, showing that children from all backgrounds benefit from effective interventions equally based on household characteristics. We found heterogeneity in the effect of receiving sanitation improvements on child development, where poorer households located in more remote areas and potentially with higher levels of animal fecal contamination had the highest expected benefit.</p>","PeriodicalId":94281,"journal":{"name":"medRxiv : the preprint server for health sciences","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12204266/pdf/","citationCount":"0","resultStr":"{\"title\":\"Using machine learning to identify subgroups with the highest expected benefit in a population-based water, sanitation, handwashing, and nutrition intervention.\",\"authors\":\"Caitlin Hemlock, Laura H Kwong, Lia C H Fernald, Alan E Hubbard, John M Colford, Fahmida Tofail, Md Mahbubur Rahman, Sarker Parvez, Stephen P Luby, Andrew N Mertens\",\"doi\":\"10.1101/2025.06.17.25329796\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Understanding who benefits most from investments in water, sanitation, and hygiene (WaSH) interventions can elucidate causal pathways, uncover complex interactions between population characteristics and interventions, and inform targeted implementation. We applied machine learning to identify and describe households of children that benefited most from WaSH and nutrition interventions.</p><p><strong>Methods: </strong>We used causal forests and baseline characteristics of pregnant women enroled in a trial in Bangladesh (2013-2015) to test for heterogenous treatment effects of the primary trial outcomes at two years (length-for-age Z-score [LAZ-score] and diarrhoea prevalence) and one secondary outcome (child development [EASQ Z-score]) for each treatment-outcome combination. We split households into three groups based on predicted treatment effect magnitude and compared characteristics of those that benefitted the most (Tercile 3) versus the least (Tercile 1).</p><p><strong>Results: </strong>Heterogeneity was detected in the effect of Sanitation on EASQ Z-score, compared to Control; children in Tercile 3 were estimated to gain 0.51 SD (95% CI: 0.35, 0.67) whereas children in Tercile 1 were estimated to have no benefit. At baseline, households of children in Tercile 3 were more likely to report that chickens always entered the house (85% vs. 4%) and had animal feces observed in the child's play area (84% vs. 18%) when compared with Tercile 1. Tercile 3 households also owned less land and assets and lived further from Dhaka, any population center, or a market. We did not detect heterogeneity for any other treatment-outcome comparison.</p><p><strong>Conclusions: </strong>We did not detect heterogeneity in any treatment arms for the outcomes of diarrhoea or LAZ-score, showing that children from all backgrounds benefit from effective interventions equally based on household characteristics. We found heterogeneity in the effect of receiving sanitation improvements on child development, where poorer households located in more remote areas and potentially with higher levels of animal fecal contamination had the highest expected benefit.</p>\",\"PeriodicalId\":94281,\"journal\":{\"name\":\"medRxiv : the preprint server for health sciences\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-06-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12204266/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"medRxiv : the preprint server for health sciences\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1101/2025.06.17.25329796\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"medRxiv : the preprint server for health sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2025.06.17.25329796","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

背景:了解谁从水、环境卫生和个人卫生(WaSH)干预措施的投资中获益最多,可以阐明因果关系,揭示人口特征与干预措施之间复杂的相互作用,并为有针对性的实施提供信息。我们应用机器学习来识别和描述从WaSH和营养干预中受益最多的儿童家庭。方法:我们在孟加拉国(2013-2015年)的一项试验中使用了孕妇的因果森林和基线特征,以检验两年内主要试验结果(年龄长度Z-score [LAZ-score]和腹泻患病率)和一个次要结局(儿童发育[EASQ Z-score])对每个治疗结局组合的异质性治疗效果。我们根据预测的治疗效果大小将家庭分为三组,并比较了受益最多(Tercile 3)和受益最少(Tercile 1)的家庭的特征。结果:与对照组相比,卫生条件对EASQ z得分的影响存在异质性;Tercile 3组的儿童估计获得了0.51 SD (95% CI: 0.35, 0.67),而Tercile 1组的儿童估计没有获益。在基线时,与Tercile 1相比,Tercile 3的儿童家庭更有可能报告总是有鸡进入家中(85%对4%),并且在儿童游乐区观察到动物粪便(84%对18%)。贫困家庭拥有的土地和资产也更少,他们住的地方离达卡、任何人口中心或市场都更远。我们没有发现任何其他治疗结果比较的异质性。结论:我们没有发现任何治疗组中腹泻或laz评分结果的异质性,表明基于家庭特征,所有背景的儿童都能从有效的干预措施中受益。我们发现,卫生设施改善对儿童发展的影响存在异质性,位于偏远地区、动物粪便污染程度可能较高的贫困家庭的预期效益最高。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Using machine learning to identify subgroups with the highest expected benefit in a population-based water, sanitation, handwashing, and nutrition intervention.

Background: Understanding who benefits most from investments in water, sanitation, and hygiene (WaSH) interventions can elucidate causal pathways, uncover complex interactions between population characteristics and interventions, and inform targeted implementation. We applied machine learning to identify and describe households of children that benefited most from WaSH and nutrition interventions.

Methods: We used causal forests and baseline characteristics of pregnant women enroled in a trial in Bangladesh (2013-2015) to test for heterogenous treatment effects of the primary trial outcomes at two years (length-for-age Z-score [LAZ-score] and diarrhoea prevalence) and one secondary outcome (child development [EASQ Z-score]) for each treatment-outcome combination. We split households into three groups based on predicted treatment effect magnitude and compared characteristics of those that benefitted the most (Tercile 3) versus the least (Tercile 1).

Results: Heterogeneity was detected in the effect of Sanitation on EASQ Z-score, compared to Control; children in Tercile 3 were estimated to gain 0.51 SD (95% CI: 0.35, 0.67) whereas children in Tercile 1 were estimated to have no benefit. At baseline, households of children in Tercile 3 were more likely to report that chickens always entered the house (85% vs. 4%) and had animal feces observed in the child's play area (84% vs. 18%) when compared with Tercile 1. Tercile 3 households also owned less land and assets and lived further from Dhaka, any population center, or a market. We did not detect heterogeneity for any other treatment-outcome comparison.

Conclusions: We did not detect heterogeneity in any treatment arms for the outcomes of diarrhoea or LAZ-score, showing that children from all backgrounds benefit from effective interventions equally based on household characteristics. We found heterogeneity in the effect of receiving sanitation improvements on child development, where poorer households located in more remote areas and potentially with higher levels of animal fecal contamination had the highest expected benefit.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信