Andrea Sosa-Moreno, Gwenyth O. Lee, Zhenke Wu, S. Aya Fanny, Gabriel Trueba, Philip J. Cooper, Karen Levy and Joseph N. S. Eisenberg*,
{"title":"Comparing the Power of Low vs High-Precision Methods for Measuring E. coli in Drinking Water in Low-Resource Settings","authors":"Andrea Sosa-Moreno, Gwenyth O. Lee, Zhenke Wu, S. Aya Fanny, Gabriel Trueba, Philip J. Cooper, Karen Levy and Joseph N. S. Eisenberg*, ","doi":"10.1021/acsestwater.4c0111710.1021/acsestwater.4c01117","DOIUrl":null,"url":null,"abstract":"<p >Methods to measure <i>Escherichia coli</i> concentrations in water vary in precision, complexity, and cost. Low-precision methods are more affordable, faster, and simpler to implement in low-resource settings but may reduce statistical power. We compared the statistical power of low- and high-precision methods using data from UNICEF’s Multiple Indicator Cluster Surveys across 11 low-income regions, and from a birth cohort study in Ecuador. Both data sets included continuous <i>E. coli</i> concentrations from high-precision methods, which we categorized to emulate low-precision methods outcomes. Using logistic regression, we modeled associations between water quality and two dichotomous outcomes: water treatment (treated/untreated) and water storage (stored/not stored). We compared the sample size needed to reach 80% power for detecting statistically significant differences between these groups. Power was calculated using a bootstrap-based algorithm. Compared to continuous measures, categorizing <i>E. coli</i> concentrations required 10–90% larger sample sizes in treatment models and about 10% in storage models, except in regions with good water quality, where similar or lower sample sizes were sufficient. Our findings indicate that low-precision methods can reliably infer associations between water practices and water quality but often require larger sample sizes, highlighting a trade-off between cost and statistical power in resource-limited settings.</p>","PeriodicalId":93847,"journal":{"name":"ACS ES&T water","volume":"5 5","pages":"2244–2254 2244–2254"},"PeriodicalIF":4.8000,"publicationDate":"2025-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS ES&T water","FirstCategoryId":"1085","ListUrlMain":"https://pubs.acs.org/doi/10.1021/acsestwater.4c01117","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Methods to measure Escherichia coli concentrations in water vary in precision, complexity, and cost. Low-precision methods are more affordable, faster, and simpler to implement in low-resource settings but may reduce statistical power. We compared the statistical power of low- and high-precision methods using data from UNICEF’s Multiple Indicator Cluster Surveys across 11 low-income regions, and from a birth cohort study in Ecuador. Both data sets included continuous E. coli concentrations from high-precision methods, which we categorized to emulate low-precision methods outcomes. Using logistic regression, we modeled associations between water quality and two dichotomous outcomes: water treatment (treated/untreated) and water storage (stored/not stored). We compared the sample size needed to reach 80% power for detecting statistically significant differences between these groups. Power was calculated using a bootstrap-based algorithm. Compared to continuous measures, categorizing E. coli concentrations required 10–90% larger sample sizes in treatment models and about 10% in storage models, except in regions with good water quality, where similar or lower sample sizes were sufficient. Our findings indicate that low-precision methods can reliably infer associations between water practices and water quality but often require larger sample sizes, highlighting a trade-off between cost and statistical power in resource-limited settings.