Leigh Ann Starcevich , Hillary K. Burgess , Amy V. Uhrin
{"title":"优化海洋垃圾监测:在调查设计中平衡统计能力和试验规模","authors":"Leigh Ann Starcevich , Hillary K. Burgess , Amy V. Uhrin","doi":"10.1016/j.ecolind.2025.113807","DOIUrl":null,"url":null,"abstract":"<div><div>Conducting a power analysis is a best practice for designing long-term environmental monitoring programs so that trade-offs in survey design elements such as sample size and temporal revisits can be evaluated. However, these analyses are sensitive to <em>a priori</em> decisions about the underlying distribution of the data, appropriate model structure, and choice of trend test. Improper trend model specification and inappropriate trend tests can affect power estimates, leading to misguided choices of survey design elements, trend tests with inflated test size (i.e., increased probability of detecting a nonexistent trend), and management decisions based on misleading information.</div><div>In this simulation study, we conducted a power analysis to inform the design of a nationwide shoreline marine debris monitoring survey across 10 regional extents in the United States. Pilot data consisted of debris item counts having many zeros, transformed into item densities. We applied a generalized linear mixed model with a Tweedie conditional distribution to model simulated debris item density populations having known trend. We compared test size and power for three trend tests (Z-test, likelihood ratio test [LRT], and <em>t</em>-test), five levels of variance composition, five temporal revisit designs, and three levels of within-year replication for samples of 50 to 62 sites surveyed over an 11-year monitoring duration.</div><div>Test size and power were negatively affected when the year-to-year variation in debris item density was high. Revisit designs that included even a small panel of annually visited sites maintained test size close to the nominal level of 0.15. The LRT provided nominal trend test size in most cases except when the year-to-year variation in debris item density was high and in scenarios with sparse replication across years. In those cases, the LRT exhibited inflated test size and power, and the <em>t</em>-test provided a more conservative, but low power, test of trend. The LRT had slightly inflated test size when revisit designs without an annual panel of sites were used.</div><div>Our results demonstrate the importance of assessing both test size and power when assessing monitoring design choices to ensure that accurate trend inferences can be made.</div></div>","PeriodicalId":11459,"journal":{"name":"Ecological Indicators","volume":"178 ","pages":"Article 113807"},"PeriodicalIF":7.0000,"publicationDate":"2025-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Optimizing marine debris monitoring: balancing statistical power and test size in survey design\",\"authors\":\"Leigh Ann Starcevich , Hillary K. Burgess , Amy V. Uhrin\",\"doi\":\"10.1016/j.ecolind.2025.113807\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Conducting a power analysis is a best practice for designing long-term environmental monitoring programs so that trade-offs in survey design elements such as sample size and temporal revisits can be evaluated. However, these analyses are sensitive to <em>a priori</em> decisions about the underlying distribution of the data, appropriate model structure, and choice of trend test. Improper trend model specification and inappropriate trend tests can affect power estimates, leading to misguided choices of survey design elements, trend tests with inflated test size (i.e., increased probability of detecting a nonexistent trend), and management decisions based on misleading information.</div><div>In this simulation study, we conducted a power analysis to inform the design of a nationwide shoreline marine debris monitoring survey across 10 regional extents in the United States. Pilot data consisted of debris item counts having many zeros, transformed into item densities. We applied a generalized linear mixed model with a Tweedie conditional distribution to model simulated debris item density populations having known trend. We compared test size and power for three trend tests (Z-test, likelihood ratio test [LRT], and <em>t</em>-test), five levels of variance composition, five temporal revisit designs, and three levels of within-year replication for samples of 50 to 62 sites surveyed over an 11-year monitoring duration.</div><div>Test size and power were negatively affected when the year-to-year variation in debris item density was high. Revisit designs that included even a small panel of annually visited sites maintained test size close to the nominal level of 0.15. The LRT provided nominal trend test size in most cases except when the year-to-year variation in debris item density was high and in scenarios with sparse replication across years. In those cases, the LRT exhibited inflated test size and power, and the <em>t</em>-test provided a more conservative, but low power, test of trend. The LRT had slightly inflated test size when revisit designs without an annual panel of sites were used.</div><div>Our results demonstrate the importance of assessing both test size and power when assessing monitoring design choices to ensure that accurate trend inferences can be made.</div></div>\",\"PeriodicalId\":11459,\"journal\":{\"name\":\"Ecological Indicators\",\"volume\":\"178 \",\"pages\":\"Article 113807\"},\"PeriodicalIF\":7.0000,\"publicationDate\":\"2025-07-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Ecological Indicators\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1470160X2500737X\",\"RegionNum\":2,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENVIRONMENTAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ecological Indicators","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1470160X2500737X","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
Optimizing marine debris monitoring: balancing statistical power and test size in survey design
Conducting a power analysis is a best practice for designing long-term environmental monitoring programs so that trade-offs in survey design elements such as sample size and temporal revisits can be evaluated. However, these analyses are sensitive to a priori decisions about the underlying distribution of the data, appropriate model structure, and choice of trend test. Improper trend model specification and inappropriate trend tests can affect power estimates, leading to misguided choices of survey design elements, trend tests with inflated test size (i.e., increased probability of detecting a nonexistent trend), and management decisions based on misleading information.
In this simulation study, we conducted a power analysis to inform the design of a nationwide shoreline marine debris monitoring survey across 10 regional extents in the United States. Pilot data consisted of debris item counts having many zeros, transformed into item densities. We applied a generalized linear mixed model with a Tweedie conditional distribution to model simulated debris item density populations having known trend. We compared test size and power for three trend tests (Z-test, likelihood ratio test [LRT], and t-test), five levels of variance composition, five temporal revisit designs, and three levels of within-year replication for samples of 50 to 62 sites surveyed over an 11-year monitoring duration.
Test size and power were negatively affected when the year-to-year variation in debris item density was high. Revisit designs that included even a small panel of annually visited sites maintained test size close to the nominal level of 0.15. The LRT provided nominal trend test size in most cases except when the year-to-year variation in debris item density was high and in scenarios with sparse replication across years. In those cases, the LRT exhibited inflated test size and power, and the t-test provided a more conservative, but low power, test of trend. The LRT had slightly inflated test size when revisit designs without an annual panel of sites were used.
Our results demonstrate the importance of assessing both test size and power when assessing monitoring design choices to ensure that accurate trend inferences can be made.
期刊介绍:
The ultimate aim of Ecological Indicators is to integrate the monitoring and assessment of ecological and environmental indicators with management practices. The journal provides a forum for the discussion of the applied scientific development and review of traditional indicator approaches as well as for theoretical, modelling and quantitative applications such as index development. Research into the following areas will be published.
• All aspects of ecological and environmental indicators and indices.
• New indicators, and new approaches and methods for indicator development, testing and use.
• Development and modelling of indices, e.g. application of indicator suites across multiple scales and resources.
• Analysis and research of resource, system- and scale-specific indicators.
• Methods for integration of social and other valuation metrics for the production of scientifically rigorous and politically-relevant assessments using indicator-based monitoring and assessment programs.
• How research indicators can be transformed into direct application for management purposes.
• Broader assessment objectives and methods, e.g. biodiversity, biological integrity, and sustainability, through the use of indicators.
• Resource-specific indicators such as landscape, agroecosystems, forests, wetlands, etc.