{"title":"Incorrect inference from size-selectivity studies due to widespread misuse of bootstrap confidence bands","authors":"Russell B. Millar , Matt K. Broadhurst","doi":"10.1016/j.fishres.2024.107225","DOIUrl":null,"url":null,"abstract":"<div><div>The bootstrap is now widely used in size-selectivity experiments to quantify the uncertainty in estimated absolute or relative selectivity curves. A two-level hierarchical bootstrap is commonly used, with deployments being resampled at the first level followed by resampling of individual fish at the second level. It is standard practice to construct plots of the estimated curves with an overlay of the bootstrap confidence intervals for each length, thereby forming a bootstrap confidence band. These confidence bands are routinely used to perform statistical tests, most often to test the null hypothesis that two gears have the same length-dependent selectivity and/or fishing efficiency. Using computer simulation, it is shown here that the actual Type 1 error (non-coverage) rate of the bootstrap confidence intervals can vary considerably depending on the experimental scenario. The bootstrap confidence band test was found to inflate the Type 1 error rate of the confidence intervals by a factor of about five due to the use of multiple confidence intervals in the construction of a confidence band. Moreover, the bootstrap confidence band test can have very low statistical power, as demonstrated with an example in which a permutation test has 97% power to detect a difference between two fishing gears, whereas the bootstrap confidence band test has power of just 68%. The permutation test is recommended for testing differences between the size selectivity of gears and its ease of implementation is demonstrated. The misuse of bootstrap confidence bands has recently propagated to other areas of fishing-gear selectivity research, including assessment of differences in species selectivity between gears, and this is another situation where permutation tests would be straightforward to implement.</div></div>","PeriodicalId":50443,"journal":{"name":"Fisheries Research","volume":"281 ","pages":"Article 107225"},"PeriodicalIF":2.2000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Fisheries Research","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0165783624002893","RegionNum":2,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"FISHERIES","Score":null,"Total":0}
引用次数: 0
Abstract
The bootstrap is now widely used in size-selectivity experiments to quantify the uncertainty in estimated absolute or relative selectivity curves. A two-level hierarchical bootstrap is commonly used, with deployments being resampled at the first level followed by resampling of individual fish at the second level. It is standard practice to construct plots of the estimated curves with an overlay of the bootstrap confidence intervals for each length, thereby forming a bootstrap confidence band. These confidence bands are routinely used to perform statistical tests, most often to test the null hypothesis that two gears have the same length-dependent selectivity and/or fishing efficiency. Using computer simulation, it is shown here that the actual Type 1 error (non-coverage) rate of the bootstrap confidence intervals can vary considerably depending on the experimental scenario. The bootstrap confidence band test was found to inflate the Type 1 error rate of the confidence intervals by a factor of about five due to the use of multiple confidence intervals in the construction of a confidence band. Moreover, the bootstrap confidence band test can have very low statistical power, as demonstrated with an example in which a permutation test has 97% power to detect a difference between two fishing gears, whereas the bootstrap confidence band test has power of just 68%. The permutation test is recommended for testing differences between the size selectivity of gears and its ease of implementation is demonstrated. The misuse of bootstrap confidence bands has recently propagated to other areas of fishing-gear selectivity research, including assessment of differences in species selectivity between gears, and this is another situation where permutation tests would be straightforward to implement.
期刊介绍:
This journal provides an international forum for the publication of papers in the areas of fisheries science, fishing technology, fisheries management and relevant socio-economics. The scope covers fisheries in salt, brackish and freshwater systems, and all aspects of associated ecology, environmental aspects of fisheries, and economics. Both theoretical and practical papers are acceptable, including laboratory and field experimental studies relevant to fisheries. Papers on the conservation of exploitable living resources are welcome. Review and Viewpoint articles are also published. As the specified areas inevitably impinge on and interrelate with each other, the approach of the journal is multidisciplinary, and authors are encouraged to emphasise the relevance of their own work to that of other disciplines. The journal is intended for fisheries scientists, biological oceanographers, gear technologists, economists, managers, administrators, policy makers and legislators.