{"title":"猫和颈椎外吸收:统计学考虑:致编辑的信“颈椎外吸收的潜在易感性特征:一项观察性研究(Patel et al., 2025)”。","authors":"Nasir Z. Bashir, Eduardo Bernabé","doi":"10.1111/iej.14258","DOIUrl":null,"url":null,"abstract":"<p>We have read with some intrigue the recent article by Patel et al., <span>2025</span>, which investigates putative risk factors for external cervical resorption (ECR). In a subsequent letter, Tay & Cooray, <span>2025</span>, provide a fantastic exposition on why basic consideration of epidemiological methods means that drawing reliable inferences from this article is incredibly difficult. In addition to these concerns, we would like to highlight issues pertaining to the statistical methods used:</p><p>The way in which the authors present their power calculations is an example of ‘cut-and-paste’ statistics (White et al., <span>2022</span>). They claim ‘For a sample size of 180 patients, a power of 95% would be achieved to detect differences between two independent proportions, assuming a level of confidence of 95%’.</p><p>It is evident that power is dependent on what the investigator specifies p<sub>1</sub> and p<sub>2</sub> to be. The authors have not described what they assumed these to be, nor is there any mention of what the desired minimum detectable effect is, that is, p<sub>2</sub>–p<sub>1</sub>. Hence, it is completely unclear <i>what effect</i> the authors are claiming they have 95% power to detect.</p><p>For readers who wish to run the power calculation analogous to that which the authors present, we provide the following <b>R</b> code:\n </p><p>The output of which indicates the authors have a power to detect a difference of 0.26 between proportions in independent groups, assuming p<sub>1</sub> = 0.5. The differences in proportions between the groups observed in the study sample are generally far smaller than 0.26.</p><p>The study sample was stratified into individuals with no identifiable factor, a single factor, or multiple factors. Whilst stratification in this manner is not inherently an issue, statistical tests carried out should be interpreted appropriately. In figure 4, there are 15 proposed risk factors. This means there are 2<sup>15</sup> (= 31 768) possible combinations of risk factors which an individual may have. Taken to the logical extreme, should the authors not run all 31 768 tests and report all significant findings? Effectively, figures 6 through 8 suggest a less extreme version of this was carried out, with 15 tests per figure, resulting in (at a minimum) a total of 45 significance tests. Assuming a type I error rate of 0.05, this means that 2 to 3 of the reported significant findings are likely false positives. Tay & Cooray, <span>2025</span>, suggest the use of correction for multiple testing to control the false discovery rate. There have been back-and-forth debates on the need to adjust for multiple testing for several decades, and we are not necessarily arguing that it should or should not have been done in this paper (Rothman, <span>1990</span>). What we are stating, however, is that the issues around multiple testing should have been brought up and transparently discussed.</p><p>The authors claim that ‘the aim of this study was to investigate potential predisposing factors associated with ECR’. All of the individuals in this study have ECR, so it is unclear exactly how this aim will be addressed, without comparison to individuals who do not have ECR.</p><p>This inappropriate study sample means that the null hypothesis in these significance tests which the authors carry out is <span>not</span> that the predisposing factors are associated with no increase in risk of ECR, that is, the intuitive null hypothesis which one typically assumes. Given that the study sample is patients with ECR, the findings of the statistical tests are to be interpreted as conditional on the presence of disease. For example, consider the reported result that ‘There was a significant association between cat ownership and ECR in the mandible (23.6%, 25/106 teeth, <i>p</i> = .002)’. This statement is not disingenuous, but it is still prone to misinterpretation by the reader. The null hypothesis being tested here is: <span>amongst individuals with ECR</span>, is cat ownership associated with jaw location? In other words, we are not being told if cat ownership is associated with an increased risk of ECR. We are being told, if you have ECR, is cat ownership associated with a difference in whether the ECR is incident in the upper or lower jaw? This hypothesis test seems effectively irrelevant to clinical practice.</p><p>More generally, the null hypotheses being across the statistical tests used were, ‘amongst individuals with ECR, is there an association between factor X and jaw location?’ Where factor X may be age, sex, cat ownership, number of putative risk factors etc. Again, it is completely unclear why such a test would be informative for clinical practice.</p><p>Finally, and most obviously, the authors give only cursory consideration as to how cat ownership in their study sample relates to the population from which they are sampling. Of course, one should not forget (as Tay & Cooray, <span>2025</span>, mention) naïve comparisons are prone to confounding and do not account for any causal structure. However, all the tests carried out in the article are done in this manner, so we feel equally justified in doing so.</p><p>The authors collected their data during the period from 2017 to 2022, and national statistics from Cats Protection indicate that the proportion of cat-owning households in Greater London in 2021 was 26% (Cats Protection, <span>2021</span>). Assuming the authors have taken a representative sample of patients with ECR, this means the prevalence of cat-owning amongst patients with ECR is over 10% lower than that of the population from which they were sampled. Of course, this is a liberal estimate as one may need to correct for repeated measures. There were 194 patients and 215 teeth, so the upper bound on the proportion of cat-owning ECR patients is 34/194 = 17.5% (95% CI: 12.2% to 22.9%); still much lower than the London population prevalence of cat-owning households.</p><p>Based on the above data, should we then ask: does cat-owning prevent ECR?</p><p>Nasir Z. Bashir: Conceptualization, design, data acquisition and interpretation, drafting and critically revising the manuscript. Eduardo Bernabé: Drafting and critically revising the manuscript.</p><p>NZB is supported by the Wellcome Trust (322 777/Z/24/Z).</p><p>The authors declare no conflicts of interest.</p><p>No ethical approval was required for this work.</p>","PeriodicalId":13724,"journal":{"name":"International endodontic journal","volume":"58 8","pages":"1277-1279"},"PeriodicalIF":7.1000,"publicationDate":"2025-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/iej.14258","citationCount":"0","resultStr":"{\"title\":\"Cats and external cervical resorption: Statistical considerations\",\"authors\":\"Nasir Z. Bashir, Eduardo Bernabé\",\"doi\":\"10.1111/iej.14258\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>We have read with some intrigue the recent article by Patel et al., <span>2025</span>, which investigates putative risk factors for external cervical resorption (ECR). In a subsequent letter, Tay & Cooray, <span>2025</span>, provide a fantastic exposition on why basic consideration of epidemiological methods means that drawing reliable inferences from this article is incredibly difficult. In addition to these concerns, we would like to highlight issues pertaining to the statistical methods used:</p><p>The way in which the authors present their power calculations is an example of ‘cut-and-paste’ statistics (White et al., <span>2022</span>). They claim ‘For a sample size of 180 patients, a power of 95% would be achieved to detect differences between two independent proportions, assuming a level of confidence of 95%’.</p><p>It is evident that power is dependent on what the investigator specifies p<sub>1</sub> and p<sub>2</sub> to be. The authors have not described what they assumed these to be, nor is there any mention of what the desired minimum detectable effect is, that is, p<sub>2</sub>–p<sub>1</sub>. Hence, it is completely unclear <i>what effect</i> the authors are claiming they have 95% power to detect.</p><p>For readers who wish to run the power calculation analogous to that which the authors present, we provide the following <b>R</b> code:\\n </p><p>The output of which indicates the authors have a power to detect a difference of 0.26 between proportions in independent groups, assuming p<sub>1</sub> = 0.5. The differences in proportions between the groups observed in the study sample are generally far smaller than 0.26.</p><p>The study sample was stratified into individuals with no identifiable factor, a single factor, or multiple factors. Whilst stratification in this manner is not inherently an issue, statistical tests carried out should be interpreted appropriately. In figure 4, there are 15 proposed risk factors. This means there are 2<sup>15</sup> (= 31 768) possible combinations of risk factors which an individual may have. Taken to the logical extreme, should the authors not run all 31 768 tests and report all significant findings? Effectively, figures 6 through 8 suggest a less extreme version of this was carried out, with 15 tests per figure, resulting in (at a minimum) a total of 45 significance tests. Assuming a type I error rate of 0.05, this means that 2 to 3 of the reported significant findings are likely false positives. Tay & Cooray, <span>2025</span>, suggest the use of correction for multiple testing to control the false discovery rate. There have been back-and-forth debates on the need to adjust for multiple testing for several decades, and we are not necessarily arguing that it should or should not have been done in this paper (Rothman, <span>1990</span>). What we are stating, however, is that the issues around multiple testing should have been brought up and transparently discussed.</p><p>The authors claim that ‘the aim of this study was to investigate potential predisposing factors associated with ECR’. All of the individuals in this study have ECR, so it is unclear exactly how this aim will be addressed, without comparison to individuals who do not have ECR.</p><p>This inappropriate study sample means that the null hypothesis in these significance tests which the authors carry out is <span>not</span> that the predisposing factors are associated with no increase in risk of ECR, that is, the intuitive null hypothesis which one typically assumes. Given that the study sample is patients with ECR, the findings of the statistical tests are to be interpreted as conditional on the presence of disease. For example, consider the reported result that ‘There was a significant association between cat ownership and ECR in the mandible (23.6%, 25/106 teeth, <i>p</i> = .002)’. This statement is not disingenuous, but it is still prone to misinterpretation by the reader. The null hypothesis being tested here is: <span>amongst individuals with ECR</span>, is cat ownership associated with jaw location? In other words, we are not being told if cat ownership is associated with an increased risk of ECR. We are being told, if you have ECR, is cat ownership associated with a difference in whether the ECR is incident in the upper or lower jaw? This hypothesis test seems effectively irrelevant to clinical practice.</p><p>More generally, the null hypotheses being across the statistical tests used were, ‘amongst individuals with ECR, is there an association between factor X and jaw location?’ Where factor X may be age, sex, cat ownership, number of putative risk factors etc. Again, it is completely unclear why such a test would be informative for clinical practice.</p><p>Finally, and most obviously, the authors give only cursory consideration as to how cat ownership in their study sample relates to the population from which they are sampling. Of course, one should not forget (as Tay & Cooray, <span>2025</span>, mention) naïve comparisons are prone to confounding and do not account for any causal structure. However, all the tests carried out in the article are done in this manner, so we feel equally justified in doing so.</p><p>The authors collected their data during the period from 2017 to 2022, and national statistics from Cats Protection indicate that the proportion of cat-owning households in Greater London in 2021 was 26% (Cats Protection, <span>2021</span>). Assuming the authors have taken a representative sample of patients with ECR, this means the prevalence of cat-owning amongst patients with ECR is over 10% lower than that of the population from which they were sampled. Of course, this is a liberal estimate as one may need to correct for repeated measures. There were 194 patients and 215 teeth, so the upper bound on the proportion of cat-owning ECR patients is 34/194 = 17.5% (95% CI: 12.2% to 22.9%); still much lower than the London population prevalence of cat-owning households.</p><p>Based on the above data, should we then ask: does cat-owning prevent ECR?</p><p>Nasir Z. Bashir: Conceptualization, design, data acquisition and interpretation, drafting and critically revising the manuscript. Eduardo Bernabé: Drafting and critically revising the manuscript.</p><p>NZB is supported by the Wellcome Trust (322 777/Z/24/Z).</p><p>The authors declare no conflicts of interest.</p><p>No ethical approval was required for this work.</p>\",\"PeriodicalId\":13724,\"journal\":{\"name\":\"International endodontic journal\",\"volume\":\"58 8\",\"pages\":\"1277-1279\"},\"PeriodicalIF\":7.1000,\"publicationDate\":\"2025-05-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1111/iej.14258\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International endodontic journal\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1111/iej.14258\",\"RegionNum\":1,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"DENTISTRY, ORAL SURGERY & MEDICINE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International endodontic journal","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/iej.14258","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"DENTISTRY, ORAL SURGERY & MEDICINE","Score":null,"Total":0}
Cats and external cervical resorption: Statistical considerations
We have read with some intrigue the recent article by Patel et al., 2025, which investigates putative risk factors for external cervical resorption (ECR). In a subsequent letter, Tay & Cooray, 2025, provide a fantastic exposition on why basic consideration of epidemiological methods means that drawing reliable inferences from this article is incredibly difficult. In addition to these concerns, we would like to highlight issues pertaining to the statistical methods used:
The way in which the authors present their power calculations is an example of ‘cut-and-paste’ statistics (White et al., 2022). They claim ‘For a sample size of 180 patients, a power of 95% would be achieved to detect differences between two independent proportions, assuming a level of confidence of 95%’.
It is evident that power is dependent on what the investigator specifies p1 and p2 to be. The authors have not described what they assumed these to be, nor is there any mention of what the desired minimum detectable effect is, that is, p2–p1. Hence, it is completely unclear what effect the authors are claiming they have 95% power to detect.
For readers who wish to run the power calculation analogous to that which the authors present, we provide the following R code:
The output of which indicates the authors have a power to detect a difference of 0.26 between proportions in independent groups, assuming p1 = 0.5. The differences in proportions between the groups observed in the study sample are generally far smaller than 0.26.
The study sample was stratified into individuals with no identifiable factor, a single factor, or multiple factors. Whilst stratification in this manner is not inherently an issue, statistical tests carried out should be interpreted appropriately. In figure 4, there are 15 proposed risk factors. This means there are 215 (= 31 768) possible combinations of risk factors which an individual may have. Taken to the logical extreme, should the authors not run all 31 768 tests and report all significant findings? Effectively, figures 6 through 8 suggest a less extreme version of this was carried out, with 15 tests per figure, resulting in (at a minimum) a total of 45 significance tests. Assuming a type I error rate of 0.05, this means that 2 to 3 of the reported significant findings are likely false positives. Tay & Cooray, 2025, suggest the use of correction for multiple testing to control the false discovery rate. There have been back-and-forth debates on the need to adjust for multiple testing for several decades, and we are not necessarily arguing that it should or should not have been done in this paper (Rothman, 1990). What we are stating, however, is that the issues around multiple testing should have been brought up and transparently discussed.
The authors claim that ‘the aim of this study was to investigate potential predisposing factors associated with ECR’. All of the individuals in this study have ECR, so it is unclear exactly how this aim will be addressed, without comparison to individuals who do not have ECR.
This inappropriate study sample means that the null hypothesis in these significance tests which the authors carry out is not that the predisposing factors are associated with no increase in risk of ECR, that is, the intuitive null hypothesis which one typically assumes. Given that the study sample is patients with ECR, the findings of the statistical tests are to be interpreted as conditional on the presence of disease. For example, consider the reported result that ‘There was a significant association between cat ownership and ECR in the mandible (23.6%, 25/106 teeth, p = .002)’. This statement is not disingenuous, but it is still prone to misinterpretation by the reader. The null hypothesis being tested here is: amongst individuals with ECR, is cat ownership associated with jaw location? In other words, we are not being told if cat ownership is associated with an increased risk of ECR. We are being told, if you have ECR, is cat ownership associated with a difference in whether the ECR is incident in the upper or lower jaw? This hypothesis test seems effectively irrelevant to clinical practice.
More generally, the null hypotheses being across the statistical tests used were, ‘amongst individuals with ECR, is there an association between factor X and jaw location?’ Where factor X may be age, sex, cat ownership, number of putative risk factors etc. Again, it is completely unclear why such a test would be informative for clinical practice.
Finally, and most obviously, the authors give only cursory consideration as to how cat ownership in their study sample relates to the population from which they are sampling. Of course, one should not forget (as Tay & Cooray, 2025, mention) naïve comparisons are prone to confounding and do not account for any causal structure. However, all the tests carried out in the article are done in this manner, so we feel equally justified in doing so.
The authors collected their data during the period from 2017 to 2022, and national statistics from Cats Protection indicate that the proportion of cat-owning households in Greater London in 2021 was 26% (Cats Protection, 2021). Assuming the authors have taken a representative sample of patients with ECR, this means the prevalence of cat-owning amongst patients with ECR is over 10% lower than that of the population from which they were sampled. Of course, this is a liberal estimate as one may need to correct for repeated measures. There were 194 patients and 215 teeth, so the upper bound on the proportion of cat-owning ECR patients is 34/194 = 17.5% (95% CI: 12.2% to 22.9%); still much lower than the London population prevalence of cat-owning households.
Based on the above data, should we then ask: does cat-owning prevent ECR?
Nasir Z. Bashir: Conceptualization, design, data acquisition and interpretation, drafting and critically revising the manuscript. Eduardo Bernabé: Drafting and critically revising the manuscript.
NZB is supported by the Wellcome Trust (322 777/Z/24/Z).
期刊介绍:
The International Endodontic Journal is published monthly and strives to publish original articles of the highest quality to disseminate scientific and clinical knowledge; all manuscripts are subjected to peer review. Original scientific articles are published in the areas of biomedical science, applied materials science, bioengineering, epidemiology and social science relevant to endodontic disease and its management, and to the restoration of root-treated teeth. In addition, review articles, reports of clinical cases, book reviews, summaries and abstracts of scientific meetings and news items are accepted.
The International Endodontic Journal is essential reading for general dental practitioners, specialist endodontists, research, scientists and dental teachers.