{"title":"P > 0.05 is Good: The NORD-h Protocol for Several Hypothesis Analysis Based on Known Risks, Costs, and Benefits.","authors":"Alessandro Rovetta, Mohammad Ali Mansournia","doi":"10.3961/jpmph.24.250","DOIUrl":null,"url":null,"abstract":"<p><p>Statistical testing in medicine is a controversial and commonly misunderstood topic. Despite decades of efforts by renowned associations and international experts, fallacies such as nullism, the magnitude fallacy, and dichotomania are still widespread within clinical and epidemiological research. This can lead to serious health errors (e.g., misidentification of adverse reactions). In this regard, our work sheds light on another common interpretive and cognitive error: the fallacy of high significance, understood as the mistaken tendency to prioritize findings that lead to low p-values. Indeed, there are target hypotheses (e.g., a hazard ratio of 0.10) for which a high p-value is an optimal and desirable outcome. Accordingly, we propose a novel method that goes beyond mere null hypothesis testing by assessing the statistical surprise of the experimental result compared to the prediction of several target assumptions. Additionally, we formalize the concept of interval hypotheses based on prior information about costs, risks, and benefits for the stakeholders (NORD-h protocol). The incompatibility graph (or surprisal graph) is adopted in this context. Finally, we discuss the epistemic necessity for a descriptive, (quasi) unconditional approach in statistics, which is essential to draw valid conclusions about the consistency of data with all relevant possibilities, including study limitations. Given these considerations, this new protocol has the potential to significantly impact the production of reliable evidence in public health.</p>","PeriodicalId":16893,"journal":{"name":"Journal of Preventive Medicine and Public Health","volume":" ","pages":""},"PeriodicalIF":2.8000,"publicationDate":"2024-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Preventive Medicine and Public Health","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3961/jpmph.24.250","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH","Score":null,"Total":0}
引用次数: 0
Abstract
Statistical testing in medicine is a controversial and commonly misunderstood topic. Despite decades of efforts by renowned associations and international experts, fallacies such as nullism, the magnitude fallacy, and dichotomania are still widespread within clinical and epidemiological research. This can lead to serious health errors (e.g., misidentification of adverse reactions). In this regard, our work sheds light on another common interpretive and cognitive error: the fallacy of high significance, understood as the mistaken tendency to prioritize findings that lead to low p-values. Indeed, there are target hypotheses (e.g., a hazard ratio of 0.10) for which a high p-value is an optimal and desirable outcome. Accordingly, we propose a novel method that goes beyond mere null hypothesis testing by assessing the statistical surprise of the experimental result compared to the prediction of several target assumptions. Additionally, we formalize the concept of interval hypotheses based on prior information about costs, risks, and benefits for the stakeholders (NORD-h protocol). The incompatibility graph (or surprisal graph) is adopted in this context. Finally, we discuss the epistemic necessity for a descriptive, (quasi) unconditional approach in statistics, which is essential to draw valid conclusions about the consistency of data with all relevant possibilities, including study limitations. Given these considerations, this new protocol has the potential to significantly impact the production of reliable evidence in public health.
医学中的统计检验是一个颇具争议且常被误解的话题。尽管经过知名协会和国际专家数十年的努力,无效论、幅度谬误和二分法等谬误仍在临床和流行病学研究中广泛存在。这可能导致严重的健康错误(如不良反应的错误识别)。在这方面,我们的工作揭示了另一种常见的解释和认知错误:高显著性谬误,即优先考虑导致低 p 值的研究结果的错误倾向。事实上,在一些目标假设(如 0.10 的危险比)中,高 p 值是最佳和理想的结果。因此,我们提出了一种新方法,它超越了单纯的空假设检验,而是通过评估实验结果与若干目标假设的预测值相比在统计学上的惊喜程度。此外,我们还根据利益相关者的成本、风险和收益的先验信息,正式提出了区间假设的概念(NORD-h 协议)。在这种情况下,我们采用了不相容图(或惊喜图)。最后,我们讨论了统计学中描述性、(准)无条件方法在认识论上的必要性,这对于得出关于数据与所有相关可能性(包括研究局限性)一致性的有效结论至关重要。考虑到这些因素,这一新方案有可能对公共卫生领域可靠证据的产生产生重大影响。