脊柱随机对照试验中基线分类变量和p值的准确性和分布。

IF 2.9 3区综合性期刊 Q1 MULTIDISCIPLINARY SCIENCES

Royal Society Open Science Pub Date : 2025-01-15 eCollection Date: 2025-01-01 DOI:10.1098/rsos.240170

Mark J Bolland, Alison Avenell, Andrew Grey

{"title":"脊柱随机对照试验中基线分类变量和p值的准确性和分布。","authors":"Mark J Bolland, Alison Avenell, Andrew Grey","doi":"10.1098/rsos.240170","DOIUrl":null,"url":null,"abstract":"Levayer and colleagues assessed integrity issues in randomized controlled trials (RCTs) in four spine journals using baseline p-values from categorical variables, concluding that there was no evidence of 'systemic fraudulent behaviour'. We used their published dataset to assess the accuracy of reported p-values and whether observed and expected distributions of frequency counts and p-values were consistent. In 51 out of 929 (5.5%) baseline variables, the sum of frequencies did not agree with the reported number of participants. For one-third of reported p-values (172 out of 522), we could not calculate a matching p-value using a range of statistical tests. Sparse data were common: for 22% (74 out of 332) of variables in which the reported p-value matched the p-value calculated from a chi-square test, the expected cells were smaller than recommended for the use of chi-square tests. There were 20-25% more two-arm trials with differences in frequency counts of 1 or 2 between-groups than expected. There were small differences between observed and expected distributions of baseline p-values, but these depended on analysis methods. In summary, incorrectly reported p-values and incorrect statistical test usage were common, and there were differences between observed and expected distributions of baseline p-values and frequency counts, raising questions about the integrity of some RCTs in these journals.","PeriodicalId":21525,"journal":{"name":"Royal Society Open Science","volume":"12 1","pages":"240170"},"PeriodicalIF":2.9000,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11739909/pdf/","citationCount":"0","resultStr":"{\"title\":\"Accuracy and distribution of baseline categorical variables and p-values in spine randomized controlled trials.\",\"authors\":\"Mark J Bolland, Alison Avenell, Andrew Grey\",\"doi\":\"10.1098/rsos.240170\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Levayer and colleagues assessed integrity issues in randomized controlled trials (RCTs) in four spine journals using baseline p-values from categorical variables, concluding that there was no evidence of 'systemic fraudulent behaviour'. We used their published dataset to assess the accuracy of reported p-values and whether observed and expected distributions of frequency counts and p-values were consistent. In 51 out of 929 (5.5%) baseline variables, the sum of frequencies did not agree with the reported number of participants. For one-third of reported p-values (172 out of 522), we could not calculate a matching p-value using a range of statistical tests. Sparse data were common: for 22% (74 out of 332) of variables in which the reported p-value matched the p-value calculated from a chi-square test, the expected cells were smaller than recommended for the use of chi-square tests. There were 20-25% more two-arm trials with differences in frequency counts of 1 or 2 between-groups than expected. There were small differences between observed and expected distributions of baseline p-values, but these depended on analysis methods. In summary, incorrectly reported p-values and incorrect statistical test usage were common, and there were differences between observed and expected distributions of baseline p-values and frequency counts, raising questions about the integrity of some RCTs in these journals.\",\"PeriodicalId\":21525,\"journal\":{\"name\":\"Royal Society Open Science\",\"volume\":\"12 1\",\"pages\":\"240170\"},\"PeriodicalIF\":2.9000,\"publicationDate\":\"2025-01-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11739909/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Royal Society Open Science\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://doi.org/10.1098/rsos.240170\",\"RegionNum\":3,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q1\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Royal Society Open Science","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1098/rsos.240170","RegionNum":3,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}

引用次数: 0

摘要

Levayer及其同事使用分类变量的基线p值评估了四种脊柱期刊上的随机对照试验（rct）中的诚信问题，得出结论认为没有证据表明存在“系统性欺诈行为”。我们使用他们发表的数据集来评估报告的p值的准确性，以及频率计数和p值的观察和预期分布是否一致。在929个基线变量中的51个（5.5%）中，频率总和与报告的参与者人数不一致。对于报告的p值的三分之一（522个中的172个），我们无法使用一系列统计检验计算匹配的p值。稀疏数据很常见：在报告的p值与卡方检验计算的p值相匹配的变量中，有22%（332个变量中的74个）的期望单元小于使用卡方检验的推荐值。组间频率计数差异为1或2的两组试验比预期多20-25%。基线p值的观察分布和预期分布之间存在微小差异，但这取决于分析方法。总之，不正确的p值报告和不正确的统计检验使用是常见的，并且在基线p值和频率计数的观察和预期分布之间存在差异，这引起了对这些期刊中一些随机对照试验的完整性的质疑。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Accuracy and distribution of baseline categorical variables and p-values in spine randomized controlled trials.

Levayer and colleagues assessed integrity issues in randomized controlled trials (RCTs) in four spine journals using baseline p-values from categorical variables, concluding that there was no evidence of 'systemic fraudulent behaviour'. We used their published dataset to assess the accuracy of reported p-values and whether observed and expected distributions of frequency counts and p-values were consistent. In 51 out of 929 (5.5%) baseline variables, the sum of frequencies did not agree with the reported number of participants. For one-third of reported p-values (172 out of 522), we could not calculate a matching p-value using a range of statistical tests. Sparse data were common: for 22% (74 out of 332) of variables in which the reported p-value matched the p-value calculated from a chi-square test, the expected cells were smaller than recommended for the use of chi-square tests. There were 20-25% more two-arm trials with differences in frequency counts of 1 or 2 between-groups than expected. There were small differences between observed and expected distributions of baseline p-values, but these depended on analysis methods. In summary, incorrectly reported p-values and incorrect statistical test usage were common, and there were differences between observed and expected distributions of baseline p-values and frequency counts, raising questions about the integrity of some RCTs in these journals.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Royal Society Open Science Multidisciplinary-Multidisciplinary

CiteScore

6.00

自引率

0.00%

发文量

508

审稿时长

14 weeks

期刊介绍： Royal Society Open Science is a new open journal publishing high-quality original research across the entire range of science on the basis of objective peer-review. The journal covers the entire range of science and mathematics and will allow the Society to publish all the high-quality work it receives without the usual restrictions on scope, length or impact.