复杂的心理测量分析真的重要吗？使用抗抑郁药物试验的个体参与者数据比较多种方法。

IF 5.5 2区医学 Q1 PSYCHIATRY

Psychological Medicine Pub Date : 2025-10-01 DOI:10.1017/S0033291725101785

David Byrne, Frank Doyle, Susan Brannick, Robert M Carney, Pim Cuijpers, Alexandra L Dima, Kenneth E Freedland, Suzanne Guerin, David Hevey, Bishember Kathuria, Emma Wallace, Fiona Boland

{"title":"复杂的心理测量分析真的重要吗？使用抗抑郁药物试验的个体参与者数据比较多种方法。","authors":"David Byrne, Frank Doyle, Susan Brannick, Robert M Carney, Pim Cuijpers, Alexandra L Dima, Kenneth E Freedland, Suzanne Guerin, David Hevey, Bishember Kathuria, Emma Wallace, Fiona Boland","doi":"10.1017/S0033291725101785","DOIUrl":null,"url":null,"abstract":"Background: Psychometric methods are used to remove underperforming items and reduce error in existing measures, albeit different approaches can produce different results. This study aimed to determine the implications of applying different psychometric methods for clinical trial outcomes.Methods: Individual participant data from 15 antidepressant treatment trials from Vivli.org were analyzed. Baseline (pretreatment) and 8-week (range 4-12 weeks) outcome data from the Montgomery-Asberg Depression Rating Scale were subjected to best-practice factor analysis (FA), item response theory (IRT), and network analysis (NA) approaches. Trial outcomes for the original summative scores and psychometric-model scores were assessed using multilevel models. Percentage differences in Cohen's d effect sizes for the original summative and psychometrically modeled scores were the effects of interest.Results: Each method produced unidimensional models, but the modified scales varied from 7 to 10 items. Treatment effects (d = 0.072) were unchanged for IRT (10 items), decreased by 1.3%-2.8% (eight-item abbreviated d = 0.070; weighted score d = 0.071) for NA, and increased by 11%-12.5% (seven-item abbreviated model d = 0.081; weighted score d = 0.080) for FA.Discussion: IRT and NA yielded negligible differences in effect outcomes relative to original trials. FA increased effect sizes and may be the most effective method for identifying the items on which placebo and treatment group outcomes differ.","PeriodicalId":20891,"journal":{"name":"Psychological Medicine","volume":"55 ","pages":"e289"},"PeriodicalIF":5.5000,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Do complex psychometric analyses really matter? Comparing multiple approaches using individual participant data from antidepressant trials.\",\"authors\":\"David Byrne, Frank Doyle, Susan Brannick, Robert M Carney, Pim Cuijpers, Alexandra L Dima, Kenneth E Freedland, Suzanne Guerin, David Hevey, Bishember Kathuria, Emma Wallace, Fiona Boland\",\"doi\":\"10.1017/S0033291725101785\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Background: Psychometric methods are used to remove underperforming items and reduce error in existing measures, albeit different approaches can produce different results. This study aimed to determine the implications of applying different psychometric methods for clinical trial outcomes.Methods: Individual participant data from 15 antidepressant treatment trials from Vivli.org were analyzed. Baseline (pretreatment) and 8-week (range 4-12 weeks) outcome data from the Montgomery-Asberg Depression Rating Scale were subjected to best-practice factor analysis (FA), item response theory (IRT), and network analysis (NA) approaches. Trial outcomes for the original summative scores and psychometric-model scores were assessed using multilevel models. Percentage differences in Cohen's d effect sizes for the original summative and psychometrically modeled scores were the effects of interest.Results: Each method produced unidimensional models, but the modified scales varied from 7 to 10 items. Treatment effects (d = 0.072) were unchanged for IRT (10 items), decreased by 1.3%-2.8% (eight-item abbreviated d = 0.070; weighted score d = 0.071) for NA, and increased by 11%-12.5% (seven-item abbreviated model d = 0.081; weighted score d = 0.080) for FA.Discussion: IRT and NA yielded negligible differences in effect outcomes relative to original trials. FA increased effect sizes and may be the most effective method for identifying the items on which placebo and treatment group outcomes differ.\",\"PeriodicalId\":20891,\"journal\":{\"name\":\"Psychological Medicine\",\"volume\":\"55 \",\"pages\":\"e289\"},\"PeriodicalIF\":5.5000,\"publicationDate\":\"2025-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Psychological Medicine\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1017/S0033291725101785\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"PSYCHIATRY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Psychological Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1017/S0033291725101785","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PSYCHIATRY","Score":null,"Total":0}

引用次数: 0

摘要

背景：虽然不同的方法会产生不同的结果，但心理测量方法用于去除表现不佳的项目并减少现有测量中的误差。本研究旨在确定应用不同心理测量方法对临床试验结果的影响。方法：对来自Vivli.org网站的15项抗抑郁治疗试验的个体参与者数据进行分析。蒙哥马利-阿斯伯格抑郁评定量表的基线（预处理）和8周（范围4-12周）结果数据采用最佳实践因素分析（FA）、项目反应理论（IRT）和网络分析（NA）方法。原始总结性得分和心理测量模型得分的试验结果使用多水平模型进行评估。原始总结性和心理测量模型得分的科恩效应大小的百分比差异是兴趣的影响。结果：每一种方法均产生一维模型，但修改后的量表从7项到10项不等。IRT（10项）的治疗效果（d = 0.072）不变，NA的治疗效果下降1.3% ~ 2.8%(8项缩写d = 0.070，加权评分d = 0.071)， FA的治疗效果增加11% ~ 12.5%（7项缩写d = 0.081，加权评分d = 0.080）。讨论：相对于原始试验，IRT和NA在效果结果上的差异可以忽略不计。FA增加了效应量，可能是识别安慰剂组和治疗组结果不同的项目的最有效方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Do complex psychometric analyses really matter? Comparing multiple approaches using individual participant data from antidepressant trials.

Background: Psychometric methods are used to remove underperforming items and reduce error in existing measures, albeit different approaches can produce different results. This study aimed to determine the implications of applying different psychometric methods for clinical trial outcomes.

Methods: Individual participant data from 15 antidepressant treatment trials from Vivli.org were analyzed. Baseline (pretreatment) and 8-week (range 4-12 weeks) outcome data from the Montgomery-Asberg Depression Rating Scale were subjected to best-practice factor analysis (FA), item response theory (IRT), and network analysis (NA) approaches. Trial outcomes for the original summative scores and psychometric-model scores were assessed using multilevel models. Percentage differences in Cohen's d effect sizes for the original summative and psychometrically modeled scores were the effects of interest.

Results: Each method produced unidimensional models, but the modified scales varied from 7 to 10 items. Treatment effects (d = 0.072) were unchanged for IRT (10 items), decreased by 1.3%-2.8% (eight-item abbreviated d = 0.070; weighted score d = 0.071) for NA, and increased by 11%-12.5% (seven-item abbreviated model d = 0.081; weighted score d = 0.080) for FA.

Discussion: IRT and NA yielded negligible differences in effect outcomes relative to original trials. FA increased effect sizes and may be the most effective method for identifying the items on which placebo and treatment group outcomes differ.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Psychological Medicine 医学-精神病学

CiteScore

11.30

自引率

4.30%

发文量

711

审稿时长

3-6 weeks

期刊介绍： Now in its fifth decade of publication, Psychological Medicine is a leading international journal in the fields of psychiatry, related aspects of psychology and basic sciences. From 2014, there are 16 issues a year, each featuring original articles reporting key research being undertaken worldwide, together with shorter editorials by distinguished scholars and an important book review section. The journal''s success is clearly demonstrated by a consistently high impact factor.