David Byrne, Frank Doyle, Susan Brannick, Robert M Carney, Pim Cuijpers, Alexandra L Dima, Kenneth E Freedland, Suzanne Guerin, David Hevey, Bishember Kathuria, Emma Wallace, Fiona Boland
{"title":"复杂的心理测量分析真的重要吗?使用抗抑郁药物试验的个体参与者数据比较多种方法。","authors":"David Byrne, Frank Doyle, Susan Brannick, Robert M Carney, Pim Cuijpers, Alexandra L Dima, Kenneth E Freedland, Suzanne Guerin, David Hevey, Bishember Kathuria, Emma Wallace, Fiona Boland","doi":"10.1017/S0033291725101785","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Psychometric methods are used to remove underperforming items and reduce error in existing measures, albeit different approaches can produce different results. This study aimed to determine the implications of applying different psychometric methods for clinical trial outcomes.</p><p><strong>Methods: </strong>Individual participant data from 15 antidepressant treatment trials from Vivli.org were analyzed. Baseline (pretreatment) and 8-week (range 4-12 weeks) outcome data from the Montgomery-Asberg Depression Rating Scale were subjected to best-practice factor analysis (FA), item response theory (IRT), and network analysis (NA) approaches. Trial outcomes for the original summative scores and psychometric-model scores were assessed using multilevel models. Percentage differences in Cohen's <i>d</i> effect sizes for the original summative and psychometrically modeled scores were the effects of interest.</p><p><strong>Results: </strong>Each method produced unidimensional models, but the modified scales varied from 7 to 10 items. Treatment effects (<i>d</i> = 0.072) were unchanged for IRT (10 items), decreased by 1.3%-2.8% (eight-item abbreviated <i>d</i> = 0.070; weighted score <i>d</i> = 0.071) for NA, and increased by 11%-12.5% (seven-item abbreviated model <i>d</i> = 0.081; weighted score <i>d</i> = 0.080) for FA.</p><p><strong>Discussion: </strong>IRT and NA yielded negligible differences in effect outcomes relative to original trials. FA increased effect sizes and may be the most effective method for identifying the items on which placebo and treatment group outcomes differ.</p>","PeriodicalId":20891,"journal":{"name":"Psychological Medicine","volume":"55 ","pages":"e289"},"PeriodicalIF":5.5000,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Do complex psychometric analyses really matter? Comparing multiple approaches using individual participant data from antidepressant trials.\",\"authors\":\"David Byrne, Frank Doyle, Susan Brannick, Robert M Carney, Pim Cuijpers, Alexandra L Dima, Kenneth E Freedland, Suzanne Guerin, David Hevey, Bishember Kathuria, Emma Wallace, Fiona Boland\",\"doi\":\"10.1017/S0033291725101785\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Psychometric methods are used to remove underperforming items and reduce error in existing measures, albeit different approaches can produce different results. This study aimed to determine the implications of applying different psychometric methods for clinical trial outcomes.</p><p><strong>Methods: </strong>Individual participant data from 15 antidepressant treatment trials from Vivli.org were analyzed. Baseline (pretreatment) and 8-week (range 4-12 weeks) outcome data from the Montgomery-Asberg Depression Rating Scale were subjected to best-practice factor analysis (FA), item response theory (IRT), and network analysis (NA) approaches. Trial outcomes for the original summative scores and psychometric-model scores were assessed using multilevel models. Percentage differences in Cohen's <i>d</i> effect sizes for the original summative and psychometrically modeled scores were the effects of interest.</p><p><strong>Results: </strong>Each method produced unidimensional models, but the modified scales varied from 7 to 10 items. Treatment effects (<i>d</i> = 0.072) were unchanged for IRT (10 items), decreased by 1.3%-2.8% (eight-item abbreviated <i>d</i> = 0.070; weighted score <i>d</i> = 0.071) for NA, and increased by 11%-12.5% (seven-item abbreviated model <i>d</i> = 0.081; weighted score <i>d</i> = 0.080) for FA.</p><p><strong>Discussion: </strong>IRT and NA yielded negligible differences in effect outcomes relative to original trials. FA increased effect sizes and may be the most effective method for identifying the items on which placebo and treatment group outcomes differ.</p>\",\"PeriodicalId\":20891,\"journal\":{\"name\":\"Psychological Medicine\",\"volume\":\"55 \",\"pages\":\"e289\"},\"PeriodicalIF\":5.5000,\"publicationDate\":\"2025-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Psychological Medicine\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1017/S0033291725101785\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"PSYCHIATRY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Psychological Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1017/S0033291725101785","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PSYCHIATRY","Score":null,"Total":0}
Do complex psychometric analyses really matter? Comparing multiple approaches using individual participant data from antidepressant trials.
Background: Psychometric methods are used to remove underperforming items and reduce error in existing measures, albeit different approaches can produce different results. This study aimed to determine the implications of applying different psychometric methods for clinical trial outcomes.
Methods: Individual participant data from 15 antidepressant treatment trials from Vivli.org were analyzed. Baseline (pretreatment) and 8-week (range 4-12 weeks) outcome data from the Montgomery-Asberg Depression Rating Scale were subjected to best-practice factor analysis (FA), item response theory (IRT), and network analysis (NA) approaches. Trial outcomes for the original summative scores and psychometric-model scores were assessed using multilevel models. Percentage differences in Cohen's d effect sizes for the original summative and psychometrically modeled scores were the effects of interest.
Results: Each method produced unidimensional models, but the modified scales varied from 7 to 10 items. Treatment effects (d = 0.072) were unchanged for IRT (10 items), decreased by 1.3%-2.8% (eight-item abbreviated d = 0.070; weighted score d = 0.071) for NA, and increased by 11%-12.5% (seven-item abbreviated model d = 0.081; weighted score d = 0.080) for FA.
Discussion: IRT and NA yielded negligible differences in effect outcomes relative to original trials. FA increased effect sizes and may be the most effective method for identifying the items on which placebo and treatment group outcomes differ.
期刊介绍:
Now in its fifth decade of publication, Psychological Medicine is a leading international journal in the fields of psychiatry, related aspects of psychology and basic sciences. From 2014, there are 16 issues a year, each featuring original articles reporting key research being undertaken worldwide, together with shorter editorials by distinguished scholars and an important book review section. The journal''s success is clearly demonstrated by a consistently high impact factor.