David Byrne, Fiona Boland, Susan Brannick, Robert M. Carney, Pim Cuijpers, Alexandra L. Dima, Kenneth E. Freedland, Suzanne Guerin, David Hevey, Bishember Kathuria, Emma Wallace, Frank Doyle

Journal of Clinical Epidemiology, Volume 183, Article 111762. DOI: 10.1016/j.jclinepi.2025.111762. Published 2025-03-24.
Applying advanced psychometric approaches yields differential randomized trial effect sizes: secondary analysis of individual participant data from antidepressant studies using the Hamilton rating scale for depression
Objectives
Multiple sophisticated techniques are used to evaluate psychometric scales, in theory reducing error and enhancing the measurement of patient-reported outcomes. We aimed to determine whether applying different psychometric analyses would demonstrate important differences in treatment effects.
Study Design and Setting
We conducted a secondary analysis of individual participant data (IPD) from 20 antidepressant treatment trials obtained from Vivli.org (n = 6843). Pooled item-level data from the Hamilton Rating Scale for Depression (HRSD-17) were analyzed using confirmatory factor analysis (CFA), item response theory (IRT), and network analysis (NA). Multilevel models were used to analyze differences in trial effects at approximately 8 weeks (range 4–12 weeks) after treatment commencement, with standardized mean differences calculated as Cohen's d. The effect size outcomes for the original total depression scores were compared with psychometrically informed outcomes based on abbreviated and weighted depression scores.
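The core comparison described above — Cohen's d computed from the full scale total versus an abbreviated total that drops poorly performing items — can be sketched as follows. This is a minimal illustration with simulated data: the item scores, the number of retained items, and the effect magnitude are all assumptions, not the study's IPD.

```python
import numpy as np

# Illustrative sketch only (simulated data, not the study's IPD): how an
# abbreviated scale total can shift the standardized mean difference
# (Cohen's d) relative to the full 17-item HRSD total.

rng = np.random.default_rng(0)
n = 300
treat = rng.integers(0, 5, size=(n, 17)).astype(float)    # item scores, active arm
placebo = rng.integers(0, 5, size=(n, 17)).astype(float)  # item scores, comparator arm
treat[:, :10] -= 0.8  # simulate improvement concentrated in 10 well-performing items

def cohens_d(a, b):
    """Standardized mean difference using a pooled standard deviation."""
    pooled_sd = np.sqrt((a.var(ddof=1) + b.var(ddof=1)) / 2)
    return (a.mean() - b.mean()) / pooled_sd

keep = list(range(10))  # hypothetical items retained after psychometric screening
d_full = cohens_d(placebo.sum(axis=1), treat.sum(axis=1))
d_abbrev = cohens_d(placebo[:, keep].sum(axis=1), treat[:, keep].sum(axis=1))
print(f"d (full scale) = {d_full:.2f}, d (abbreviated) = {d_abbrev:.2f}")
```

In this toy setup, dropping the items that carry no treatment signal removes noise from the total, so the abbreviated d is larger than the full-scale d — the same direction of change the study reports for the factor analytic approach.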
Results
Several items performed poorly during psychometric analyses and were eliminated, resulting in different models being obtained for each approach. Treatment effects were modified as follows per psychometric approach: 10.4%–14.9% increase for CFA, 0%–2.9% increase for IRT, and 14.9%–16.4% reduction for NA.
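The percentage modifications reported here are relative changes in Cohen's d. A hypothetical arithmetic example (both d values below are invented for illustration, not taken from the paper):

```python
# Hypothetical values: an assumed original pooled effect size and an
# assumed CFA-informed effect size; neither is from the study.
d_original = 0.67
d_cfa = 0.74
pct_change = 100 * (d_cfa - d_original) / d_original
print(f"{pct_change:+.1f}% change in effect size")
```

With these assumed values, the relative change falls at the lower end of the CFA range reported above.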
Conclusion
Psychometric analyses differentially moderate effect size outcomes depending on the method used. In a 20-trial sample, factor analytic approaches increased treatment effect sizes relative to the original outcomes, NA decreased them, and IRT results reflected original trial outcomes.
Plain Language Summary
This study aimed to determine whether using advanced psychometric methods would reveal any clinically or statistically important differences in clinical trial outcomes when compared to original findings. We applied factor analysis (FA), item response theory (IRT), and network analysis (NA) to the most commonly used measure of depression in clinical settings – the Hamilton Rating Scale for Depression (HRSD) – to identify and remove nonperforming survey items and calculate weighted item scores. We found that the efficacy reported in trials increased when using FA to remove items, but decreased when using NA. There was almost no change in efficacy when using IRT. Using weighted scores based on the respective models offered no additional utility in terms of increasing or decreasing efficacy outcomes.
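A weighted item score of the kind described can be sketched as a loading-weighted sum of the retained items, in contrast to the conventional unweighted total. The loadings and item responses below are invented for illustration and do not come from the paper.

```python
import numpy as np

# Hypothetical CFA loadings for five retained items, and one respondent's
# item responses; both are assumptions for illustration only.
loadings = np.array([0.72, 0.65, 0.58, 0.81, 0.69])
items = np.array([2, 3, 1, 4, 2], dtype=float)

unweighted_total = float(items.sum())     # the conventional scale total
weighted_total = float(items @ loadings)  # loading-weighted alternative
print(unweighted_total, weighted_total)
```

The weighted total rescales each item's contribution by how strongly it loads on the underlying factor; as the summary notes, this reweighting did not materially change efficacy outcomes in the study.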
Journal overview:
The Journal of Clinical Epidemiology strives to enhance the quality of clinical and patient-oriented healthcare research by advancing and applying innovative methods in conducting, presenting, synthesizing, disseminating, and translating research results into optimal clinical practice. Special emphasis is placed on training new generations of scientists and clinical practice leaders.