Using patient-reported outcome measures in clinical trials: perspectives for and against a modular approach

IF 6.7 2区医学 Q1 MEDICINE, GENERAL & INTERNAL

Medical Journal of Australia Pub Date : 2025-03-11 DOI:10.5694/mja2.52629

Carrie-Anne Ng, Jessica Roydhouse, Tim Luckett, Richard De Abreu Lourenco, Brendan Mulhern

{"title":"Using patient-reported outcome measures in clinical trials: perspectives for and against a modular approach","authors":"Carrie-Anne Ng, Jessica Roydhouse, Tim Luckett, Richard De Abreu Lourenco, Brendan Mulhern","doi":"10.5694/mja2.52629","DOIUrl":null,"url":null,"abstract":"There is growing acknowledgement among international stakeholders, including regulators, health technology assessment (HTA) bodies and professional societies, that the inclusion of patient-reported outcome measures (PROMs) in clinical trials should be guided by clear rationales.1 In this regard, existing PROMs may overreach or fall short in measuring domains most relevant for a specific study's context, population, treatments and stakeholders.In response to the need for flexibility in assessing specific health-related quality of life (HRQoL) domains and regulatory recommendations,1 PROM developers and stakeholders have provided guidance on a modular approach.2-4 Although most guidance has focused on cancer clinical trials, the same principles are applicable to any context where PROMs are used (eg, clinical practice). In trials, the modular approach has been defined as the selection and assessment of specific patient-relevant and clinically relevant domains of interest, purposefully selected from multidomain PROMs, and then independently scored and interpreted.3, 4Selection of the most relevant disease- and treatment-specific domains can be based on previous research, consumer advice and clinical experience. Stakeholders have also recommended using conceptual frameworks and core patient-reported outcome (PROs) sets to guide the selection of PROM domains.5 HRQoL concepts identified as primary or key secondary outcomes should ideally be assessed using the most valid, reliable and dedicated measures, which are often more in-depth than domains in multidomain PROMs used to provide a broad overview. For example, if pain is a primary outcome in a trial investigating an anti-cancer treatment, the Brief Pain Inventory might be a comprehensive substitution for the two-item domain of pain from the widely used cancer-specific HRQoL PROM, the European Organisation for Research and Treatment of Cancer Quality of Life Core Questionnaire (EORTC QLQ-C30).6 Secondary outcomes, such as appetite loss and constipation, might be adequately assessed by the QLQ-C30's single-item domains for these outcomes. Thus, we propose three possible applications of the modular approach (Box 1): (i) using a study-specific conceptual framework, incorporating dedicated PROMs or domains to measure relevant HRQoL concepts; (ii) using a full-length PROM, removing domains less relevant to the study context or not covered by a core PRO set; or (iii) using a full-length PROM, substituting domains that are primary or key secondary outcomes with dedicated PROMs or domains.Box 2 presents the cases for and against the use of a modular approach. In this perspective article, we discuss some of these arguments in the context of assessing clinical efficacy and conducting economic evaluations.A modular approach can address respondent burden associated with lengthy PROMs and the potential for missing or poor quality PRO data in clinical trials7 (Box 3, Example 1). However, the aim of a modular approach is to prioritise assessment of the most important domains in each context, rather than to reduce PROM length. Indeed, patients may be willing to complete longer measures if items are relevant to their experience.10 Additionally, administering domains relevant to the clinical trial context can improve sensitivity to clinically important changes in PROs (Box 3, Example 2).The modular approach also enables flexibility in assessing different domains at various timepoints. Domains that are conceptually proximal to the disease and treatment (eg, nausea severity) are more likely to change over a short period of time than more distal domains (eg, emotional function), and may therefore require more frequent assessment.11To preserve the psychometric properties of domains resulting from a modular approach, working with PROM developers and following published recommendations is encouraged.2 If selected domains are obtained from PROMs validated in the target population, they are likely to retain psychometric properties, such as content and construct validity. As additional psychometric testing may not be required, this can be beneficial for novel treatments, given faster evaluation and approval times,12 or for patients with rare cancers, where developing or validating bespoke disease-specific PROMs may be challenging.The assessment of a broad set of items promotes the integration of PROMs into early-phase trials (phase 1 or phase 2) for capturing potential symptomatic adverse events and HRQoL problems. However, once these issues are identified, investigators must decide which domains to assess in late-phase trials, and to what extent. It is unsurprising that many may default to using full-length PROMs to maintain comparability with existing studies, avoid missing unexpected effects and meet HTA agencies’ requirements for economic evaluation.To complicate matters, the modular approach necessitates greater scrutiny in the balance between the depth and breadth of issues and respondent burden (ie, time taken). Although this choice can be aided by international guidance and patient and public involvement,13, 14 it becomes less clear for domains amidst an abundance of options from various PROMs for the same HRQoL concept. Studies comparing the psychometric properties of PROMs in various study contexts can assist in this selection15 but, traditionally, conclusions have been drawn by assessing full-length PROMs instead of their domains. Psychometric evidence for specific combinations of subscales and items obtained from the EORTC suite are emerging,16 but evidence for the combination of subscales across different PROMs is lacking. This raises several questions requiring further conceptual and empirical consideration: Should only domains that have demonstrated the best psychometric evidence be selected and combined? How will a mixture of recall periods and response options across PROMs impact face validity and response distributions? What strategies can avoid biased selection of domains that may favour certain effects while downplaying others?Guidance on the modular approach suggests that item order can affect the validity of subscales because responses may be influenced by the preceding questions.2, 4 These potential biases are usually addressed during early development of a PROM, which supports the argument for keeping PROMs in their original format unless there is potential for retesting. However, item order effects are mostly theoretical, with limited research available that has tested these directly.17 Order effects have primarily been considered at the questionnaire level and conclusions regarding their presence have only been conservatively applied to the PROMs assessed.17The modular approach involves replacing existing full-length PROMs with a focused selection of HRQoL domains most relevant to a particular clinical trial context. Several considerations must be addressed before integrating the modular approach into clinical trials, including achieving broader acceptance from regulatory bodies (eg, HTA agencies). These considerations include expanding and clarifying the definition of what constitutes a module and how to apply the modular approach, while ensuring its suitability for economic evaluations. Further evidence that alleviates concerns associated with a modular approach, particularly evidence supporting its psychometric validity, would improve its acceptability in the field.No relevant disclosures.Not commissioned; externally peer reviewed.","PeriodicalId":18214,"journal":{"name":"Medical Journal of Australia","volume":"222 7","pages":"327-330"},"PeriodicalIF":6.7000,"publicationDate":"2025-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.5694/mja2.52629","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical Journal of Australia","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.5694/mja2.52629","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MEDICINE, GENERAL & INTERNAL","Score":null,"Total":0}

引用次数: 0

Abstract

There is growing acknowledgement among international stakeholders, including regulators, health technology assessment (HTA) bodies and professional societies, that the inclusion of patient-reported outcome measures (PROMs) in clinical trials should be guided by clear rationales.¹ In this regard, existing PROMs may overreach or fall short in measuring domains most relevant for a specific study's context, population, treatments and stakeholders.

In response to the need for flexibility in assessing specific health-related quality of life (HRQoL) domains and regulatory recommendations,¹ PROM developers and stakeholders have provided guidance on a modular approach.^2-4 Although most guidance has focused on cancer clinical trials, the same principles are applicable to any context where PROMs are used (eg, clinical practice). In trials, the modular approach has been defined as the selection and assessment of specific patient-relevant and clinically relevant domains of interest, purposefully selected from multidomain PROMs, and then independently scored and interpreted.^{3, 4}

Selection of the most relevant disease- and treatment-specific domains can be based on previous research, consumer advice and clinical experience. Stakeholders have also recommended using conceptual frameworks and core patient-reported outcome (PROs) sets to guide the selection of PROM domains.⁵ HRQoL concepts identified as primary or key secondary outcomes should ideally be assessed using the most valid, reliable and dedicated measures, which are often more in-depth than domains in multidomain PROMs used to provide a broad overview. For example, if pain is a primary outcome in a trial investigating an anti-cancer treatment, the Brief Pain Inventory might be a comprehensive substitution for the two-item domain of pain from the widely used cancer-specific HRQoL PROM, the European Organisation for Research and Treatment of Cancer Quality of Life Core Questionnaire (EORTC QLQ-C30).⁶ Secondary outcomes, such as appetite loss and constipation, might be adequately assessed by the QLQ-C30's single-item domains for these outcomes. Thus, we propose three possible applications of the modular approach (Box 1): (i) using a study-specific conceptual framework, incorporating dedicated PROMs or domains to measure relevant HRQoL concepts; (ii) using a full-length PROM, removing domains less relevant to the study context or not covered by a core PRO set; or (iii) using a full-length PROM, substituting domains that are primary or key secondary outcomes with dedicated PROMs or domains.

Box 2 presents the cases for and against the use of a modular approach. In this perspective article, we discuss some of these arguments in the context of assessing clinical efficacy and conducting economic evaluations.

A modular approach can address respondent burden associated with lengthy PROMs and the potential for missing or poor quality PRO data in clinical trials⁷ (Box 3, Example 1). However, the aim of a modular approach is to prioritise assessment of the most important domains in each context, rather than to reduce PROM length. Indeed, patients may be willing to complete longer measures if items are relevant to their experience.¹⁰ Additionally, administering domains relevant to the clinical trial context can improve sensitivity to clinically important changes in PROs (Box 3, Example 2).

The modular approach also enables flexibility in assessing different domains at various timepoints. Domains that are conceptually proximal to the disease and treatment (eg, nausea severity) are more likely to change over a short period of time than more distal domains (eg, emotional function), and may therefore require more frequent assessment.¹¹

To preserve the psychometric properties of domains resulting from a modular approach, working with PROM developers and following published recommendations is encouraged.² If selected domains are obtained from PROMs validated in the target population, they are likely to retain psychometric properties, such as content and construct validity. As additional psychometric testing may not be required, this can be beneficial for novel treatments, given faster evaluation and approval times,¹² or for patients with rare cancers, where developing or validating bespoke disease-specific PROMs may be challenging.

The assessment of a broad set of items promotes the integration of PROMs into early-phase trials (phase 1 or phase 2) for capturing potential symptomatic adverse events and HRQoL problems. However, once these issues are identified, investigators must decide which domains to assess in late-phase trials, and to what extent. It is unsurprising that many may default to using full-length PROMs to maintain comparability with existing studies, avoid missing unexpected effects and meet HTA agencies’ requirements for economic evaluation.

To complicate matters, the modular approach necessitates greater scrutiny in the balance between the depth and breadth of issues and respondent burden (ie, time taken). Although this choice can be aided by international guidance and patient and public involvement,^{13, 14} it becomes less clear for domains amidst an abundance of options from various PROMs for the same HRQoL concept. Studies comparing the psychometric properties of PROMs in various study contexts can assist in this selection¹⁵ but, traditionally, conclusions have been drawn by assessing full-length PROMs instead of their domains. Psychometric evidence for specific combinations of subscales and items obtained from the EORTC suite are emerging,¹⁶ but evidence for the combination of subscales across different PROMs is lacking. This raises several questions requiring further conceptual and empirical consideration: Should only domains that have demonstrated the best psychometric evidence be selected and combined? How will a mixture of recall periods and response options across PROMs impact face validity and response distributions? What strategies can avoid biased selection of domains that may favour certain effects while downplaying others?

Guidance on the modular approach suggests that item order can affect the validity of subscales because responses may be influenced by the preceding questions.^{2, 4} These potential biases are usually addressed during early development of a PROM, which supports the argument for keeping PROMs in their original format unless there is potential for retesting. However, item order effects are mostly theoretical, with limited research available that has tested these directly.¹⁷ Order effects have primarily been considered at the questionnaire level and conclusions regarding their presence have only been conservatively applied to the PROMs assessed.¹⁷

The modular approach involves replacing existing full-length PROMs with a focused selection of HRQoL domains most relevant to a particular clinical trial context. Several considerations must be addressed before integrating the modular approach into clinical trials, including achieving broader acceptance from regulatory bodies (eg, HTA agencies). These considerations include expanding and clarifying the definition of what constitutes a module and how to apply the modular approach, while ensuring its suitability for economic evaluations. Further evidence that alleviates concerns associated with a modular approach, particularly evidence supporting its psychometric validity, would improve its acceptability in the field.

No relevant disclosures.

Not commissioned; externally peer reviewed.

查看原文本刊更多论文

在临床试验中使用患者报告的结果测量：支持和反对模块化方法的观点。

包括监管机构、卫生技术评估（HTA）机构和专业协会在内的国际利益攸关方越来越认识到，在临床试验中纳入患者报告的结果措施（PROMs）应以明确的理由为指导在这方面，现有的prom在测量与特定研究的背景、人口、治疗和利益相关者最相关的领域方面可能超出或不足。针对评估特定健康相关生活质量（HRQoL）领域的灵活性需求和监管建议，1 PROM开发者和利益攸关方提供了关于模块化方法的指导。2-4尽管大多数指南都集中在癌症临床试验上，但同样的原则也适用于使用PROMs的任何环境（例如临床实践）。在试验中，模块化方法被定义为选择和评估特定的患者相关和临床相关的兴趣域，有目的地从多域prom中选择，然后独立评分和解释。3,4选择最相关的疾病和特定治疗领域可以基于以前的研究、消费者建议和临床经验。利益相关者还建议使用概念框架和核心患者报告结果（PROs）集来指导PROM领域的选择确定为主要或关键次要结果的HRQoL概念理想情况下应该使用最有效、最可靠和最专用的度量来评估，这些度量通常比用于提供广泛概述的多域prom中的域更深入。例如，如果疼痛是一项研究抗癌治疗的试验的主要结果，那么简短疼痛清单可能是一个全面的替代广泛使用的癌症特异性HRQoL PROM的两项疼痛领域，即欧洲癌症研究和治疗组织生活质量核心问卷（EORTC QLQ-C30）次要结果，如食欲减退和便秘，可以通过QLQ-C30的单项域对这些结果进行充分评估。因此，我们提出了模块化方法的三种可能的应用（框1）：(i)使用研究特定的概念框架，结合专用的prom或域来测量相关的HRQoL概念；（ii）使用全长PROM，删除与研究背景不太相关或未被核心PRO集覆盖的域；或（iii）使用全长PROM，用专用PROM或域替代主要或关键次要结果的域。框2给出了支持和反对使用模块化方法的案例。在这篇观点文章中，我们在评估临床疗效和进行经济评估的背景下讨论其中的一些论点。模块化方法可以解决与冗长的PROM相关的应答者负担，以及临床试验中可能缺失或质量差的PRO数据7（方框3，例1）。然而，模块化方法的目的是优先评估每种情况下最重要的领域，而不是减少PROM长度。事实上，如果项目与他们的经历相关，病人可能愿意完成更长时间的测试此外，管理与临床试验背景相关的域可以提高对PROs临床重要变化的敏感性（方框3，例2）。模块化方法还可以灵活地在不同时间点评估不同的域。概念上离疾病和治疗最近的领域（如恶心严重程度）比远端的领域（如情绪功能）更有可能在短时间内发生变化，因此可能需要更频繁的评估。为了保留由模块化方法产生的领域的心理测量特性，鼓励与PROM开发人员合作并遵循已发表的建议如果选择的领域是从目标人群中验证的prom中获得的，则它们可能保留心理测量特性，例如内容和结构效度。由于可能不需要额外的心理测量测试，这对于新的治疗方法可能是有益的，因为评估和批准时间更快，12或者对于患有罕见癌症的患者来说，开发或验证定制的疾病特异性prom可能具有挑战性。对一系列广泛项目的评估促进了将PROMs纳入早期试验（第一阶段或第二阶段），以捕获潜在的症状性不良事件和HRQoL问题。然而，一旦确定了这些问题，研究人员必须决定在后期试验中评估哪些领域，以及评估到什么程度。毫不奇怪，许多人可能默认使用全长prom，以保持与现有研究的可比性，避免遗漏意想不到的效果，并满足HTA机构对经济评估的要求。使问题复杂化的是，模块化方法需要在问题的深度和广度与答辩人的负担（即所花费的时间）之间进行更严格的审查。虽然这一选择可以通过国际指导以及耐心和公众的参与来辅助，但在来自不同PROMs的相同HRQoL概念的大量选择中，这一选择变得不那么明确。比较不同研究背景下prom的心理测量特性的研究有助于这一选择，但传统上，结论是通过评估全长prom而不是其领域得出的。从EORTC套件中获得的子量表和项目的特定组合的心理测量证据正在出现，但缺乏跨不同prom的子量表组合的证据。这就提出了几个问题，需要进一步的概念和经验考虑：是否应该只选择和合并那些表现出最佳心理测量证据的领域？跨prom的回忆期和反应选项的混合将如何影响面对效度和反应分布？有什么策略可以避免偏向于某些效应而轻视其他效应的领域选择？关于模块化方法的指南表明，项目顺序可以影响子量表的有效性，因为回答可能受到前面问题的影响。这些潜在的偏差通常在PROM的早期开发中得到解决，这支持了保持PROM原始格式的论点，除非有可能重新测试。然而，物品顺序效应大多是理论上的，直接测试这些的研究有限顺序效应主要是在问卷水平上考虑的，关于它们存在的结论只保守地应用于评估的prom。模块化方法包括用与特定临床试验环境最相关的HRQoL域的重点选择取代现有的全长prom。在将模块化方法整合到临床试验之前，必须考虑几个问题，包括获得监管机构（如HTA机构）的广泛接受。这些考虑包括扩大和澄清构成模块的定义以及如何应用模块化方法，同时确保其适合于经济评估。进一步的证据减轻了与模块化方法相关的担忧，特别是支持其心理测量效度的证据，将提高其在该领域的可接受性。无相关披露。不是委托;外部同行评审。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Medical Journal of Australia 医学-医学：内科

CiteScore

9.40

自引率

5.30%

发文量

410

审稿时长

3-8 weeks

期刊介绍： The Medical Journal of Australia (MJA) stands as Australia's foremost general medical journal, leading the dissemination of high-quality research and commentary to shape health policy and influence medical practices within the country. Under the leadership of Professor Virginia Barbour, the expert editorial team at MJA is dedicated to providing authors with a constructive and collaborative peer-review and publication process. Established in 1914, the MJA has evolved into a modern journal that upholds its founding values, maintaining a commitment to supporting the medical profession by delivering high-quality and pertinent information essential to medical practice.