Sample Size and Model Prediction Accuracy in EQ-5D-5L Valuations Studies: Expected Out-of-Sample Accuracy Based on Resampling with Different Sample Sizes and Alternative Model Specifications

IF 1.9 Q3 HEALTH CARE SCIENCES & SERVICES
T. M. Hansen, K. Stavem, K. Rand
{"title":"Sample Size and Model Prediction Accuracy in EQ-5D-5L Valuations Studies: Expected Out-of-Sample Accuracy Based on Resampling with Different Sample Sizes and Alternative Model Specifications","authors":"T. M. Hansen, K. Stavem, K. Rand","doi":"10.1177/23814683221083839","DOIUrl":null,"url":null,"abstract":"Background. National valuation studies are costly, with ∼1000 face-to-face interviews recommended, and some countries may deem such studies infeasible. Building on previous studies exploring sample size, we determined the effect of sample size and alternative model specifications on prediction accuracy of modeled coefficients in EQ-5D-5L value set generating regression analyses. Methods. Data sets (n = 50 to ∼1000) were simulated from 3 valuation studies, resampled at the respondent level and randomly drawn 1000 times with replacement. We estimated utilities for each subsample with leave-one-out at the block level using regression models (8 or 20 parameter; with or without a random intercept; time tradeoff [TTO] data only or TTO + discrete choice experiment [DCE] data). Prediction accuracy, root mean square error (RMSE), was calculated by comparing to censored mean predicted values to the left-out block in the full data set. Linear regression was used to estimate the relative effect of changes in sample size and each model specification. Results. Results showed that doubling the sample size decreased RMSE by on average 0.012. Effects of other model specifications were smaller but can when combined compensate for loss in prediction accuracy from a small sample size. For models using TTO data only, 8-parameter models clearly outperformed 20-parameter models. Adding a random intercept, or including DCE responses, also improved mean RMSE, most prominently for variants of the 20-parameter models. Conclusions. The prediction accuracy impact of further increases in sample size after 300 to 500 were smaller than the impact of combining alternative modeling choices. Hybrid modeling, use of constrained models, and inclusion of random intercepts all substantially improve the expected prediction accuracy. Beyond a minimum of 300 to 500 respondents, the sample size may be better informed by other considerations, such as legitimacy and representativeness, than by the technical prediction accuracy achievable. Highlights Increases in sample size beyond a minimum in the range of 300 to 500 respondents provide smaller gains in expected prediction accuracy than alternative modeling approaches. Constrained, nonlinear models; time tradeoff + discrete choice experiment hybrid modeling; and including a random intercept all improved the prediction accuracy of models estimating values for the EQ-5D-5L based on data from 3 different valuation studies. The tested modeling choices can compensate for smaller sample sizes.","PeriodicalId":36567,"journal":{"name":"MDM Policy and Practice","volume":" ","pages":""},"PeriodicalIF":1.9000,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"MDM Policy and Practice","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1177/23814683221083839","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
引用次数: 7

Abstract

Background. National valuation studies are costly, with ∼1000 face-to-face interviews recommended, and some countries may deem such studies infeasible. Building on previous studies exploring sample size, we determined the effect of sample size and alternative model specifications on prediction accuracy of modeled coefficients in EQ-5D-5L value set generating regression analyses. Methods. Data sets (n = 50 to ∼1000) were simulated from 3 valuation studies, resampled at the respondent level and randomly drawn 1000 times with replacement. We estimated utilities for each subsample with leave-one-out at the block level using regression models (8 or 20 parameter; with or without a random intercept; time tradeoff [TTO] data only or TTO + discrete choice experiment [DCE] data). Prediction accuracy, root mean square error (RMSE), was calculated by comparing to censored mean predicted values to the left-out block in the full data set. Linear regression was used to estimate the relative effect of changes in sample size and each model specification. Results. Results showed that doubling the sample size decreased RMSE by on average 0.012. Effects of other model specifications were smaller but can when combined compensate for loss in prediction accuracy from a small sample size. For models using TTO data only, 8-parameter models clearly outperformed 20-parameter models. Adding a random intercept, or including DCE responses, also improved mean RMSE, most prominently for variants of the 20-parameter models. Conclusions. The prediction accuracy impact of further increases in sample size after 300 to 500 were smaller than the impact of combining alternative modeling choices. Hybrid modeling, use of constrained models, and inclusion of random intercepts all substantially improve the expected prediction accuracy. Beyond a minimum of 300 to 500 respondents, the sample size may be better informed by other considerations, such as legitimacy and representativeness, than by the technical prediction accuracy achievable. Highlights Increases in sample size beyond a minimum in the range of 300 to 500 respondents provide smaller gains in expected prediction accuracy than alternative modeling approaches. Constrained, nonlinear models; time tradeoff + discrete choice experiment hybrid modeling; and including a random intercept all improved the prediction accuracy of models estimating values for the EQ-5D-5L based on data from 3 different valuation studies. The tested modeling choices can compensate for smaller sample sizes.
EQ-5D-5L估值研究中的样本量和模型预测精度:基于不同样本量和可选模型规格重采样的期望样本外精度
背景。国家评估研究是昂贵的,建议进行约1000次面对面访谈,一些国家可能认为这种研究是不可行的。在以往样本量研究的基础上,我们确定了样本量和可选模型规格对EQ-5D-5L值集模型系数预测精度的影响。方法。数据集(n = 50 ~ 1000)从3个评估研究中模拟,在被调查者水平上重新抽样,并随机抽取1000次。我们使用回归模型(8或20个参数;有或没有随机截距的;时间权衡[TTO]数据或TTO +离散选择实验[DCE]数据)。预测精度,即均方根误差(RMSE),是通过将删减后的均值预测值与整个数据集中的遗漏块进行比较来计算的。线性回归用于估计样本量和每个模型规格变化的相对影响。结果。结果表明,样本量增加一倍,均方根误差平均降低0.012。其他模型规格的影响较小,但结合起来可以弥补小样本量的预测精度损失。对于仅使用TTO数据的模型,8参数模型明显优于20参数模型。添加随机截距,或包括DCE响应,也提高了平均RMSE,最显著的是20参数模型的变体。结论。在300到500之后进一步增加样本量对预测精度的影响小于组合其他模型选择的影响。混合建模、约束模型的使用以及随机截点的包含都大大提高了预期的预测精度。在至少300至500名受访者之外,考虑合法性和代表性等其他因素,可能比考虑可实现的技术预测准确性更好地了解样本量。在300到500个被调查者的最小范围内,样本量的增加比其他建模方法在预期预测精度方面的收益要小。约束的非线性模型;时间权衡+离散选择实验混合建模;并纳入随机截距,均提高了基于3个不同估值研究数据的EQ-5D-5L估值模型的预测精度。经过测试的建模选择可以补偿较小的样本量。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
MDM Policy and Practice
MDM Policy and Practice Medicine-Health Policy
CiteScore
2.50
自引率
0.00%
发文量
28
审稿时长
15 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信