{"title":"利用机器学习和传统回归方法将EORTC QLQ-C30和QLQ-LC13与肺癌患者的SF-6D效用指数进行映射。","authors":"Longlin Jiang, Kexun Li, Simiao Lu, Zhou Hong, Yifang Wang, Qin Xie, Qin He, Sirui Wei, Aoru Zhou, Hong Kang, Xuefeng Leng, Qing Yang, Yan Miao","doi":"10.1186/s12955-025-02394-8","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Preference-based measures of health-related quality of life (HRQoL), such as the Short Form Six-Dimension (SF-6D) is essential for health economic evaluations. However, these measures are rarely included in clinical trials for lung cancer. This study aims to develop mapping algorithms to predict SF-6D health utility scores from the European Organization for Research and Treatment of Cancer Quality of Life Questionnaire Core (EORTC QLQ-C30) and Quality of Life Questionnaire-Lung Cancer 13 (QLQ-LC13).</p><p><strong>Method: </strong>The study sample comprised a Chinese population with lung cancer (n = 625). Traditional regression techniques, including Ordinary Least Squares regression, Generalized Linear Model, as well as machine learning techniques, such as Gradient Boosting Tree, Support Vector Regression, Ridge Regression are used. Five-fold cross-validation was performed. The performance metrics used to evaluate the models including R<sup>2</sup>, root mean square error (RMSE),mean absolute error (MAE) and mean absolute percentage error (MAPE) were used to screen the optimal model.</p><p><strong>Results: </strong>The mean and median of SF-6D health utility values were 0.774 (SD = 0.154) and 7.795, respectively. The model with the best mapping performance was the Ridge regression model Five-fold cross-validation (CV) results show that the Ridge regression model has the best mapping performance, the final prediction indexes are R<sup>2</sup> = 0.753, RMSE = 0.074, MAE = 0.057, MAPE = 8.169%.</p><p><strong>Conclusions: </strong>This study developed an optimized mapping algorithm to predict the utility index from the QLQ-C30 QLQ-LC13 to the SF-6D. This algorithm offers provides an effective alternative for estimating SF-6D estimation when the preference-based health utility values are unavailable.</p>","PeriodicalId":12980,"journal":{"name":"Health and Quality of Life Outcomes","volume":"23 1","pages":"66"},"PeriodicalIF":3.2000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12220268/pdf/","citationCount":"0","resultStr":"{\"title\":\"Mapping the EORTC QLQ-C30 and QLQ-LC13 to the SF-6D utility index in patients with lung cancer using machine learning and traditional regression methods.\",\"authors\":\"Longlin Jiang, Kexun Li, Simiao Lu, Zhou Hong, Yifang Wang, Qin Xie, Qin He, Sirui Wei, Aoru Zhou, Hong Kang, Xuefeng Leng, Qing Yang, Yan Miao\",\"doi\":\"10.1186/s12955-025-02394-8\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Preference-based measures of health-related quality of life (HRQoL), such as the Short Form Six-Dimension (SF-6D) is essential for health economic evaluations. However, these measures are rarely included in clinical trials for lung cancer. This study aims to develop mapping algorithms to predict SF-6D health utility scores from the European Organization for Research and Treatment of Cancer Quality of Life Questionnaire Core (EORTC QLQ-C30) and Quality of Life Questionnaire-Lung Cancer 13 (QLQ-LC13).</p><p><strong>Method: </strong>The study sample comprised a Chinese population with lung cancer (n = 625). Traditional regression techniques, including Ordinary Least Squares regression, Generalized Linear Model, as well as machine learning techniques, such as Gradient Boosting Tree, Support Vector Regression, Ridge Regression are used. Five-fold cross-validation was performed. The performance metrics used to evaluate the models including R<sup>2</sup>, root mean square error (RMSE),mean absolute error (MAE) and mean absolute percentage error (MAPE) were used to screen the optimal model.</p><p><strong>Results: </strong>The mean and median of SF-6D health utility values were 0.774 (SD = 0.154) and 7.795, respectively. The model with the best mapping performance was the Ridge regression model Five-fold cross-validation (CV) results show that the Ridge regression model has the best mapping performance, the final prediction indexes are R<sup>2</sup> = 0.753, RMSE = 0.074, MAE = 0.057, MAPE = 8.169%.</p><p><strong>Conclusions: </strong>This study developed an optimized mapping algorithm to predict the utility index from the QLQ-C30 QLQ-LC13 to the SF-6D. This algorithm offers provides an effective alternative for estimating SF-6D estimation when the preference-based health utility values are unavailable.</p>\",\"PeriodicalId\":12980,\"journal\":{\"name\":\"Health and Quality of Life Outcomes\",\"volume\":\"23 1\",\"pages\":\"66\"},\"PeriodicalIF\":3.2000,\"publicationDate\":\"2025-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12220268/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Health and Quality of Life Outcomes\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1186/s12955-025-02394-8\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"HEALTH CARE SCIENCES & SERVICES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Health and Quality of Life Outcomes","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12955-025-02394-8","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
Mapping the EORTC QLQ-C30 and QLQ-LC13 to the SF-6D utility index in patients with lung cancer using machine learning and traditional regression methods.
Background: Preference-based measures of health-related quality of life (HRQoL), such as the Short Form Six-Dimension (SF-6D) is essential for health economic evaluations. However, these measures are rarely included in clinical trials for lung cancer. This study aims to develop mapping algorithms to predict SF-6D health utility scores from the European Organization for Research and Treatment of Cancer Quality of Life Questionnaire Core (EORTC QLQ-C30) and Quality of Life Questionnaire-Lung Cancer 13 (QLQ-LC13).
Method: The study sample comprised a Chinese population with lung cancer (n = 625). Traditional regression techniques, including Ordinary Least Squares regression, Generalized Linear Model, as well as machine learning techniques, such as Gradient Boosting Tree, Support Vector Regression, Ridge Regression are used. Five-fold cross-validation was performed. The performance metrics used to evaluate the models including R2, root mean square error (RMSE),mean absolute error (MAE) and mean absolute percentage error (MAPE) were used to screen the optimal model.
Results: The mean and median of SF-6D health utility values were 0.774 (SD = 0.154) and 7.795, respectively. The model with the best mapping performance was the Ridge regression model Five-fold cross-validation (CV) results show that the Ridge regression model has the best mapping performance, the final prediction indexes are R2 = 0.753, RMSE = 0.074, MAE = 0.057, MAPE = 8.169%.
Conclusions: This study developed an optimized mapping algorithm to predict the utility index from the QLQ-C30 QLQ-LC13 to the SF-6D. This algorithm offers provides an effective alternative for estimating SF-6D estimation when the preference-based health utility values are unavailable.
期刊介绍:
Health and Quality of Life Outcomes is an open access, peer-reviewed, journal offering high quality articles, rapid publication and wide diffusion in the public domain.
Health and Quality of Life Outcomes considers original manuscripts on the Health-Related Quality of Life (HRQOL) assessment for evaluation of medical and psychosocial interventions. It also considers approaches and studies on psychometric properties of HRQOL and patient reported outcome measures, including cultural validation of instruments if they provide information about the impact of interventions. The journal publishes study protocols and reviews summarising the present state of knowledge concerning a particular aspect of HRQOL and patient reported outcome measures. Reviews should generally follow systematic review methodology. Comments on articles and letters to the editor are welcome.