Tianqi Hong PhD , Shitong Xie PhD , Xinran Liu MSc , Jing Wu PhD , Gang Chen PhD
{"title":"机器学习方法在映射研究中比回归模型表现更好吗?系统评价。","authors":"Tianqi Hong PhD , Shitong Xie PhD , Xinran Liu MSc , Jing Wu PhD , Gang Chen PhD","doi":"10.1016/j.jval.2024.12.010","DOIUrl":null,"url":null,"abstract":"<div><h3>Objectives</h3><div>To identify how machine learning (ML) approaches were implemented in mapping studies and to determine the extent to which ML improved performance compared with regression models (RMs).</div></div><div><h3>Methods</h3><div>A systematic literature search was conducted in 12 databases from inception to December 2023 to identify studies that applied ML to develop mapping algorithms. A data template was applied to extract data set information, source and target measures, ML approaches and RMs, mapping types (direct vs indirect), goodness-of-fit indicators (mean absolute error, mean squared error, root mean squared error, R-squared, and intraclass correlation coefficient), and validation methods. Differences in goodness-of-fit indicators between ML and RMs were summarized. Potential advantages and challenges for ML were further discussed.</div></div><div><h3>Results</h3><div>Thirteen mapping studies were identified, in which both ML and RM were adopted. Bayesian networks were the most frequently used ML approach (n = 6), followed by the least absolute shrinkage and selection operator (n = 4). The ordinary least square model was the most used RM (n = 8), followed by the censored least absolute deviation and multinomial logit models (n = 5 each). The average improvement in the goodness-of-fit of ML compared with that of RMs by indicators were 0.007 (mean absolute error), 0.004 (mean squared error), 0.058 (R-squared), 0.016 (intraclass correlation coefficient), and −0.0004 (root mean squared error).</div></div><div><h3>Conclusions</h3><div>There is an increasing number of studies using ML in developing mapping algorithms. Generally, a minor improvement of goodness-of-fit was observed compared with RMs when using mean-based comparisons. Issues such as how to interpret, apply, and externally validate the ML-based outputs would affect their implementation. Future studies are warranted to verify advantages of ML approaches.</div></div>","PeriodicalId":23508,"journal":{"name":"Value in Health","volume":"28 5","pages":"Pages 800-811"},"PeriodicalIF":4.9000,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Do Machine Learning Approaches Perform Better Than Regression Models in Mapping Studies? A Systematic Review\",\"authors\":\"Tianqi Hong PhD , Shitong Xie PhD , Xinran Liu MSc , Jing Wu PhD , Gang Chen PhD\",\"doi\":\"10.1016/j.jval.2024.12.010\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Objectives</h3><div>To identify how machine learning (ML) approaches were implemented in mapping studies and to determine the extent to which ML improved performance compared with regression models (RMs).</div></div><div><h3>Methods</h3><div>A systematic literature search was conducted in 12 databases from inception to December 2023 to identify studies that applied ML to develop mapping algorithms. A data template was applied to extract data set information, source and target measures, ML approaches and RMs, mapping types (direct vs indirect), goodness-of-fit indicators (mean absolute error, mean squared error, root mean squared error, R-squared, and intraclass correlation coefficient), and validation methods. Differences in goodness-of-fit indicators between ML and RMs were summarized. Potential advantages and challenges for ML were further discussed.</div></div><div><h3>Results</h3><div>Thirteen mapping studies were identified, in which both ML and RM were adopted. Bayesian networks were the most frequently used ML approach (n = 6), followed by the least absolute shrinkage and selection operator (n = 4). The ordinary least square model was the most used RM (n = 8), followed by the censored least absolute deviation and multinomial logit models (n = 5 each). The average improvement in the goodness-of-fit of ML compared with that of RMs by indicators were 0.007 (mean absolute error), 0.004 (mean squared error), 0.058 (R-squared), 0.016 (intraclass correlation coefficient), and −0.0004 (root mean squared error).</div></div><div><h3>Conclusions</h3><div>There is an increasing number of studies using ML in developing mapping algorithms. Generally, a minor improvement of goodness-of-fit was observed compared with RMs when using mean-based comparisons. Issues such as how to interpret, apply, and externally validate the ML-based outputs would affect their implementation. Future studies are warranted to verify advantages of ML approaches.</div></div>\",\"PeriodicalId\":23508,\"journal\":{\"name\":\"Value in Health\",\"volume\":\"28 5\",\"pages\":\"Pages 800-811\"},\"PeriodicalIF\":4.9000,\"publicationDate\":\"2025-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Value in Health\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1098301525000452\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ECONOMICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Value in Health","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1098301525000452","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECONOMICS","Score":null,"Total":0}
Do Machine Learning Approaches Perform Better Than Regression Models in Mapping Studies? A Systematic Review
Objectives
To identify how machine learning (ML) approaches were implemented in mapping studies and to determine the extent to which ML improved performance compared with regression models (RMs).
Methods
A systematic literature search was conducted in 12 databases from inception to December 2023 to identify studies that applied ML to develop mapping algorithms. A data template was applied to extract data set information, source and target measures, ML approaches and RMs, mapping types (direct vs indirect), goodness-of-fit indicators (mean absolute error, mean squared error, root mean squared error, R-squared, and intraclass correlation coefficient), and validation methods. Differences in goodness-of-fit indicators between ML and RMs were summarized. Potential advantages and challenges for ML were further discussed.
Results
Thirteen mapping studies were identified, in which both ML and RM were adopted. Bayesian networks were the most frequently used ML approach (n = 6), followed by the least absolute shrinkage and selection operator (n = 4). The ordinary least square model was the most used RM (n = 8), followed by the censored least absolute deviation and multinomial logit models (n = 5 each). The average improvement in the goodness-of-fit of ML compared with that of RMs by indicators were 0.007 (mean absolute error), 0.004 (mean squared error), 0.058 (R-squared), 0.016 (intraclass correlation coefficient), and −0.0004 (root mean squared error).
Conclusions
There is an increasing number of studies using ML in developing mapping algorithms. Generally, a minor improvement of goodness-of-fit was observed compared with RMs when using mean-based comparisons. Issues such as how to interpret, apply, and externally validate the ML-based outputs would affect their implementation. Future studies are warranted to verify advantages of ML approaches.
期刊介绍:
Value in Health contains original research articles for pharmacoeconomics, health economics, and outcomes research (clinical, economic, and patient-reported outcomes/preference-based research), as well as conceptual and health policy articles that provide valuable information for health care decision-makers as well as the research community. As the official journal of ISPOR, Value in Health provides a forum for researchers, as well as health care decision-makers to translate outcomes research into health care decisions.