Ludovico Cavallaro, Vittoria Ardito, Michael Drummond, Oriana Ciani
{"title":"Machine Learning-Assisted Health Economics and Policy Reviews: A Comparative Assessment.","authors":"Ludovico Cavallaro, Vittoria Ardito, Michael Drummond, Oriana Ciani","doi":"10.1007/s40258-025-00963-y","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>The growth of scientific literature in health economics and policy represents a challenge for researchers conducting literature reviews. This study explores the adoption of a machine learning (ML) tool to enhance title and abstract screening. By retrospectively assessing its performance against the manual screening of a recent scoping review, we aimed to evaluate its reliability and potential for streamlining future reviews.</p><p><strong>Methods: </strong>ASReview was utilised in 'Simulation Mode' to evaluate the percentage of relevant records found (RRF) during title/abstract screening. A dataset of 10,246 unique records from three databases was considered, with 135 relevant records labelled. Performance was assessed across three scenarios with varying levels of prior knowledge (PK) (i.e., 5, 10, or 15 records), using both sampling and heuristic stopping criteria, with 100 simulations conducted for each scenario.</p><p><strong>Results: </strong>The ML tool demonstrated strong performance in facilitating the screening process. Using the sampling criterion, median RRF values stabilised at 97% with 25% of the sample screened, saving reviewers approximately 32 working days. The heuristic criterion showed similar median values, but greater variability due to premature conclusions upon reaching the threshold. While higher PK levels improved early-stage performance, the ML tool's accuracy stabilised as screening progressed, even with minimal PK.</p><p><strong>Conclusions: </strong>This study highlights the potential of ML tools to enhance the efficiency of title and abstract screening in health economics and policy literature reviews. To fully realise this potential, it is essential for regulatory bodies to establish comprehensive guidelines that ensure ML-assisted reviews uphold rigorous evidence quality standards, thereby enhancing their integrity and reliability.</p>","PeriodicalId":8065,"journal":{"name":"Applied Health Economics and Health Policy","volume":" ","pages":""},"PeriodicalIF":3.1000,"publicationDate":"2025-03-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Health Economics and Health Policy","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s40258-025-00963-y","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECONOMICS","Score":null,"Total":0}
引用次数: 0
Abstract
Introduction: The growth of scientific literature in health economics and policy represents a challenge for researchers conducting literature reviews. This study explores the adoption of a machine learning (ML) tool to enhance title and abstract screening. By retrospectively assessing its performance against the manual screening of a recent scoping review, we aimed to evaluate its reliability and potential for streamlining future reviews.
Methods: ASReview was utilised in 'Simulation Mode' to evaluate the percentage of relevant records found (RRF) during title/abstract screening. A dataset of 10,246 unique records from three databases was considered, with 135 relevant records labelled. Performance was assessed across three scenarios with varying levels of prior knowledge (PK) (i.e., 5, 10, or 15 records), using both sampling and heuristic stopping criteria, with 100 simulations conducted for each scenario.
Results: The ML tool demonstrated strong performance in facilitating the screening process. Using the sampling criterion, median RRF values stabilised at 97% with 25% of the sample screened, saving reviewers approximately 32 working days. The heuristic criterion showed similar median values, but greater variability due to premature conclusions upon reaching the threshold. While higher PK levels improved early-stage performance, the ML tool's accuracy stabilised as screening progressed, even with minimal PK.
Conclusions: This study highlights the potential of ML tools to enhance the efficiency of title and abstract screening in health economics and policy literature reviews. To fully realise this potential, it is essential for regulatory bodies to establish comprehensive guidelines that ensure ML-assisted reviews uphold rigorous evidence quality standards, thereby enhancing their integrity and reliability.
期刊介绍:
Applied Health Economics and Health Policy provides timely publication of cutting-edge research and expert opinion from this increasingly important field, making it a vital resource for payers, providers and researchers alike. The journal includes high quality economic research and reviews of all aspects of healthcare from various perspectives and countries, designed to communicate the latest applied information in health economics and health policy.
While emphasis is placed on information with practical applications, a strong basis of underlying scientific rigor is maintained.