Evaluation of lung cancer risk prediction models to select lung cancer screening participants in China: a real-world analysis in regional healthcare big data, Yinzhou, China

IF 7.6 1区 医学 Q1 HEALTH CARE SCIENCES & SERVICES
Ziqing Ye , Yongyue Wei
{"title":"Evaluation of lung cancer risk prediction models to select lung cancer screening participants in China: a real-world analysis in regional healthcare big data, Yinzhou, China","authors":"Ziqing Ye ,&nbsp;Yongyue Wei","doi":"10.1016/j.lanwpc.2024.101354","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>A number of lung cancer prediction models have been developed worldwide. However, few validation studies have been conducted on Chinese populations. The objective of this study is to evaluate the feasibility and efficacy of 17 global lung cancer risk prediction models when applied to Chinese healthcare big data.</div></div><div><h3>Methods</h3><div>The study included individuals with information recorded in the Yinzhou regional health care database from January 1, 2010 to December 31, 2021. Seventeen lung cancer risk prediction models (Bach, Spitz, Hoggart, PLCO<sub>m2012</sub>, Korean Men, PLCO<sub>all2014</sub>, Pittsburgh Predictor, LLPi, LCRAT, HUNT, JPHC, Reduced HUNT, LLPv3, LCRS, OWL, UCL-I, Shanghai-LCM) were evaluated for their performance in overall population and subgroups. The discrimination of the 17 models was assessed using the Harrell's C-index and time-dependent area under the curve (AUC) as metrics. The calibration of the models was evaluated using the expected-to-observed ratio (EOR) and calibration curves. Moreover, the models were recalibrated in the Yinzhou population, and the calibration of the recalibrated models was evaluated.</div></div><div><h3>Findings</h3><div>A total of 907,200 study participants were included in the analysis, comprising 69,263 smokers and 837,937 non-smokers. Of the 17 models initially considered, only 6 (Bach, Hoggart, Pittsburgh Predictor, JPHC, Reduced HUNT, UCL-I) were available in the Yinzhou regional health care database with complete predictor data. Models that predicted risk over a ten-year period (Bach, JPHC, LCRS, and Shanghai-LCM) exhibited C-indices and AUCs of 0.75 or greater in the ever smokers. The majority of models demonstrated an overestimation of incidence risk in the ever smokers and an underestimation in the never smokers. The JPHC and LCRS models exhibited the most optimal calibration curves and the best EOR, whereas the other prediction models had suboptimal calibration. After recalibration, all models showed improved calibration; meanwhile, the JPHC and LCRS models retained the highest level of calibration.</div></div><div><h3>Interpretation</h3><div>Only six models can be directly applied to the Yinzhou regional health care database. The JPHC model developed for the Japanese population and the LCRS model developed based on the China Kadoorie Biobank (CKB) performed better in the Chinese population than other models.</div></div><div><h3>Funding</h3><div>This work was supported by the <span>National Natural Science Foundation of China</span> (82473728 to Y.W.) and <span>Medical and Health Science and Technology Project of Zhejiang Province</span>, China.</div></div>","PeriodicalId":22792,"journal":{"name":"The Lancet Regional Health: Western Pacific","volume":"55 ","pages":"Article 101354"},"PeriodicalIF":7.6000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The Lancet Regional Health: Western Pacific","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666606524003481","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
引用次数: 0

Abstract

Background

A number of lung cancer prediction models have been developed worldwide. However, few validation studies have been conducted on Chinese populations. The objective of this study is to evaluate the feasibility and efficacy of 17 global lung cancer risk prediction models when applied to Chinese healthcare big data.

Methods

The study included individuals with information recorded in the Yinzhou regional health care database from January 1, 2010 to December 31, 2021. Seventeen lung cancer risk prediction models (Bach, Spitz, Hoggart, PLCOm2012, Korean Men, PLCOall2014, Pittsburgh Predictor, LLPi, LCRAT, HUNT, JPHC, Reduced HUNT, LLPv3, LCRS, OWL, UCL-I, Shanghai-LCM) were evaluated for their performance in overall population and subgroups. The discrimination of the 17 models was assessed using the Harrell's C-index and time-dependent area under the curve (AUC) as metrics. The calibration of the models was evaluated using the expected-to-observed ratio (EOR) and calibration curves. Moreover, the models were recalibrated in the Yinzhou population, and the calibration of the recalibrated models was evaluated.

Findings

A total of 907,200 study participants were included in the analysis, comprising 69,263 smokers and 837,937 non-smokers. Of the 17 models initially considered, only 6 (Bach, Hoggart, Pittsburgh Predictor, JPHC, Reduced HUNT, UCL-I) were available in the Yinzhou regional health care database with complete predictor data. Models that predicted risk over a ten-year period (Bach, JPHC, LCRS, and Shanghai-LCM) exhibited C-indices and AUCs of 0.75 or greater in the ever smokers. The majority of models demonstrated an overestimation of incidence risk in the ever smokers and an underestimation in the never smokers. The JPHC and LCRS models exhibited the most optimal calibration curves and the best EOR, whereas the other prediction models had suboptimal calibration. After recalibration, all models showed improved calibration; meanwhile, the JPHC and LCRS models retained the highest level of calibration.

Interpretation

Only six models can be directly applied to the Yinzhou regional health care database. The JPHC model developed for the Japanese population and the LCRS model developed based on the China Kadoorie Biobank (CKB) performed better in the Chinese population than other models.

Funding

This work was supported by the National Natural Science Foundation of China (82473728 to Y.W.) and Medical and Health Science and Technology Project of Zhejiang Province, China.
求助全文
约1分钟内获得全文 求助全文
来源期刊
The Lancet Regional Health: Western Pacific
The Lancet Regional Health: Western Pacific Medicine-Pediatrics, Perinatology and Child Health
CiteScore
8.80
自引率
2.80%
发文量
305
审稿时长
11 weeks
期刊介绍: The Lancet Regional Health – Western Pacific, a gold open access journal, is an integral part of The Lancet's global initiative advocating for healthcare quality and access worldwide. It aims to advance clinical practice and health policy in the Western Pacific region, contributing to enhanced health outcomes. The journal publishes high-quality original research shedding light on clinical practice and health policy in the region. It also includes reviews, commentaries, and opinion pieces covering diverse regional health topics, such as infectious diseases, non-communicable diseases, child and adolescent health, maternal and reproductive health, aging health, mental health, the health workforce and systems, and health policy.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信