[Contribution of the large-scale population cohort in disease risk prediction model study: taking United Kingdom Biobank as an example].

Q1 Medicine
C X Zhu, Y X Song, Y T Hao, F Chen, Y Y Wei
{"title":"[Contribution of the large-scale population cohort in disease risk prediction model study: taking United Kingdom Biobank as an example].","authors":"C X Zhu, Y X Song, Y T Hao, F Chen, Y Y Wei","doi":"10.3760/cma.j.cn112338-20240507-00245","DOIUrl":null,"url":null,"abstract":"<p><p>The disease risk prediction model is the basis of precision prevention and an essential reference for clinical treatment decisions. The development of risk prediction models requires the support of a large amount of high-quality data. A large population cohort study is an important basis for this study. The United Kingdom Biobank (UKB), as a mega-population cohort and biobank, has played an essential role in the exploration of disease etiology and research related to disease prevention and control, with its rich baseline and follow-up data and concepts and mechanisms shared globally. This study followed PRISMA guidelines and included 210 articles with corresponding authors from 18 countries, of which 58 (27.62%) were from the UKB. A total of 491 disease risk prediction models were extracted for cancer, cardiovascular and cerebrovascular diseases, endocrine and metabolic diseases, respiratory diseases, and other diseases and their subgroups, of which 132 were developed by UKB without validation, 183 were developed by UKB with internal validation, 17 were developed by UKB with external validation, and 159 were developed by external development with UKB validation. A total of 188 models used only macro variables (38.29%), and 303 models combined macro and micro variables (61.71%). Model construction methods included survival outcome models, logistic regression, and machine learning. Survival outcome models were dominated by Cox proportional risk regression models and a few models considering competitive risk, accelerated failure models, or different baseline risk functions. Machine learning models included random forest, XGBoost, CatBoost, support vector machine, convolutional neural network, and other methods. The UKB is an essential resource for multiple disease risk prediction modeling studies.</p>","PeriodicalId":23968,"journal":{"name":"中华流行病学杂志","volume":"45 10","pages":"1433-1440"},"PeriodicalIF":0.0000,"publicationDate":"2024-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"中华流行病学杂志","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3760/cma.j.cn112338-20240507-00245","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0

Abstract

The disease risk prediction model is the basis of precision prevention and an essential reference for clinical treatment decisions. The development of risk prediction models requires the support of a large amount of high-quality data. A large population cohort study is an important basis for this study. The United Kingdom Biobank (UKB), as a mega-population cohort and biobank, has played an essential role in the exploration of disease etiology and research related to disease prevention and control, with its rich baseline and follow-up data and concepts and mechanisms shared globally. This study followed PRISMA guidelines and included 210 articles with corresponding authors from 18 countries, of which 58 (27.62%) were from the UKB. A total of 491 disease risk prediction models were extracted for cancer, cardiovascular and cerebrovascular diseases, endocrine and metabolic diseases, respiratory diseases, and other diseases and their subgroups, of which 132 were developed by UKB without validation, 183 were developed by UKB with internal validation, 17 were developed by UKB with external validation, and 159 were developed by external development with UKB validation. A total of 188 models used only macro variables (38.29%), and 303 models combined macro and micro variables (61.71%). Model construction methods included survival outcome models, logistic regression, and machine learning. Survival outcome models were dominated by Cox proportional risk regression models and a few models considering competitive risk, accelerated failure models, or different baseline risk functions. Machine learning models included random forest, XGBoost, CatBoost, support vector machine, convolutional neural network, and other methods. The UKB is an essential resource for multiple disease risk prediction modeling studies.

[大规模人群队列在疾病风险预测模型研究中的贡献:以英国生物库为例]。
疾病风险预测模型是精准预防的基础,也是临床治疗决策的重要参考。风险预测模型的建立需要大量高质量数据的支持。大型人群队列研究是这一研究的重要基础。英国生物库(UKB)作为一个超大规模的人群队列和生物库,以其丰富的基线和随访数据以及全球共享的理念和机制,在疾病病因学探索和疾病防控相关研究中发挥了至关重要的作用。本研究遵循 PRISMA 指南,收录了来自 18 个国家、有通讯作者的 210 篇文章,其中 58 篇(27.62%)来自英国生物库。共提取了 491 个疾病风险预测模型,涉及癌症、心脑血管疾病、内分泌和代谢疾病、呼吸系统疾病、其他疾病及其亚组,其中 132 个模型由英国生物统计局开发,未经验证;183 个模型由英国生物统计局开发,经内部验证;17 个模型由英国生物统计局开发,经外部验证;159 个模型由外部开发,经英国生物统计局验证。共有 188 个模型只使用了宏观变量(占 38.29%),303 个模型结合了宏观和微观变量(占 61.71%)。模型构建方法包括生存结果模型、逻辑回归和机器学习。生存结果模型以 Cox 比例风险回归模型为主,少数模型考虑了竞争风险、加速失败模型或不同的基线风险函数。机器学习模型包括随机森林、XGBoost、CatBoost、支持向量机、卷积神经网络和其他方法。UKB 是多种疾病风险预测建模研究的重要资源。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
中华流行病学杂志
中华流行病学杂志 Medicine-Medicine (all)
CiteScore
5.60
自引率
0.00%
发文量
8981
期刊介绍: Chinese Journal of Epidemiology, established in 1981, is an advanced academic periodical in epidemiology and related disciplines in China, which, according to the principle of integrating theory with practice, mainly reports the major progress in epidemiological research. The columns of the journal include commentary, expert forum, original article, field investigation, disease surveillance, laboratory research, clinical epidemiology, basic theory or method and review, etc.  The journal is included by more than ten major biomedical databases and index systems worldwide, such as been indexed in Scopus, PubMed/MEDLINE, PubMed Central (PMC), Europe PubMed Central, Embase, Chemical Abstract, Chinese Science and Technology Paper and Citation Database (CSTPCD), Chinese core journal essentials overview, Chinese Science Citation Database (CSCD) core database, Chinese Biological Medical Disc (CBMdisc), and Chinese Medical Citation Index (CMCI), etc. It is one of the core academic journals and carefully selected core journals in preventive and basic medicine in China.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信