开发机器学习模型来预测日本健康检查参与的概率。

IF 3.2 3区 医学 Q1 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH
Public Health Pub Date : 2025-10-01 Epub Date: 2025-08-07 DOI:10.1016/j.puhe.2025.105889
Asuka Oyama, Midori Noguchi
{"title":"开发机器学习模型来预测日本健康检查参与的概率。","authors":"Asuka Oyama, Midori Noguchi","doi":"10.1016/j.puhe.2025.105889","DOIUrl":null,"url":null,"abstract":"<p><strong>Objectives: </strong>Enhancing health checkup participation is crucial for early detection and treatment of noncommunicable diseases and for improving public health. Effectively increasing health checkup rates requires identifying and encouraging individuals likely to adopt health-oriented behaviours. We aimed to develop a machine learning model to predict the participation probability in a specific health checkup in the following year.</p><p><strong>Study design: </strong>Retrospective cohort study.</p><p><strong>Methods: </strong>We analysed data from 58,863 National Health Insurance-insured individuals in Kochi Prefecture, Japan, who underwent specific health checkups during the fiscal years (FYs) 2013-2017. The dataset includes physical measurements, blood pressure measurements, blood and urine tests, and self-reported questionnaires. Predictive models for FY2018 participation were developed using LightGBM and evaluated using the area under the receiver operating characteristic curve (AUC) and reliability curves. SHAP was used to assess the feature's importance. External validation for FY2019 and FY2020 assessed temporal robustness.</p><p><strong>Results: </strong>Predictive accuracy for FY2018 was high, with AUCs of 0.824 (95 % confidence interval [95 % CI]: 0.813-0.835) for men and 0.820 (95 % CI: 0.810-0.830) for women. External validation of FY2019 showed AUCs of 0.821 and 0.807 for men and women, respectively. In FY2020, prediction accuracy declined, with AUCs of 0.798 and 0.794 for men and women, respectively. Key predictive features included years since the last checkup, past checkup frequency, age, systolic blood pressure, and lifestyle factors.</p><p><strong>Conclusions: </strong>By developing an accurate model to predict future health checkup participation, we identified a novel indicator that enables efficient, optimized recommendations and may help improve participation rates.</p>","PeriodicalId":49651,"journal":{"name":"Public Health","volume":"247 ","pages":"105889"},"PeriodicalIF":3.2000,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Development of a machine learning model to predict the probability of health checkup participation in Japan.\",\"authors\":\"Asuka Oyama, Midori Noguchi\",\"doi\":\"10.1016/j.puhe.2025.105889\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Objectives: </strong>Enhancing health checkup participation is crucial for early detection and treatment of noncommunicable diseases and for improving public health. Effectively increasing health checkup rates requires identifying and encouraging individuals likely to adopt health-oriented behaviours. We aimed to develop a machine learning model to predict the participation probability in a specific health checkup in the following year.</p><p><strong>Study design: </strong>Retrospective cohort study.</p><p><strong>Methods: </strong>We analysed data from 58,863 National Health Insurance-insured individuals in Kochi Prefecture, Japan, who underwent specific health checkups during the fiscal years (FYs) 2013-2017. The dataset includes physical measurements, blood pressure measurements, blood and urine tests, and self-reported questionnaires. Predictive models for FY2018 participation were developed using LightGBM and evaluated using the area under the receiver operating characteristic curve (AUC) and reliability curves. SHAP was used to assess the feature's importance. External validation for FY2019 and FY2020 assessed temporal robustness.</p><p><strong>Results: </strong>Predictive accuracy for FY2018 was high, with AUCs of 0.824 (95 % confidence interval [95 % CI]: 0.813-0.835) for men and 0.820 (95 % CI: 0.810-0.830) for women. External validation of FY2019 showed AUCs of 0.821 and 0.807 for men and women, respectively. In FY2020, prediction accuracy declined, with AUCs of 0.798 and 0.794 for men and women, respectively. Key predictive features included years since the last checkup, past checkup frequency, age, systolic blood pressure, and lifestyle factors.</p><p><strong>Conclusions: </strong>By developing an accurate model to predict future health checkup participation, we identified a novel indicator that enables efficient, optimized recommendations and may help improve participation rates.</p>\",\"PeriodicalId\":49651,\"journal\":{\"name\":\"Public Health\",\"volume\":\"247 \",\"pages\":\"105889\"},\"PeriodicalIF\":3.2000,\"publicationDate\":\"2025-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Public Health\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1016/j.puhe.2025.105889\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/8/7 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Public Health","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.puhe.2025.105889","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/8/7 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH","Score":null,"Total":0}
引用次数: 0

摘要

目标:加强健康检查的参与对于非传染性疾病的早期发现和治疗以及改善公众健康至关重要。要有效提高健康检查率,就必须查明并鼓励可能采取注重健康行为的个人。我们的目标是开发一个机器学习模型来预测第二年参加特定健康检查的概率。研究设计:回顾性队列研究。方法:我们分析了日本高知县58,863名国民健康保险参保人员的数据,这些人在2013-2017财政年度(FYs)期间接受了特定的健康检查。该数据集包括身体测量、血压测量、血液和尿液测试以及自我报告的问卷。使用LightGBM开发了2018财年参与的预测模型,并使用接受者工作特征曲线(AUC)和可靠性曲线下的面积进行了评估。使用SHAP来评估特征的重要性。2019财年和2020财年的外部验证评估了时间稳健性。结果:2018财年的预测准确性很高,男性的auc为0.824(95%可信区间[95% CI]: 0.813-0.835),女性为0.820 (95% CI: 0.810-0.830)。2019财年的外部验证显示,男性和女性的auc分别为0.821和0.807。在2020财年,预测准确性下降,男性和女性的auc分别为0.798和0.794。关键的预测特征包括自上次检查以来的年份、过去的检查频率、年龄、收缩压和生活方式因素。结论:通过建立一个准确的模型来预测未来的健康检查参与率,我们确定了一个新的指标,可以实现高效、优化的建议,并可能有助于提高参与率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Development of a machine learning model to predict the probability of health checkup participation in Japan.

Objectives: Enhancing health checkup participation is crucial for early detection and treatment of noncommunicable diseases and for improving public health. Effectively increasing health checkup rates requires identifying and encouraging individuals likely to adopt health-oriented behaviours. We aimed to develop a machine learning model to predict the participation probability in a specific health checkup in the following year.

Study design: Retrospective cohort study.

Methods: We analysed data from 58,863 National Health Insurance-insured individuals in Kochi Prefecture, Japan, who underwent specific health checkups during the fiscal years (FYs) 2013-2017. The dataset includes physical measurements, blood pressure measurements, blood and urine tests, and self-reported questionnaires. Predictive models for FY2018 participation were developed using LightGBM and evaluated using the area under the receiver operating characteristic curve (AUC) and reliability curves. SHAP was used to assess the feature's importance. External validation for FY2019 and FY2020 assessed temporal robustness.

Results: Predictive accuracy for FY2018 was high, with AUCs of 0.824 (95 % confidence interval [95 % CI]: 0.813-0.835) for men and 0.820 (95 % CI: 0.810-0.830) for women. External validation of FY2019 showed AUCs of 0.821 and 0.807 for men and women, respectively. In FY2020, prediction accuracy declined, with AUCs of 0.798 and 0.794 for men and women, respectively. Key predictive features included years since the last checkup, past checkup frequency, age, systolic blood pressure, and lifestyle factors.

Conclusions: By developing an accurate model to predict future health checkup participation, we identified a novel indicator that enables efficient, optimized recommendations and may help improve participation rates.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Public Health
Public Health 医学-公共卫生、环境卫生与职业卫生
CiteScore
7.60
自引率
0.00%
发文量
280
审稿时长
37 days
期刊介绍: Public Health is an international, multidisciplinary peer-reviewed journal. It publishes original papers, reviews and short reports on all aspects of the science, philosophy, and practice of public health.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信