Age Estimation From Blood Test Results Using a Random Forest Model.

IF 2.6 4区 医学 Q2 MEDICAL LABORATORY TECHNOLOGY
Satomi Kodera, Osamu Yokoi, Masaki Kaneko, Yuka Sato, Susumu Ito, Katsuhiko Hata
{"title":"Age Estimation From Blood Test Results Using a Random Forest Model.","authors":"Satomi Kodera, Osamu Yokoi, Masaki Kaneko, Yuka Sato, Susumu Ito, Katsuhiko Hata","doi":"10.1002/jcla.70064","DOIUrl":null,"url":null,"abstract":"<p><strong>Background and objectives: </strong>From a preventive medicine perspective, this study aims to clarify the role of screening data in aging and health problems by estimating age from screening data and verifying the number of data items required in widely used screening tests.</p><p><strong>Materials and methods: </strong>A random forest model was applied to 11554 men and women (3043 and 8511, respectively) aged 0-95 years who underwent screening tests (60 blood tests, 8 urine tests and 2 saliva tests) between February 2020 and August 2023. All analyses were conducted in Python 3.10.12.</p><p><strong>Results: </strong>Using all 71 items including gender, a high accuracy of R<sup>2</sup> = 0.7010 was achieved with 9243 training datasets (80% of total). R<sup>2</sup> decreased slightly to 0.6937 when data items were reduced to 15 by removing less important variables. When datasets numbered fewer than 800 or data items fewer than 7, R<sup>2</sup> fell below 0.6. Notably, postmenopausal women tended to have higher estimated ages compared to premenopausal women.</p><p><strong>Conclusions: </strong>Age estimation from blood data using the random forest model (blood age) is sufficiently precise for assessing physical aging state. Blood age, as well as other biological ages estimated from various omics estimators, was shown to be a very promising method for exploring the problems of aging such as metabolic syndrome and frail syndrome.</p>","PeriodicalId":15509,"journal":{"name":"Journal of Clinical Laboratory Analysis","volume":" ","pages":"e70064"},"PeriodicalIF":2.6000,"publicationDate":"2025-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Clinical Laboratory Analysis","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1002/jcla.70064","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICAL LABORATORY TECHNOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Background and objectives: From a preventive medicine perspective, this study aims to clarify the role of screening data in aging and health problems by estimating age from screening data and verifying the number of data items required in widely used screening tests.

Materials and methods: A random forest model was applied to 11554 men and women (3043 and 8511, respectively) aged 0-95 years who underwent screening tests (60 blood tests, 8 urine tests and 2 saliva tests) between February 2020 and August 2023. All analyses were conducted in Python 3.10.12.

Results: Using all 71 items including gender, a high accuracy of R2 = 0.7010 was achieved with 9243 training datasets (80% of total). R2 decreased slightly to 0.6937 when data items were reduced to 15 by removing less important variables. When datasets numbered fewer than 800 or data items fewer than 7, R2 fell below 0.6. Notably, postmenopausal women tended to have higher estimated ages compared to premenopausal women.

Conclusions: Age estimation from blood data using the random forest model (blood age) is sufficiently precise for assessing physical aging state. Blood age, as well as other biological ages estimated from various omics estimators, was shown to be a very promising method for exploring the problems of aging such as metabolic syndrome and frail syndrome.

使用随机森林模型从血液测试结果估计年龄。
背景与目的:本研究旨在从预防医学的角度出发,通过对筛查数据进行年龄估算,并验证广泛使用的筛查试验所需数据项的数量,阐明筛查数据在老龄化与健康问题中的作用。材料与方法:随机森林模型应用于11554名年龄在0-95岁之间的男性和女性(分别为3043名和8511名),他们在2020年2月至2023年8月期间接受了筛查测试(60项血液测试、8项尿液测试和2项唾液测试)。所有分析均在Python 3.10.12中进行。结果:使用包括性别在内的全部71个条目,9243个训练数据集(占总数的80%)获得了R2 = 0.7010的高准确率。当通过删除不太重要的变量将数据项减少到15时,R2略微下降到0.6937。当数据集少于800或数据项少于7时,R2小于0.6。值得注意的是,与绝经前妇女相比,绝经后妇女的估计年龄往往更高。结论:使用随机森林模型(血液年龄)从血液数据中估计年龄,对于评估身体衰老状态是足够精确的。血液年龄,以及从各种组学估计器估计的其他生物年龄,被证明是一种非常有前途的方法来探索衰老问题,如代谢综合征和虚弱综合征。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Clinical Laboratory Analysis
Journal of Clinical Laboratory Analysis 医学-医学实验技术
CiteScore
5.60
自引率
7.40%
发文量
584
审稿时长
6-12 weeks
期刊介绍: Journal of Clinical Laboratory Analysis publishes original articles on newly developing modes of technology and laboratory assays, with emphasis on their application in current and future clinical laboratory testing. This includes reports from the following fields: immunochemistry and toxicology, hematology and hematopathology, immunopathology, molecular diagnostics, microbiology, genetic testing, immunohematology, and clinical chemistry.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信