使用回归树模型自动划分性别和年龄以估算参考区间

IF 1.1 4区 医学 Q4 MEDICAL LABORATORY TECHNOLOGY
Sandra Klawitter, Johannes Böhm, Alexander Tolios, Julian E. Gebauer
{"title":"使用回归树模型自动划分性别和年龄以估算参考区间","authors":"Sandra Klawitter, Johannes Böhm, Alexander Tolios, Julian E. Gebauer","doi":"10.1515/labmed-2024-0083","DOIUrl":null,"url":null,"abstract":"Objectives Reference intervals (RI) play a decisive role in the interpretation of medical laboratory results. An important step in the determination of RI is age- and sex specific partitioning, which is usually based on an empirical approach by graphical representation. In this study, we evaluate an automated machine learning approach. Methods This study uses pediatric data from the CALIPER RI (Canadian laboratory initiative on pediatric reference intervals) study. The calculation of potential partitions is carried out using a regression tree model included in the <jats:monospace>rpart</jats:monospace> package of the statistical programming language R. The Harris &amp; Boyd method is used to compare the corresponding partitions suggested by <jats:monospace>rpart</jats:monospace> and CALIPER. For better comparability, the reference ranges of the partitions of both approaches are then calculated using <jats:monospace>reflimR</jats:monospace>. Results Most of the partitions suggested by <jats:monospace>rpart</jats:monospace> or CALIPER show sufficient heterogeneity among themselves to justify age- and/or sex-specific RI partitioning. With only few individual exceptions, both methods yield comparable results. The partitions of both approaches for albumin and <jats:italic>γ</jats:italic>-glutamyltransferase are very similar to each other. For creatinine <jats:monospace>rpart</jats:monospace> suggests a slightly earlier distinction between the sexes. Alkaline phosphatase shows the most pronounced differences. In addition to a considerable earlier sex split, <jats:monospace>rpart</jats:monospace> suggests different age intervals for both sexes, resulting in three partitions for females and four partitions for males. Conclusions Our findings indicate that the automated analysis provided by <jats:monospace>rpart</jats:monospace> yields results that comparable to traditional methods. Nevertheless, the medical plausibility of the automatic suggestions needs to be validated by human experts.","PeriodicalId":55986,"journal":{"name":"Journal of Laboratory Medicine","volume":null,"pages":null},"PeriodicalIF":1.1000,"publicationDate":"2024-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Automated sex and age partitioning for the estimation of reference intervals using a regression tree model\",\"authors\":\"Sandra Klawitter, Johannes Böhm, Alexander Tolios, Julian E. Gebauer\",\"doi\":\"10.1515/labmed-2024-0083\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Objectives Reference intervals (RI) play a decisive role in the interpretation of medical laboratory results. An important step in the determination of RI is age- and sex specific partitioning, which is usually based on an empirical approach by graphical representation. In this study, we evaluate an automated machine learning approach. Methods This study uses pediatric data from the CALIPER RI (Canadian laboratory initiative on pediatric reference intervals) study. The calculation of potential partitions is carried out using a regression tree model included in the <jats:monospace>rpart</jats:monospace> package of the statistical programming language R. The Harris &amp; Boyd method is used to compare the corresponding partitions suggested by <jats:monospace>rpart</jats:monospace> and CALIPER. For better comparability, the reference ranges of the partitions of both approaches are then calculated using <jats:monospace>reflimR</jats:monospace>. Results Most of the partitions suggested by <jats:monospace>rpart</jats:monospace> or CALIPER show sufficient heterogeneity among themselves to justify age- and/or sex-specific RI partitioning. With only few individual exceptions, both methods yield comparable results. The partitions of both approaches for albumin and <jats:italic>γ</jats:italic>-glutamyltransferase are very similar to each other. For creatinine <jats:monospace>rpart</jats:monospace> suggests a slightly earlier distinction between the sexes. Alkaline phosphatase shows the most pronounced differences. In addition to a considerable earlier sex split, <jats:monospace>rpart</jats:monospace> suggests different age intervals for both sexes, resulting in three partitions for females and four partitions for males. Conclusions Our findings indicate that the automated analysis provided by <jats:monospace>rpart</jats:monospace> yields results that comparable to traditional methods. Nevertheless, the medical plausibility of the automatic suggestions needs to be validated by human experts.\",\"PeriodicalId\":55986,\"journal\":{\"name\":\"Journal of Laboratory Medicine\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":1.1000,\"publicationDate\":\"2024-08-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Laboratory Medicine\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1515/labmed-2024-0083\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"MEDICAL LABORATORY TECHNOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Laboratory Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1515/labmed-2024-0083","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"MEDICAL LABORATORY TECHNOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

目标 参考区间(RI)在解释医学实验室结果中起着决定性作用。确定参考区间的一个重要步骤是根据年龄和性别进行分区,这通常是通过图形表示的经验方法进行的。在本研究中,我们对一种自动机器学习方法进行了评估。方法 本研究使用的儿科数据来自 CALIPER RI(加拿大儿科参考区间实验室倡议)研究。使用统计编程语言 R 的 rpart 软件包中的回归树模型计算潜在的分区。Harris & Boyd 方法用于比较 rpart 和 CALIPER 提出的相应分区。为了更好地进行比较,还使用 reflimR 计算了两种方法的分区参考范围。结果 rpart 或 CALIPER 提出的大多数分区都显示出了足够的异质性,因此有理由对不同年龄和/或性别的 RI 进行分区。除了个别例外情况,两种方法得出的结果具有可比性。两种方法对白蛋白和 γ-谷氨酰转移酶的分区结果非常相似。就肌酐而言,rpart 表明性别差异稍早。碱性磷酸酶的差异最为明显。除了较早出现性别分化外,rpart 还显示男女的年龄间隔不同,女性分为三个部分,男性分为四个部分。结论 我们的研究结果表明,rpart 提供的自动分析结果与传统方法不相上下。不过,自动建议的医学合理性还需要人类专家的验证。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Automated sex and age partitioning for the estimation of reference intervals using a regression tree model
Objectives Reference intervals (RI) play a decisive role in the interpretation of medical laboratory results. An important step in the determination of RI is age- and sex specific partitioning, which is usually based on an empirical approach by graphical representation. In this study, we evaluate an automated machine learning approach. Methods This study uses pediatric data from the CALIPER RI (Canadian laboratory initiative on pediatric reference intervals) study. The calculation of potential partitions is carried out using a regression tree model included in the rpart package of the statistical programming language R. The Harris & Boyd method is used to compare the corresponding partitions suggested by rpart and CALIPER. For better comparability, the reference ranges of the partitions of both approaches are then calculated using reflimR. Results Most of the partitions suggested by rpart or CALIPER show sufficient heterogeneity among themselves to justify age- and/or sex-specific RI partitioning. With only few individual exceptions, both methods yield comparable results. The partitions of both approaches for albumin and γ-glutamyltransferase are very similar to each other. For creatinine rpart suggests a slightly earlier distinction between the sexes. Alkaline phosphatase shows the most pronounced differences. In addition to a considerable earlier sex split, rpart suggests different age intervals for both sexes, resulting in three partitions for females and four partitions for males. Conclusions Our findings indicate that the automated analysis provided by rpart yields results that comparable to traditional methods. Nevertheless, the medical plausibility of the automatic suggestions needs to be validated by human experts.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Laboratory Medicine
Journal of Laboratory Medicine Mathematics-Discrete Mathematics and Combinatorics
CiteScore
2.50
自引率
0.00%
发文量
39
审稿时长
10 weeks
期刊介绍: The Journal of Laboratory Medicine (JLM) is a bi-monthly published journal that reports on the latest developments in laboratory medicine. Particular focus is placed on the diagnostic aspects of the clinical laboratory, although technical, regulatory, and educational topics are equally covered. The Journal specializes in the publication of high-standard, competent and timely review articles on clinical, methodological and pathogenic aspects of modern laboratory diagnostics. These reviews are critically reviewed by expert reviewers and JLM’s Associate Editors who are specialists in the various subdisciplines of laboratory medicine. In addition, JLM publishes original research articles, case reports, point/counterpoint articles and letters to the editor, all of which are peer reviewed by at least two experts in the field.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信