Training machine learning-based spirometry reference equations: a comparison with GAMLSS and GLI reference equations.

IF 1.4 Q3 RESPIRATORY SYSTEM

European Clinical Respiratory Journal Pub Date : 2025-10-03 eCollection Date: 2025-01-01 DOI:10.1080/20018525.2025.2565853

Walid Al-Qerem, Anan Jarab, Judith Eberhardt

{"title":"Training machine learning-based spirometry reference equations: a comparison with GAMLSS and GLI reference equations.","authors":"Walid Al-Qerem, Anan Jarab, Judith Eberhardt","doi":"10.1080/20018525.2025.2565853","DOIUrl":null,"url":null,"abstract":"Introduction: Interpretation of spirometry data depends on the availability of reference equations that reflect the physiological norms of the assessed population. Although GAMLSS models provide clinically acceptable models, they may lack simplicity and ease of application. This study evaluated the efficiency of machine learning (ML)-based spirometry reference equations as an alternative for Jordanian adults.Method: In this cross-sectional study, ML models were trained using age and height to predict FEV₁, FVC and FEV₁/FVC. Model development was based on the same datasets previously used to construct GAMLSS-based Jordanian equations, which included 1,948 participants (54.2% females). External validation was performed on a newly recruited sample of healthy, non-smoking adults (n = 487, 46.6% females).Results: ML predicted and lower limits of normal (LLNs) values were compared with those from the Jordanian GAMLSS, GLI equations, using z-score distributions, residual plots, and clinical diagnostic agreement. For both sexes, ML models consistently produced comparable mean squared errors (MSE) to the Jordanian GAMLSS equations and lower MSE values and z-scores closer to zero when compared with global reference equations. Agreement analyses revealed that the ML and Jordanian models more reliably classified individuals within ± 0.5 and ± 1.0 z-score thresholds, emphasizing their superior calibration. ML and Jordanian models were the only ones to classify all the healthy study sample as normal spirometry.Conclusion: ML-derived spirometry equations demonstrated strong alignment with the observed data and outperformed global standards in representing Jordanian adults. These findings support the use of reference equations customized for specific regions in respiratory diagnostics.","PeriodicalId":11872,"journal":{"name":"European Clinical Respiratory Journal","volume":"12 1","pages":"2565853"},"PeriodicalIF":1.4000,"publicationDate":"2025-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12498361/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Clinical Respiratory Journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/20018525.2025.2565853","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q3","JCRName":"RESPIRATORY SYSTEM","Score":null,"Total":0}

引用次数: 0

Abstract

Introduction: Interpretation of spirometry data depends on the availability of reference equations that reflect the physiological norms of the assessed population. Although GAMLSS models provide clinically acceptable models, they may lack simplicity and ease of application. This study evaluated the efficiency of machine learning (ML)-based spirometry reference equations as an alternative for Jordanian adults.

Method: In this cross-sectional study, ML models were trained using age and height to predict FEV₁, FVC and FEV₁/FVC. Model development was based on the same datasets previously used to construct GAMLSS-based Jordanian equations, which included 1,948 participants (54.2% females). External validation was performed on a newly recruited sample of healthy, non-smoking adults (n = 487, 46.6% females).

Results: ML predicted and lower limits of normal (LLNs) values were compared with those from the Jordanian GAMLSS, GLI equations, using z-score distributions, residual plots, and clinical diagnostic agreement. For both sexes, ML models consistently produced comparable mean squared errors (MSE) to the Jordanian GAMLSS equations and lower MSE values and z-scores closer to zero when compared with global reference equations. Agreement analyses revealed that the ML and Jordanian models more reliably classified individuals within ± 0.5 and ± 1.0 z-score thresholds, emphasizing their superior calibration. ML and Jordanian models were the only ones to classify all the healthy study sample as normal spirometry.

Conclusion: ML-derived spirometry equations demonstrated strong alignment with the observed data and outperformed global standards in representing Jordanian adults. These findings support the use of reference equations customized for specific regions in respiratory diagnostics.

查看原文本刊更多论文

训练基于机器学习的肺活量测定参考方程：与GAMLSS和GLI参考方程的比较。

简介：肺活量测定数据的解释取决于反映被评估人群生理规范的参考方程的可用性。虽然GAMLSS模型提供了临床可接受的模型，但它们可能缺乏简单性和易用性。本研究评估了基于机器学习（ML）的肺活量测定参考方程作为约旦成年人的替代方案的效率。方法：在横断面研究中，使用年龄和身高训练ML模型来预测FEV₁、FVC和FEV₁/FVC。模型开发基于先前用于构建基于gamlss的约旦方程的相同数据集，其中包括1948名参与者（54.2%为女性）。外部验证在新招募的健康非吸烟成年人样本中进行（n = 487，女性46.6%）。结果：使用z-score分布、残差图和临床诊断一致性，将ML预测值和正常（lln）值的下限与约旦GAMLSS、GLI方程的值进行比较。对于两性，ML模型始终产生与约旦GAMLSS方程相当的均方误差（MSE），并且与全球参考方程相比，MSE值更低，z分数更接近于零。一致性分析显示，ML和Jordanian模型在±0.5和±1.0 z-score阈值范围内更可靠地对个体进行分类，强调了它们的校准优势。ML和Jordanian模型是唯一将所有健康研究样本归类为正常肺活量的模型。结论：ml导出的肺活量测定方程与观察到的数据有很强的一致性，并且在代表约旦成年人方面优于全球标准。这些发现支持在呼吸诊断中使用针对特定区域定制的参考方程。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊