Development of machine learning models for chronic fatigue prediction in granulomatosis with polyangiitis.

IF 2.1 4区 医学 Q3 RHEUMATOLOGY
Alexandre Moura Dos Santos, Samuel Katsuyuki Shinjo
{"title":"Development of machine learning models for chronic fatigue prediction in granulomatosis with polyangiitis.","authors":"Alexandre Moura Dos Santos, Samuel Katsuyuki Shinjo","doi":"10.1186/s42358-025-00482-3","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Chronic fatigue severely compromises the quality of life in patients with granulomatosis with polyangiitis (GPA). Traditional diagnostic methods are often time-consuming, relying on clinical expertise and detailed questionnaires. This study aimed to develop a machine learning model capable of predicting chronic fatigue in GPA patients based on clinical data, with a particular focus on improving diagnostic capacity in regions with limited access to specialists.</p><p><strong>Methods: </strong>This cross-sectional study collected data on fatigue (measured by the Modified Fatigue Impact Scale, MFIS), functional ability (Health Assessment Questionnaire, HAQ), disease activity (Birmingham Vasculitis Activity Score, BVAS), comorbidities, medication use, physical activity (International Physical Activity Questionnaire - Short Form, IPAQ-SF), and demographic characteristics. Four machine learning algorithms-logistic regression, decision tree, random forest, and extreme gradient boosting-were assessed using a 70/30 train-test split. Model performance was evaluated using area under the curve (AUC), accuracy, F1 score, recall, and precision. Statistical comparisons were performed using Welch's t-test and the Wilcoxon-Mann-Whitney U test for continuous variables, while the chi-square test or Fisher's exact test was applied to categorical variables, with significance set at P < 0.05. All analyses were conducted using R version 4.4.1 for Windows.</p><p><strong>Results: </strong>Forty-five patients were assessed: 62.2% were female, with a median BMI of 27.72 kg/m² (23.2-30.1), a median age of 55.5 years, and a median disease duration of 12.0 years (6.0-17.0). Fatigue was reported by 20 patients (MFIS score ≥ 38), and seven patients (15.5%) had active disease according to the BVAS, which was similar between the fatigued and no fatigued groups (P > 0.05). The fatigued group had more acute-phase reactants and prednisone use (P < 0.05). The tree-based models achieved an AUC of approximately 0.80, outperforming the other models.</p><p><strong>Conclusion: </strong>Tree-based models demonstrated superior predictive performance in identifying chronic fatigue. The Random Forest model, in particular, highlighted higher disability in activities of daily living (HAQ), older age, and longer disease duration as key predictors. Although the models performed well, additional data and incorporation of clinically relevant variables may further enhance predictive accuracy. Patients with GPA who experienced fatigue showed higher glucocorticoid use and elevated acute-phase reactants, despite similar levels of disease activity, suggesting mechanisms beyond inflammation. Machine learning shows strong potential as a clinical tool for fatigue identification, especially in settings with limited access to specialist care.</p><p><strong>Trial registration: </strong>Universal Trial Number (UTN): U1111-1271-6003; Brazilian Clinical Trials Registry (ReBEC): RBR-9n4z2hh. Registration date: January 18, 2022, and Plataforma Brasil (CAAE # 41762820.1.0000.0068).</p>","PeriodicalId":48634,"journal":{"name":"Advances in Rheumatology","volume":"65 1","pages":"45"},"PeriodicalIF":2.1000,"publicationDate":"2025-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advances in Rheumatology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s42358-025-00482-3","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"RHEUMATOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Chronic fatigue severely compromises the quality of life in patients with granulomatosis with polyangiitis (GPA). Traditional diagnostic methods are often time-consuming, relying on clinical expertise and detailed questionnaires. This study aimed to develop a machine learning model capable of predicting chronic fatigue in GPA patients based on clinical data, with a particular focus on improving diagnostic capacity in regions with limited access to specialists.

Methods: This cross-sectional study collected data on fatigue (measured by the Modified Fatigue Impact Scale, MFIS), functional ability (Health Assessment Questionnaire, HAQ), disease activity (Birmingham Vasculitis Activity Score, BVAS), comorbidities, medication use, physical activity (International Physical Activity Questionnaire - Short Form, IPAQ-SF), and demographic characteristics. Four machine learning algorithms-logistic regression, decision tree, random forest, and extreme gradient boosting-were assessed using a 70/30 train-test split. Model performance was evaluated using area under the curve (AUC), accuracy, F1 score, recall, and precision. Statistical comparisons were performed using Welch's t-test and the Wilcoxon-Mann-Whitney U test for continuous variables, while the chi-square test or Fisher's exact test was applied to categorical variables, with significance set at P < 0.05. All analyses were conducted using R version 4.4.1 for Windows.

Results: Forty-five patients were assessed: 62.2% were female, with a median BMI of 27.72 kg/m² (23.2-30.1), a median age of 55.5 years, and a median disease duration of 12.0 years (6.0-17.0). Fatigue was reported by 20 patients (MFIS score ≥ 38), and seven patients (15.5%) had active disease according to the BVAS, which was similar between the fatigued and no fatigued groups (P > 0.05). The fatigued group had more acute-phase reactants and prednisone use (P < 0.05). The tree-based models achieved an AUC of approximately 0.80, outperforming the other models.

Conclusion: Tree-based models demonstrated superior predictive performance in identifying chronic fatigue. The Random Forest model, in particular, highlighted higher disability in activities of daily living (HAQ), older age, and longer disease duration as key predictors. Although the models performed well, additional data and incorporation of clinically relevant variables may further enhance predictive accuracy. Patients with GPA who experienced fatigue showed higher glucocorticoid use and elevated acute-phase reactants, despite similar levels of disease activity, suggesting mechanisms beyond inflammation. Machine learning shows strong potential as a clinical tool for fatigue identification, especially in settings with limited access to specialist care.

Trial registration: Universal Trial Number (UTN): U1111-1271-6003; Brazilian Clinical Trials Registry (ReBEC): RBR-9n4z2hh. Registration date: January 18, 2022, and Plataforma Brasil (CAAE # 41762820.1.0000.0068).

多血管炎肉芽肿慢性疲劳预测机器学习模型的发展。
背景:慢性疲劳严重影响肉芽肿合并多血管炎(GPA)患者的生活质量。传统的诊断方法往往耗时,依赖于临床专业知识和详细的问卷调查。本研究旨在开发一种机器学习模型,能够根据临床数据预测GPA患者的慢性疲劳,特别侧重于提高专家有限地区的诊断能力。方法:本横断面研究收集了疲劳(采用修正疲劳影响量表,MFIS测量)、功能能力(健康评估问卷,HAQ)、疾病活动性(伯明翰血管炎活动评分,BVAS)、合并症、药物使用、体力活动(国际体力活动问卷-短表,IPAQ-SF)和人口统计学特征的数据。四种机器学习算法——逻辑回归、决策树、随机森林和极端梯度提升——使用70/30训练测试分割进行评估。使用曲线下面积(AUC)、准确率、F1评分、召回率和准确率来评估模型的性能。对连续变量采用Welch's t检验和wilcox - mann - whitney U检验进行统计学比较,对分类变量采用卡方检验或Fisher精确检验,显著性设为P。结果:共评估45例患者:62.2%为女性,中位BMI为27.72 kg/m²(23.2-30.1),中位年龄为55.5岁,中位病程为12.0年(6.0-17.0)。20例患者报告疲劳(MFIS评分≥38),7例患者(15.5%)根据BVAS有活动性疾病,疲劳组和非疲劳组之间差异无统计学意义(P < 0.05)。疲劳组有更多的急性期反应物和强的松使用(P结论:基于树的模型在识别慢性疲劳方面表现出更好的预测性能。随机森林模型特别强调了日常生活活动(HAQ)中较高的残疾程度、年龄和较长的疾病持续时间是关键的预测因素。虽然模型表现良好,但额外的数据和临床相关变量的合并可能会进一步提高预测的准确性。经历疲劳的GPA患者表现出更高的糖皮质激素使用和急性期反应物升高,尽管疾病活动水平相似,这表明炎症以外的机制。机器学习作为疲劳识别的临床工具显示出强大的潜力,特别是在专业护理有限的环境中。试验注册:通用试验号(UTN): U1111-1271-6003;巴西临床试验注册中心(ReBEC): RBR-9n4z2hh。注册日期:2022年1月18日和平台巴西(CAAE # 41762820.1.0000.0068)。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Advances in Rheumatology
Advances in Rheumatology Medicine-Rheumatology
CiteScore
4.00
自引率
4.30%
发文量
41
审稿时长
53 weeks
期刊介绍: Formerly named Revista Brasileira de Reumatologia, the journal is celebrating its 60th year of publication. Advances in Rheumatology is an international, open access journal publishing pre-clinical, translational and clinical studies on all aspects of paediatric and adult rheumatic diseases, including degenerative, inflammatory and autoimmune conditions. The journal is the official publication of the Brazilian Society of Rheumatology and welcomes original research (including systematic reviews and meta-analyses), literature reviews, guidelines and letters arising from published material.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信