Aoyang Guo , Yanran Chen , Hongyang Liu , Shujun Gao , Xinyi Huang , Dongzhou Liu , Qianqian Zhao , Xiaoping Hong
{"title":"预测和验证系统性红斑狼疮间质性肺疾病的风险","authors":"Aoyang Guo , Yanran Chen , Hongyang Liu , Shujun Gao , Xinyi Huang , Dongzhou Liu , Qianqian Zhao , Xiaoping Hong","doi":"10.1016/j.ijmedinf.2025.105839","DOIUrl":null,"url":null,"abstract":"<div><h3>Objective</h3><div>Our study aimed to<!--> <!-->construct a web-based calculator to predict high risk patients of interstitial lung disease (ILD) in systemic lupus erythematosus (SLE).</div></div><div><h3>Methods</h3><div>This retrospective study comprised training and test cohorts, including 581 and 86 patients, respectively. Univariate, least absolute shrinkage and selection operator (LASSO), random forest (RF), eXtreme Gradient Boosting (XGBoost), and logistic regression (LR) analyses were performed. A Venn diagram was used to investigate critical features. Receiver operating characteristic (ROC) analysis and decision curve analysis were used to evaluate the model’s performance. Risk stratification was performed using the best ROC cut-off value. The web-based calculator was established using Streamlit software.</div></div><div><h3>Results</h3><div>Characteristics such as Raynaud’s phenomenon, pulmonary artery systolic pressure, serositis, anti-U1RNP antibodies, anti-Ro52 antibodies, C-reactive protein, age, and disease course were associated with SLE complicated by ILD (SLE-ILD). LR-Venn, RF-Venn, XGBoost-Venn, LASSO-logic, RF, and XGBoost models were constructed. In training cohort, the XGBoost model demonstrated the highest area under the ROC curve (AUC, 0.890; cut-off value, 0.197; sensitivity, 0.793; specificity, 0.836) and provided<!--> <!-->a net<!--> <!-->benefit<!--> <!-->in decision curve analysis (odds ratio [OR] for SLE-ILD [high- vs. low-risk], 19.6). The model was validated in the test cohort (AUC, 0.866; sensitivity, 0.722; specificity, 0.897; OR, 22.7). Furthermore, an XGBoost model-based web calculator was developed.</div></div><div><h3>Conclusion</h3><div>Our web calculator (<span><span>https://st</span><svg><path></path></svg></span><span><span>-xgboost-app-kcv9qm.streamlit.app/</span><svg><path></path></svg></span>) greatly improved risk prediction for SLE-ILD and was implemented effectively.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"197 ","pages":"Article 105839"},"PeriodicalIF":3.7000,"publicationDate":"2025-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Predicting and validating the risk of interstitial lung disease in systemic lupus erythematosus\",\"authors\":\"Aoyang Guo , Yanran Chen , Hongyang Liu , Shujun Gao , Xinyi Huang , Dongzhou Liu , Qianqian Zhao , Xiaoping Hong\",\"doi\":\"10.1016/j.ijmedinf.2025.105839\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Objective</h3><div>Our study aimed to<!--> <!-->construct a web-based calculator to predict high risk patients of interstitial lung disease (ILD) in systemic lupus erythematosus (SLE).</div></div><div><h3>Methods</h3><div>This retrospective study comprised training and test cohorts, including 581 and 86 patients, respectively. Univariate, least absolute shrinkage and selection operator (LASSO), random forest (RF), eXtreme Gradient Boosting (XGBoost), and logistic regression (LR) analyses were performed. A Venn diagram was used to investigate critical features. Receiver operating characteristic (ROC) analysis and decision curve analysis were used to evaluate the model’s performance. Risk stratification was performed using the best ROC cut-off value. The web-based calculator was established using Streamlit software.</div></div><div><h3>Results</h3><div>Characteristics such as Raynaud’s phenomenon, pulmonary artery systolic pressure, serositis, anti-U1RNP antibodies, anti-Ro52 antibodies, C-reactive protein, age, and disease course were associated with SLE complicated by ILD (SLE-ILD). LR-Venn, RF-Venn, XGBoost-Venn, LASSO-logic, RF, and XGBoost models were constructed. In training cohort, the XGBoost model demonstrated the highest area under the ROC curve (AUC, 0.890; cut-off value, 0.197; sensitivity, 0.793; specificity, 0.836) and provided<!--> <!-->a net<!--> <!-->benefit<!--> <!-->in decision curve analysis (odds ratio [OR] for SLE-ILD [high- vs. low-risk], 19.6). The model was validated in the test cohort (AUC, 0.866; sensitivity, 0.722; specificity, 0.897; OR, 22.7). Furthermore, an XGBoost model-based web calculator was developed.</div></div><div><h3>Conclusion</h3><div>Our web calculator (<span><span>https://st</span><svg><path></path></svg></span><span><span>-xgboost-app-kcv9qm.streamlit.app/</span><svg><path></path></svg></span>) greatly improved risk prediction for SLE-ILD and was implemented effectively.</div></div>\",\"PeriodicalId\":54950,\"journal\":{\"name\":\"International Journal of Medical Informatics\",\"volume\":\"197 \",\"pages\":\"Article 105839\"},\"PeriodicalIF\":3.7000,\"publicationDate\":\"2025-02-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Medical Informatics\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1386505625000565\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Medical Informatics","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1386505625000565","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
Predicting and validating the risk of interstitial lung disease in systemic lupus erythematosus
Objective
Our study aimed to construct a web-based calculator to predict high risk patients of interstitial lung disease (ILD) in systemic lupus erythematosus (SLE).
Methods
This retrospective study comprised training and test cohorts, including 581 and 86 patients, respectively. Univariate, least absolute shrinkage and selection operator (LASSO), random forest (RF), eXtreme Gradient Boosting (XGBoost), and logistic regression (LR) analyses were performed. A Venn diagram was used to investigate critical features. Receiver operating characteristic (ROC) analysis and decision curve analysis were used to evaluate the model’s performance. Risk stratification was performed using the best ROC cut-off value. The web-based calculator was established using Streamlit software.
Results
Characteristics such as Raynaud’s phenomenon, pulmonary artery systolic pressure, serositis, anti-U1RNP antibodies, anti-Ro52 antibodies, C-reactive protein, age, and disease course were associated with SLE complicated by ILD (SLE-ILD). LR-Venn, RF-Venn, XGBoost-Venn, LASSO-logic, RF, and XGBoost models were constructed. In training cohort, the XGBoost model demonstrated the highest area under the ROC curve (AUC, 0.890; cut-off value, 0.197; sensitivity, 0.793; specificity, 0.836) and provided a net benefit in decision curve analysis (odds ratio [OR] for SLE-ILD [high- vs. low-risk], 19.6). The model was validated in the test cohort (AUC, 0.866; sensitivity, 0.722; specificity, 0.897; OR, 22.7). Furthermore, an XGBoost model-based web calculator was developed.
Conclusion
Our web calculator (https://st-xgboost-app-kcv9qm.streamlit.app/) greatly improved risk prediction for SLE-ILD and was implemented effectively.
期刊介绍:
International Journal of Medical Informatics provides an international medium for dissemination of original results and interpretative reviews concerning the field of medical informatics. The Journal emphasizes the evaluation of systems in healthcare settings.
The scope of journal covers:
Information systems, including national or international registration systems, hospital information systems, departmental and/or physician''s office systems, document handling systems, electronic medical record systems, standardization, systems integration etc.;
Computer-aided medical decision support systems using heuristic, algorithmic and/or statistical methods as exemplified in decision theory, protocol development, artificial intelligence, etc.
Educational computer based programs pertaining to medical informatics or medicine in general;
Organizational, economic, social, clinical impact, ethical and cost-benefit aspects of IT applications in health care.