影响肺移植受者预后的最重要因素分析:基于UNOS数据的多变量预测模型

IF 2.4 3区 医学 Q1 MEDICINE, GENERAL & INTERNAL
Marsa Gholamzadeh, Reza Safdari, Mehrnaz Asadi Gharabaghi, Hamidreza Abtahi
{"title":"影响肺移植受者预后的最重要因素分析:基于UNOS数据的多变量预测模型","authors":"Marsa Gholamzadeh, Reza Safdari, Mehrnaz Asadi Gharabaghi, Hamidreza Abtahi","doi":"10.1136/bmjopen-2024-089796","DOIUrl":null,"url":null,"abstract":"<p><strong>Objectives: </strong>In lung transplantation (LTx), a priority is assigned to each candidate on the waiting list. Our primary objective was to identify the key factors that influence the allocation of priorities in LTx using machine learning (ML) techniques to enhance the process of prioritising patients.</p><p><strong>Design: </strong>Developing a prediction model.</p><p><strong>Setting and participants: </strong>Our data were retrieved from the United Network for Organ Sharing (UNOS) open-source database of transplant patients between 2005 and 2023.</p><p><strong>Interventions: </strong>After the preprocessing process, a feature engineering technique was employed to select the most relevant features. Then, six ML models with optimised hyperparameters including multiple linear regression, random forest regressor (RF), support vector machine regressor, XGBoost regressor, a multilayer perceptron model and a deep learning model were developed based on the UNOS dataset.</p><p><strong>Primary and secondary outcome measures: </strong>The performance of each model was evaluated using R-squared (R<sup>2</sup>) and other error rate metrics. Next, the Shapley Additive Explanations (SHAP) technique was used to identify the most important features in the prediction.</p><p><strong>Results: </strong>The raw dataset contains 196 270 records with 545 features in all organs. After preprocessing, 32 966 records with 15 features remain. Among various models, the RF model achieved a high R<sup>2</sup> score. Additionally, the RF model exhibited the lowest error values, indicating its superior precision compared with other regression models. The SHAP technique in conjunction with the RF model revealed the 11 most important features for priority allocation. Subsequently, we developed a web-based decision support tool using Python and the Streamlit framework based on the best-fine-tuned model.</p><p><strong>Conclusion: </strong>The deployment of the ML model has the potential to act as an automated tool to aid physicians in assessing the priority of lung transplants and identifying significant factors that play a role in patient survival.</p>","PeriodicalId":9158,"journal":{"name":"BMJ Open","volume":"15 5","pages":"e089796"},"PeriodicalIF":2.4000,"publicationDate":"2025-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12086922/pdf/","citationCount":"0","resultStr":"{\"title\":\"Analysis of the most influential factors affecting outcomes of lung transplant recipients: a multivariate prediction model based on UNOS Data.\",\"authors\":\"Marsa Gholamzadeh, Reza Safdari, Mehrnaz Asadi Gharabaghi, Hamidreza Abtahi\",\"doi\":\"10.1136/bmjopen-2024-089796\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Objectives: </strong>In lung transplantation (LTx), a priority is assigned to each candidate on the waiting list. Our primary objective was to identify the key factors that influence the allocation of priorities in LTx using machine learning (ML) techniques to enhance the process of prioritising patients.</p><p><strong>Design: </strong>Developing a prediction model.</p><p><strong>Setting and participants: </strong>Our data were retrieved from the United Network for Organ Sharing (UNOS) open-source database of transplant patients between 2005 and 2023.</p><p><strong>Interventions: </strong>After the preprocessing process, a feature engineering technique was employed to select the most relevant features. Then, six ML models with optimised hyperparameters including multiple linear regression, random forest regressor (RF), support vector machine regressor, XGBoost regressor, a multilayer perceptron model and a deep learning model were developed based on the UNOS dataset.</p><p><strong>Primary and secondary outcome measures: </strong>The performance of each model was evaluated using R-squared (R<sup>2</sup>) and other error rate metrics. Next, the Shapley Additive Explanations (SHAP) technique was used to identify the most important features in the prediction.</p><p><strong>Results: </strong>The raw dataset contains 196 270 records with 545 features in all organs. After preprocessing, 32 966 records with 15 features remain. Among various models, the RF model achieved a high R<sup>2</sup> score. Additionally, the RF model exhibited the lowest error values, indicating its superior precision compared with other regression models. The SHAP technique in conjunction with the RF model revealed the 11 most important features for priority allocation. Subsequently, we developed a web-based decision support tool using Python and the Streamlit framework based on the best-fine-tuned model.</p><p><strong>Conclusion: </strong>The deployment of the ML model has the potential to act as an automated tool to aid physicians in assessing the priority of lung transplants and identifying significant factors that play a role in patient survival.</p>\",\"PeriodicalId\":9158,\"journal\":{\"name\":\"BMJ Open\",\"volume\":\"15 5\",\"pages\":\"e089796\"},\"PeriodicalIF\":2.4000,\"publicationDate\":\"2025-05-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12086922/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BMJ Open\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1136/bmjopen-2024-089796\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MEDICINE, GENERAL & INTERNAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMJ Open","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1136/bmjopen-2024-089796","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MEDICINE, GENERAL & INTERNAL","Score":null,"Total":0}
引用次数: 0

摘要

目的:在肺移植(LTx)中,优先分配给等待名单上的每个候选人。我们的主要目标是使用机器学习(ML)技术来确定影响LTx优先级分配的关键因素,以增强患者优先级分配的过程。设计:开发一个预测模型。环境和参与者:我们的数据检索自联合器官共享网络(UNOS) 2005年至2023年移植患者的开源数据库。干预措施:预处理后,采用特征工程技术选择最相关的特征。然后,基于UNOS数据集,开发了6个具有优化超参数的ML模型,包括多元线性回归、随机森林回归、支持向量机回归、XGBoost回归、多层感知器模型和深度学习模型。主要和次要结果测量:使用r²(R2)和其他错误率指标评估每个模型的性能。接下来,使用Shapley加性解释(SHAP)技术来识别预测中最重要的特征。结果:原始数据集包含196 270条记录,包含所有器官的545个特征。预处理后,保留了32 966条记录,15个特征。在各种模型中,RF模型的R2得分较高。此外,RF模型误差值最小,表明其精度优于其他回归模型。SHAP技术结合RF模型揭示了优先级分配的11个最重要的特征。随后,我们使用Python和基于最佳微调模型的Streamlit框架开发了一个基于web的决策支持工具。结论:ML模型的部署有可能作为一种自动化工具,帮助医生评估肺移植的优先级,并识别对患者生存起作用的重要因素。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Analysis of the most influential factors affecting outcomes of lung transplant recipients: a multivariate prediction model based on UNOS Data.

Objectives: In lung transplantation (LTx), a priority is assigned to each candidate on the waiting list. Our primary objective was to identify the key factors that influence the allocation of priorities in LTx using machine learning (ML) techniques to enhance the process of prioritising patients.

Design: Developing a prediction model.

Setting and participants: Our data were retrieved from the United Network for Organ Sharing (UNOS) open-source database of transplant patients between 2005 and 2023.

Interventions: After the preprocessing process, a feature engineering technique was employed to select the most relevant features. Then, six ML models with optimised hyperparameters including multiple linear regression, random forest regressor (RF), support vector machine regressor, XGBoost regressor, a multilayer perceptron model and a deep learning model were developed based on the UNOS dataset.

Primary and secondary outcome measures: The performance of each model was evaluated using R-squared (R2) and other error rate metrics. Next, the Shapley Additive Explanations (SHAP) technique was used to identify the most important features in the prediction.

Results: The raw dataset contains 196 270 records with 545 features in all organs. After preprocessing, 32 966 records with 15 features remain. Among various models, the RF model achieved a high R2 score. Additionally, the RF model exhibited the lowest error values, indicating its superior precision compared with other regression models. The SHAP technique in conjunction with the RF model revealed the 11 most important features for priority allocation. Subsequently, we developed a web-based decision support tool using Python and the Streamlit framework based on the best-fine-tuned model.

Conclusion: The deployment of the ML model has the potential to act as an automated tool to aid physicians in assessing the priority of lung transplants and identifying significant factors that play a role in patient survival.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
BMJ Open
BMJ Open MEDICINE, GENERAL & INTERNAL-
CiteScore
4.40
自引率
3.40%
发文量
4510
审稿时长
2-3 weeks
期刊介绍: BMJ Open is an online, open access journal, dedicated to publishing medical research from all disciplines and therapeutic areas. The journal publishes all research study types, from study protocols to phase I trials to meta-analyses, including small or specialist studies. Publishing procedures are built around fully open peer review and continuous publication, publishing research online as soon as the article is ready.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信