{"title":"Artificial intelligence-assisted machine learning models for predicting lung cancer survival","authors":"Yue Yuan , Guolong Zhang , Yuqi Gu , Sicheng Hao , Chen Huang , Hongxia Xie , Wei Mi , Yingchun Zeng","doi":"10.1016/j.apjon.2025.100680","DOIUrl":null,"url":null,"abstract":"<div><h3>Objective</h3><div>This study aimed to evaluate the feasibility of large language model-Advanced Data Analysis (ADA) in developing and implementing machine learning models to predict survival outcomes for lung cancer patients, with a focus on its implications for nursing practice.</div></div><div><h3>Methods</h3><div>A retrospective study design was employed using a dataset of lung cancer patients. Data included sociodemographic, clinical, treatment-specific, and comorbidity variables. Large language model-ADA was used to build and evaluate three machine learning models. Model performance was validated, and results were presented using calibration plots.</div></div><div><h3>Results</h3><div>Of 737 patients, the survival rate of this cohort was 73.3%, with a mean age of 59.32 years. Calibration plots indicated robust model reliability across all models. The Random Forest model demonstrated the highest predictive accuracy among the models. Most critical features identified were preoperative white blood cells (2.2%), preoperative lung function of Forced Expiratory Volume in one second (2.1%), preoperative arterial oxygen saturation (1.9%), preoperative partial pressure of oxygen (1.7%), preoperative albumin (1.6%), preoperative preparation time (1.5%), age at admission (1.5%), preoperative partial pressure of carbon dioxide (1.5%), preoperative hospital stay days (1.5%), and postoperative total days of thoracic tube drainage (1.4%).</div></div><div><h3>Conclusions</h3><div>Large language model-ADA effectively facilitates the development of machine learning models for lung cancer survival prediction, enabling non-technical health care professionals to harness the power of advanced analytics. The findings underscore the importance of preoperative factors in predicting outcomes, while also highlighting the need for external validation across diverse settings.</div></div>","PeriodicalId":8569,"journal":{"name":"Asia-Pacific Journal of Oncology Nursing","volume":"12 ","pages":"Article 100680"},"PeriodicalIF":2.4000,"publicationDate":"2025-03-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Asia-Pacific Journal of Oncology Nursing","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2347562525000289","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"NURSING","Score":null,"Total":0}
引用次数: 0
Abstract
Objective
This study aimed to evaluate the feasibility of large language model-Advanced Data Analysis (ADA) in developing and implementing machine learning models to predict survival outcomes for lung cancer patients, with a focus on its implications for nursing practice.
Methods
A retrospective study design was employed using a dataset of lung cancer patients. Data included sociodemographic, clinical, treatment-specific, and comorbidity variables. Large language model-ADA was used to build and evaluate three machine learning models. Model performance was validated, and results were presented using calibration plots.
Results
Of 737 patients, the survival rate of this cohort was 73.3%, with a mean age of 59.32 years. Calibration plots indicated robust model reliability across all models. The Random Forest model demonstrated the highest predictive accuracy among the models. Most critical features identified were preoperative white blood cells (2.2%), preoperative lung function of Forced Expiratory Volume in one second (2.1%), preoperative arterial oxygen saturation (1.9%), preoperative partial pressure of oxygen (1.7%), preoperative albumin (1.6%), preoperative preparation time (1.5%), age at admission (1.5%), preoperative partial pressure of carbon dioxide (1.5%), preoperative hospital stay days (1.5%), and postoperative total days of thoracic tube drainage (1.4%).
Conclusions
Large language model-ADA effectively facilitates the development of machine learning models for lung cancer survival prediction, enabling non-technical health care professionals to harness the power of advanced analytics. The findings underscore the importance of preoperative factors in predicting outcomes, while also highlighting the need for external validation across diverse settings.