Novel Statistical-AI Method to Automate Discovery of Predictive Factors and Thresholds for 3 Year Survival, Dysphagia and Xerostomia for Patients with Head and Neck Cancers
{"title":"Novel Statistical-AI Method to Automate Discovery of Predictive Factors and Thresholds for 3 Year Survival, Dysphagia and Xerostomia for Patients with Head and Neck Cancers","authors":"","doi":"10.1016/j.ijrobp.2024.07.062","DOIUrl":null,"url":null,"abstract":"<div><h3>Purpose/Objective(s)</h3><div>Clinicians iteratively adjust treatment approaches to improve outcomes, but to date, automatable approaches for continuous learning of risk factors as these adjustments are made are lacking. We combined a large-scale, comprehensive real-world Learning Health System infrastructure (LHSI), with automated statistical profiling, visualization, and artificial intelligence (AI) approach to test evidence-based discovery of clinical factors for three endpoints: dysphagia, xerostomia, and 3-year survival for head and neck cancer patients.</div></div><div><h3>Materials/Methods</h3><div>Records for 964 patients treated for head and neck cancers with conventional fractionation between 2017 and 2022 were used. Combined information on demographics, diagnosis and staging, social determinants of health measures, chemotherapy, radiation therapy dose volume histogram curves, treatment details, laboratory values, and outcomes from the LHSI to winnow evidence for 485 candidate features. Univariate statistical profiling was performed using bootstrap resampling to detail confidence intervals for the following thresholds and metrics: area under the curve (AUC), sensitivity (SN), specificity (SP), F1, diagnostic odds ratio (DOR), <em>P</em> values for Wilcoxon Rank Sum (WRS), Kolmogorov-Smirnov (KS), and logistic fits of distributions detailed predictive evidence of individual features. Parsimonious XGBoost models were constructed with 10-fold cross validation using training (70%), validation (10%), and test (20%) sets. Probabilistic models utilizing statistical profiling logistic fits of distributions were used to benchmark XGBoost models.</div></div><div><h3>Results</h3><div>Incidence of dysphagia ≥ grade 3 within 1 year of treatment was low (11%). Xerostomia ≥ grade 2 (39% to 16%) and survival ≤ 3 years decreased (25% to 15%) over the time range. The strongest grade 2 xerostomia predictor was Glnd_Submand_Low: D15% [Gy] ≥ 45.2 with a logistic model quantifying a gradual rather than an abrupt increase in probability (13.5 + 0.18 (x-41.0 Gy)). Strongest predictive factors for lower likelihood of death by 3 years were GTV_High: Volume [cc] ≤ 21.1, GTV_Low: Volume [cc] ≤ 57.5, Baseline Neutrophil-Lymphocyte Ratio (NLR) ≤ 5.6, Monocyte-Lymphocyte Ratio (MLR) ≤0.56, Platelet-Lymphocyte ratio (PLR) ≤ 202.5. All predictors had WRS and KS <em>P</em> values < 0.02. Statistical profiling enabled detailing gains of XGBoost models with respect to individual features. Time period reductions in distribution of GTV volumes correlated with reductions in death by 3 years.</div></div><div><h3>Conclusion</h3><div>Combined use of LHSI, Statistical Profiling and Artificial Intelligence provided a basis for automating evidence-based discovery. Benchmarking AI models with simple probabilistic models provided a means of understanding when results are driven by general areas of overall risk vs. more complex interactions. The method can form a new approach to continuous learning and evidence-based development of clinical trial testable hypothesis and stratifications.</div></div>","PeriodicalId":14215,"journal":{"name":"International Journal of Radiation Oncology Biology Physics","volume":null,"pages":null},"PeriodicalIF":6.4000,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Radiation Oncology Biology Physics","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0360301624008241","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Purpose/Objective(s)
Clinicians iteratively adjust treatment approaches to improve outcomes, but to date, automatable approaches for continuous learning of risk factors as these adjustments are made are lacking. We combined a large-scale, comprehensive real-world Learning Health System infrastructure (LHSI), with automated statistical profiling, visualization, and artificial intelligence (AI) approach to test evidence-based discovery of clinical factors for three endpoints: dysphagia, xerostomia, and 3-year survival for head and neck cancer patients.
Materials/Methods
Records for 964 patients treated for head and neck cancers with conventional fractionation between 2017 and 2022 were used. Combined information on demographics, diagnosis and staging, social determinants of health measures, chemotherapy, radiation therapy dose volume histogram curves, treatment details, laboratory values, and outcomes from the LHSI to winnow evidence for 485 candidate features. Univariate statistical profiling was performed using bootstrap resampling to detail confidence intervals for the following thresholds and metrics: area under the curve (AUC), sensitivity (SN), specificity (SP), F1, diagnostic odds ratio (DOR), P values for Wilcoxon Rank Sum (WRS), Kolmogorov-Smirnov (KS), and logistic fits of distributions detailed predictive evidence of individual features. Parsimonious XGBoost models were constructed with 10-fold cross validation using training (70%), validation (10%), and test (20%) sets. Probabilistic models utilizing statistical profiling logistic fits of distributions were used to benchmark XGBoost models.
Results
Incidence of dysphagia ≥ grade 3 within 1 year of treatment was low (11%). Xerostomia ≥ grade 2 (39% to 16%) and survival ≤ 3 years decreased (25% to 15%) over the time range. The strongest grade 2 xerostomia predictor was Glnd_Submand_Low: D15% [Gy] ≥ 45.2 with a logistic model quantifying a gradual rather than an abrupt increase in probability (13.5 + 0.18 (x-41.0 Gy)). Strongest predictive factors for lower likelihood of death by 3 years were GTV_High: Volume [cc] ≤ 21.1, GTV_Low: Volume [cc] ≤ 57.5, Baseline Neutrophil-Lymphocyte Ratio (NLR) ≤ 5.6, Monocyte-Lymphocyte Ratio (MLR) ≤0.56, Platelet-Lymphocyte ratio (PLR) ≤ 202.5. All predictors had WRS and KS P values < 0.02. Statistical profiling enabled detailing gains of XGBoost models with respect to individual features. Time period reductions in distribution of GTV volumes correlated with reductions in death by 3 years.
Conclusion
Combined use of LHSI, Statistical Profiling and Artificial Intelligence provided a basis for automating evidence-based discovery. Benchmarking AI models with simple probabilistic models provided a means of understanding when results are driven by general areas of overall risk vs. more complex interactions. The method can form a new approach to continuous learning and evidence-based development of clinical trial testable hypothesis and stratifications.
期刊介绍:
International Journal of Radiation Oncology • Biology • Physics (IJROBP), known in the field as the Red Journal, publishes original laboratory and clinical investigations related to radiation oncology, radiation biology, medical physics, and both education and health policy as it relates to the field.
This journal has a particular interest in original contributions of the following types: prospective clinical trials, outcomes research, and large database interrogation. In addition, it seeks reports of high-impact innovations in single or combined modality treatment, tumor sensitization, normal tissue protection (including both precision avoidance and pharmacologic means), brachytherapy, particle irradiation, and cancer imaging. Technical advances related to dosimetry and conformal radiation treatment planning are of interest, as are basic science studies investigating tumor physiology and the molecular biology underlying cancer and normal tissue radiation response.