{"title":"Identifying chemotherapy beneficiaries in nasal and paranasal sinus cancers: epidemiological trends and machine learning insights.","authors":"Zihan Chen, Zongwei Huang, Yuhui Pan, Youliang Weng, Zijie Wu, Jing Wang, Wenxi Wu, Xinyi Hong, Xin Chen, Sufang Qiu","doi":"10.1186/s40001-025-02425-6","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Studies on the epidemiological characteristics, treatment strategies and prognosis of nasal and paranasal sinus cancer are still relatively limited.</p><p><strong>Methods: </strong>This study analyzed the age-adjusted incidence rates of nasal and paranasal sinus cancer from 1975 to 2020 using SEER database data. We conducted an in-depth examination of patients diagnosed between 2004 and 2015 with SEER*Stat software. A retrospective study from Fujian Provincial Cancer Hospital (2013-2020) provided an external validation set. Multiple imputation methods in R were used to address missing data. Survival analyses were performed using Kaplan-Meier and Cox proportional hazards models. Additionally, ten advanced machine learning models were utilized and evaluated in Python to predict patient survival outcomes.</p><p><strong>Results: </strong>This study analyzed data from 3,190 patients. The annual percent change (APC) in incidence rates per 100 000 person-years was 0.36 until 2012, subsequently decreasing to - 1.79. Among various predictive models, the gradient boosting classifier demonstrated superior performance with an area under the curve (AUC) of 0.699 and an accuracy rate of 0.708. Chemotherapy did not significantly influence overall mortality risk (HR = 0.93, 95% CI 0.82-1.05, P = 0.27). Chemotherapy showed potential benefits in specific patient subgroups.</p><p><strong>Conclusions: </strong>This study revealed a declining trend in incidence rates beginning in 2012. The gradient boosting model demonstrated robust performance, playing a crucial role in predicting patient prognosis and the significance of chemotherapy.</p>","PeriodicalId":11949,"journal":{"name":"European Journal of Medical Research","volume":"30 1","pages":"218"},"PeriodicalIF":2.8000,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11959763/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Medical Research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s40001-025-02425-6","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICINE, RESEARCH & EXPERIMENTAL","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Studies on the epidemiological characteristics, treatment strategies and prognosis of nasal and paranasal sinus cancer are still relatively limited.
Methods: This study analyzed the age-adjusted incidence rates of nasal and paranasal sinus cancer from 1975 to 2020 using SEER database data. We conducted an in-depth examination of patients diagnosed between 2004 and 2015 with SEER*Stat software. A retrospective study from Fujian Provincial Cancer Hospital (2013-2020) provided an external validation set. Multiple imputation methods in R were used to address missing data. Survival analyses were performed using Kaplan-Meier and Cox proportional hazards models. Additionally, ten advanced machine learning models were utilized and evaluated in Python to predict patient survival outcomes.
Results: This study analyzed data from 3,190 patients. The annual percent change (APC) in incidence rates per 100 000 person-years was 0.36 until 2012, subsequently decreasing to - 1.79. Among various predictive models, the gradient boosting classifier demonstrated superior performance with an area under the curve (AUC) of 0.699 and an accuracy rate of 0.708. Chemotherapy did not significantly influence overall mortality risk (HR = 0.93, 95% CI 0.82-1.05, P = 0.27). Chemotherapy showed potential benefits in specific patient subgroups.
Conclusions: This study revealed a declining trend in incidence rates beginning in 2012. The gradient boosting model demonstrated robust performance, playing a crucial role in predicting patient prognosis and the significance of chemotherapy.
背景:目前对鼻窦及副鼻窦癌的流行病学特征、治疗策略及预后的研究仍相对有限。方法:本研究使用SEER数据库数据,分析1975年至2020年鼻窦和副鼻窦癌的年龄调整发病率。我们使用SEER*Stat软件对2004年至2015年间诊断的患者进行了深入检查。福建省肿瘤医院2013-2020年的回顾性研究提供了外部验证集。在R中使用多种插值方法来解决缺失数据。生存率分析采用Kaplan-Meier和Cox比例风险模型。此外,在Python中使用并评估了十个先进的机器学习模型来预测患者的生存结果。结果:本研究分析了3190例患者的数据。到2012年,每10万人年发病率的年变化百分比(APC)为0.36,随后下降到- 1.79。在各种预测模型中,梯度增强分类器的曲线下面积(AUC)为0.699,准确率为0.708,表现出较好的性能。化疗对总死亡风险无显著影响(HR = 0.93, 95% CI 0.82-1.05, P = 0.27)。化疗在特定的患者亚组中显示出潜在的益处。结论:本研究显示从2012年开始发病率呈下降趋势。梯度增强模型表现出稳健的性能,在预测患者预后和化疗意义方面发挥着至关重要的作用。
期刊介绍:
European Journal of Medical Research publishes translational and clinical research of international interest across all medical disciplines, enabling clinicians and other researchers to learn about developments and innovations within these disciplines and across the boundaries between disciplines. The journal publishes high quality research and reviews and aims to ensure that the results of all well-conducted research are published, regardless of their outcome.