Lung Cancer Prediction Using Curriculum Learning Based Deep Neural Networks

2021 IEEE International Conference on Digital Health (ICDH) Pub Date : 2021-09-01 DOI:10.1109/icdh52753.2021.00013

Jackson Zhou, Matloob Khushi, M. Moni, M. S. Uddin, S. Poon

{"title":"Lung Cancer Prediction Using Curriculum Learning Based Deep Neural Networks","authors":"Jackson Zhou, Matloob Khushi, M. Moni, M. S. Uddin, S. Poon","doi":"10.1109/icdh52753.2021.00013","DOIUrl":null,"url":null,"abstract":"The high incidence and low survival rate of lung cancers contribute to their high death count, and drive the development of lung cancer prediction models using demographic factors. The five year relative survival rate of small cell lung cancer in particular (6%) is four times less than that of non small cell lung cancer (23%), though no predictive models have been developed for it so far. This study aimed to expand on previous lung cancer prediction studies and develop improved models for general and small cell lung cancer prediction. Established machine learning models were considered, in addition to a novel curriculum learning based deep neural network. All models were evaluated using data from the National Cancer Institute's Prostate, Lung, Colorectal and Ovarian Cancer screening trial, with performance measured using the area under the receiver operator characteristic curve (AUROC). Random forest models were found to give the best performances in lung cancer prediction (bootstrap optimism corrected (BOC) $\\text{AUROC}\\ = {0.927}$), outperforming previous logistic regression models $(\\text{BOC} \\text{AUROC} ={0.859})$. Additionally, curriculum learning based neural networks were shown to outperform all other model types for small cell lung cancer prediction in particular (AUROCs of 0.873 and 0.882 across two feature sets). To conclude, high-performance models were developed for general and small cell lung cancer prediction, and could help improve non-invasive lung cancer prediction in a clinical setting.","PeriodicalId":93401,"journal":{"name":"2021 IEEE International Conference on Digital Health (ICDH)","volume":"2007 1","pages":"11-18"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Digital Health (ICDH)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/icdh52753.2021.00013","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

The high incidence and low survival rate of lung cancers contribute to their high death count, and drive the development of lung cancer prediction models using demographic factors. The five year relative survival rate of small cell lung cancer in particular (6%) is four times less than that of non small cell lung cancer (23%), though no predictive models have been developed for it so far. This study aimed to expand on previous lung cancer prediction studies and develop improved models for general and small cell lung cancer prediction. Established machine learning models were considered, in addition to a novel curriculum learning based deep neural network. All models were evaluated using data from the National Cancer Institute's Prostate, Lung, Colorectal and Ovarian Cancer screening trial, with performance measured using the area under the receiver operator characteristic curve (AUROC). Random forest models were found to give the best performances in lung cancer prediction (bootstrap optimism corrected (BOC) $\text{AUROC}\ = {0.927}$), outperforming previous logistic regression models $(\text{BOC} \text{AUROC} ={0.859})$. Additionally, curriculum learning based neural networks were shown to outperform all other model types for small cell lung cancer prediction in particular (AUROCs of 0.873 and 0.882 across two feature sets). To conclude, high-performance models were developed for general and small cell lung cancer prediction, and could help improve non-invasive lung cancer prediction in a clinical setting.

查看原文本刊更多论文

基于课程学习的深度神经网络肺癌预测

肺癌的高发病率和低生存率导致其高死亡率，并推动了基于人口统计学因素的肺癌预测模型的发展。特别是小细胞肺癌的5年相对生存率(6%)比非小细胞肺癌(23%)低4倍，尽管迄今为止还没有开发出预测模型。本研究旨在扩展先前肺癌预测研究，并开发改进的一般和小细胞肺癌预测模型。除了基于深度神经网络的课程学习之外，还考虑了已建立的机器学习模型。所有模型均采用美国国家癌症研究所前列腺癌、肺癌、结直肠癌和卵巢癌筛查试验的数据进行评估，并使用受试者操作者特征曲线(AUROC)下的面积来衡量其性能。随机森林模型在肺癌预测中表现最佳(bootstrap乐观修正(BOC) $\text{AUROC}\ ={0.927}$)，优于先前的逻辑回归模型$(\text{BOC} \text{AUROC} ={0.859})$。此外，基于课程学习的神经网络在小细胞肺癌预测方面的表现优于所有其他模型类型(两个特征集的auroc分别为0.873和0.882)。总之，我们开发了用于一般和小细胞肺癌预测的高性能模型，可以帮助改善临床环境中的非侵入性肺癌预测。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 IEEE International Conference on Digital Health (ICDH)

自引率

0.00%

发文量