Lung Cancer Prediction Using Curriculum Learning Based Deep Neural Networks

Jackson Zhou, Matloob Khushi, M. Moni, M. S. Uddin, S. Poon
{"title":"Lung Cancer Prediction Using Curriculum Learning Based Deep Neural Networks","authors":"Jackson Zhou, Matloob Khushi, M. Moni, M. S. Uddin, S. Poon","doi":"10.1109/icdh52753.2021.00013","DOIUrl":null,"url":null,"abstract":"The high incidence and low survival rate of lung cancers contribute to their high death count, and drive the development of lung cancer prediction models using demographic factors. The five year relative survival rate of small cell lung cancer in particular (6%) is four times less than that of non small cell lung cancer (23%), though no predictive models have been developed for it so far. This study aimed to expand on previous lung cancer prediction studies and develop improved models for general and small cell lung cancer prediction. Established machine learning models were considered, in addition to a novel curriculum learning based deep neural network. All models were evaluated using data from the National Cancer Institute's Prostate, Lung, Colorectal and Ovarian Cancer screening trial, with performance measured using the area under the receiver operator characteristic curve (AUROC). Random forest models were found to give the best performances in lung cancer prediction (bootstrap optimism corrected (BOC) $\\text{AUROC}\\ = {0.927}$), outperforming previous logistic regression models $(\\text{BOC} \\text{AUROC} ={0.859})$. Additionally, curriculum learning based neural networks were shown to outperform all other model types for small cell lung cancer prediction in particular (AUROCs of 0.873 and 0.882 across two feature sets). To conclude, high-performance models were developed for general and small cell lung cancer prediction, and could help improve non-invasive lung cancer prediction in a clinical setting.","PeriodicalId":93401,"journal":{"name":"2021 IEEE International Conference on Digital Health (ICDH)","volume":"2007 1","pages":"11-18"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Digital Health (ICDH)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/icdh52753.2021.00013","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

The high incidence and low survival rate of lung cancers contribute to their high death count, and drive the development of lung cancer prediction models using demographic factors. The five year relative survival rate of small cell lung cancer in particular (6%) is four times less than that of non small cell lung cancer (23%), though no predictive models have been developed for it so far. This study aimed to expand on previous lung cancer prediction studies and develop improved models for general and small cell lung cancer prediction. Established machine learning models were considered, in addition to a novel curriculum learning based deep neural network. All models were evaluated using data from the National Cancer Institute's Prostate, Lung, Colorectal and Ovarian Cancer screening trial, with performance measured using the area under the receiver operator characteristic curve (AUROC). Random forest models were found to give the best performances in lung cancer prediction (bootstrap optimism corrected (BOC) $\text{AUROC}\ = {0.927}$), outperforming previous logistic regression models $(\text{BOC} \text{AUROC} ={0.859})$. Additionally, curriculum learning based neural networks were shown to outperform all other model types for small cell lung cancer prediction in particular (AUROCs of 0.873 and 0.882 across two feature sets). To conclude, high-performance models were developed for general and small cell lung cancer prediction, and could help improve non-invasive lung cancer prediction in a clinical setting.
基于课程学习的深度神经网络肺癌预测
肺癌的高发病率和低生存率导致其高死亡率,并推动了基于人口统计学因素的肺癌预测模型的发展。特别是小细胞肺癌的5年相对生存率(6%)比非小细胞肺癌(23%)低4倍,尽管迄今为止还没有开发出预测模型。本研究旨在扩展先前肺癌预测研究,并开发改进的一般和小细胞肺癌预测模型。除了基于深度神经网络的课程学习之外,还考虑了已建立的机器学习模型。所有模型均采用美国国家癌症研究所前列腺癌、肺癌、结直肠癌和卵巢癌筛查试验的数据进行评估,并使用受试者操作者特征曲线(AUROC)下的面积来衡量其性能。随机森林模型在肺癌预测中表现最佳(bootstrap乐观修正(BOC) $\text{AUROC}\ ={0.927}$),优于先前的逻辑回归模型$(\text{BOC} \text{AUROC} ={0.859})$。此外,基于课程学习的神经网络在小细胞肺癌预测方面的表现优于所有其他模型类型(两个特征集的auroc分别为0.873和0.882)。总之,我们开发了用于一般和小细胞肺癌预测的高性能模型,可以帮助改善临床环境中的非侵入性肺癌预测。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信