Jackson Zhou, Matloob Khushi, M. Moni, M. S. Uddin, S. Poon
{"title":"Lung Cancer Prediction Using Curriculum Learning Based Deep Neural Networks","authors":"Jackson Zhou, Matloob Khushi, M. Moni, M. S. Uddin, S. Poon","doi":"10.1109/icdh52753.2021.00013","DOIUrl":null,"url":null,"abstract":"The high incidence and low survival rate of lung cancers contribute to their high death count, and drive the development of lung cancer prediction models using demographic factors. The five year relative survival rate of small cell lung cancer in particular (6%) is four times less than that of non small cell lung cancer (23%), though no predictive models have been developed for it so far. This study aimed to expand on previous lung cancer prediction studies and develop improved models for general and small cell lung cancer prediction. Established machine learning models were considered, in addition to a novel curriculum learning based deep neural network. All models were evaluated using data from the National Cancer Institute's Prostate, Lung, Colorectal and Ovarian Cancer screening trial, with performance measured using the area under the receiver operator characteristic curve (AUROC). Random forest models were found to give the best performances in lung cancer prediction (bootstrap optimism corrected (BOC) $\\text{AUROC}\\ = {0.927}$), outperforming previous logistic regression models $(\\text{BOC} \\text{AUROC} ={0.859})$. Additionally, curriculum learning based neural networks were shown to outperform all other model types for small cell lung cancer prediction in particular (AUROCs of 0.873 and 0.882 across two feature sets). To conclude, high-performance models were developed for general and small cell lung cancer prediction, and could help improve non-invasive lung cancer prediction in a clinical setting.","PeriodicalId":93401,"journal":{"name":"2021 IEEE International Conference on Digital Health (ICDH)","volume":"2007 1","pages":"11-18"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Digital Health (ICDH)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/icdh52753.2021.00013","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
The high incidence and low survival rate of lung cancers contribute to their high death count, and drive the development of lung cancer prediction models using demographic factors. The five year relative survival rate of small cell lung cancer in particular (6%) is four times less than that of non small cell lung cancer (23%), though no predictive models have been developed for it so far. This study aimed to expand on previous lung cancer prediction studies and develop improved models for general and small cell lung cancer prediction. Established machine learning models were considered, in addition to a novel curriculum learning based deep neural network. All models were evaluated using data from the National Cancer Institute's Prostate, Lung, Colorectal and Ovarian Cancer screening trial, with performance measured using the area under the receiver operator characteristic curve (AUROC). Random forest models were found to give the best performances in lung cancer prediction (bootstrap optimism corrected (BOC) $\text{AUROC}\ = {0.927}$), outperforming previous logistic regression models $(\text{BOC} \text{AUROC} ={0.859})$. Additionally, curriculum learning based neural networks were shown to outperform all other model types for small cell lung cancer prediction in particular (AUROCs of 0.873 and 0.882 across two feature sets). To conclude, high-performance models were developed for general and small cell lung cancer prediction, and could help improve non-invasive lung cancer prediction in a clinical setting.