Machine Learning with Data Science-Enabled Lung Cancer Diagnosis and Classification Using Computed Tomography Images

S. Kiran, Inderjeet Kaur, K. Thangaraj, V. Saveetha, R. Grace, N. Arulkumar
{"title":"Machine Learning with Data Science-Enabled Lung Cancer Diagnosis and Classification Using Computed Tomography Images","authors":"S. Kiran, Inderjeet Kaur, K. Thangaraj, V. Saveetha, R. Grace, N. Arulkumar","doi":"10.1142/s0219467822400022","DOIUrl":null,"url":null,"abstract":"In recent times, the healthcare industry has been generating a significant amount of data in distinct formats, such as electronic health records (EHR), clinical trials, genetic data, payments, scientific articles, wearables, and care management databases. Data science is useful for analysis (pattern recognition, hypothesis testing, risk valuation) and prediction. The major, primary usage of data science in the healthcare domain is in medical imaging. At the same time, lung cancer diagnosis has become a hot research topic, as automated disease detection poses numerous benefits. Although numerous approaches have existed in the literature for lung cancer diagnosis, the design of a novel model to automatically identify lung cancer is a challenging task. In this view, this paper designs an automated machine learning (ML) with data science-enabled lung cancer diagnosis and classification (MLDS-LCDC) using computed tomography (CT) images. The presented model initially employs Gaussian filtering (GF)-based pre-processing technique on the CT images collected from the lung cancer database. Besides, they are fed into the normalized cuts (Ncuts) technique where the nodule in the pre-processed image can be determined. Moreover, the oriented FAST and rotated BRIEF (ORB) technique is applied as a feature extractor. At last, sunflower optimization-based wavelet neural network (SFO-WNN) model is employed for the classification of lung cancer. In order to examine the diagnostic outcome of the MLDS-LCDC model, a set of experiments were carried out and the results are investigated in terms of different aspects. The resultant values demonstrated the effectiveness of the MLDS-LCDC model over the other state-of-the-art methods with the maximum sensitivity of 97.01%, specificity of 98.64%, and accuracy of 98.11%.","PeriodicalId":177479,"journal":{"name":"Int. J. Image Graph.","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Int. J. Image Graph.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1142/s0219467822400022","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

In recent times, the healthcare industry has been generating a significant amount of data in distinct formats, such as electronic health records (EHR), clinical trials, genetic data, payments, scientific articles, wearables, and care management databases. Data science is useful for analysis (pattern recognition, hypothesis testing, risk valuation) and prediction. The major, primary usage of data science in the healthcare domain is in medical imaging. At the same time, lung cancer diagnosis has become a hot research topic, as automated disease detection poses numerous benefits. Although numerous approaches have existed in the literature for lung cancer diagnosis, the design of a novel model to automatically identify lung cancer is a challenging task. In this view, this paper designs an automated machine learning (ML) with data science-enabled lung cancer diagnosis and classification (MLDS-LCDC) using computed tomography (CT) images. The presented model initially employs Gaussian filtering (GF)-based pre-processing technique on the CT images collected from the lung cancer database. Besides, they are fed into the normalized cuts (Ncuts) technique where the nodule in the pre-processed image can be determined. Moreover, the oriented FAST and rotated BRIEF (ORB) technique is applied as a feature extractor. At last, sunflower optimization-based wavelet neural network (SFO-WNN) model is employed for the classification of lung cancer. In order to examine the diagnostic outcome of the MLDS-LCDC model, a set of experiments were carried out and the results are investigated in terms of different aspects. The resultant values demonstrated the effectiveness of the MLDS-LCDC model over the other state-of-the-art methods with the maximum sensitivity of 97.01%, specificity of 98.64%, and accuracy of 98.11%.
机器学习与数据科学支持肺癌诊断和分类使用计算机断层扫描图像
最近,医疗保健行业一直在以不同格式生成大量数据,例如电子健康记录(EHR)、临床试验、基因数据、支付、科学文章、可穿戴设备和护理管理数据库。数据科学对分析(模式识别、假设检验、风险评估)和预测很有用。数据科学在医疗保健领域的主要用途是医学成像。与此同时,肺癌诊断已成为一个热门的研究课题,因为自动化疾病检测带来了许多好处。虽然文献中已有许多方法用于肺癌诊断,但设计一种新的模型来自动识别肺癌是一项具有挑战性的任务。在这种观点下,本文设计了一个使用计算机断层扫描(CT)图像的具有数据科学支持的肺癌诊断和分类(MLDS-LCDC)的自动机器学习(ML)。该模型首先采用基于高斯滤波(GF)的预处理技术对肺癌数据库中的CT图像进行预处理。此外,它们被输入到归一化切割(Ncuts)技术中,可以确定预处理图像中的结节。此外,采用定向FAST和旋转BRIEF (ORB)技术作为特征提取器。最后,采用基于向日葵优化的小波神经网络(SFO-WNN)模型对肺癌进行分类。为了检验MLDS-LCDC模型的诊断效果,我们进行了一组实验,并从不同方面对结果进行了研究。结果表明,MLDS-LCDC模型的灵敏度为97.01%,特异性为98.64%,准确度为98.11%,优于其他最先进的方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信