Marcos Antonio Dias Lima, Carlos Frederico Motta Vasconcelos, Roberto Macoto Ichinose, Antonio Mauricio Ferreira Leite Miranda de Sá
{"title":"Classification of non-small cell lung cancer by histologic subtype using deep learning in public and private data sets of computed tomography images.","authors":"Marcos Antonio Dias Lima, Carlos Frederico Motta Vasconcelos, Roberto Macoto Ichinose, Antonio Mauricio Ferreira Leite Miranda de Sá","doi":"10.1590/0100-3984.2024.0093","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>To develop a deep learning system to classify non-small cell lung cancer (NSCLC) by histologic subtype-adenocarcinoma or squamous cell carcinoma (SCC)-from computed tomography (CT) images in which the tumor regions were segmented, comparing our results with those of similar studies conducted in other countries and evaluating the accuracy of automated classification by using data from the Instituto Nacional de Câncer, Brazil.</p><p><strong>Materials and methods: </strong>To develop the classification system, we employed a 2D U-Net neural network for semantic segmentation, with data augmentation and preprocessing steps. It was pretrained on 28,506 CT images from The Cancer Image Archive, a private database, and validated on 2,015 of those images. To develop the classification algorithm, we used a VGG16-based network, modified for better performance, with 3,080 images of adenocarcinoma and SCC from the Instituto Nacional de Câncer database.</p><p><strong>Results: </strong>The algorithm achieved an accuracy of 84.5% for detecting adenocarcinoma and 89.6% for detecting SCC, with sensitivities of 91.7% and 90.4%, respectively, which are considered satisfactory when compared with the values obtained in similar studies.</p><p><strong>Conclusion: </strong>The system developed appears to provide accurate automated detection, as well as tumor segmentation and classification of NSCLC subtypes of a local population using deep learning networks trained using public image data sets. This method could assist oncological radiologists by improving the efficiency of preliminary diagnoses.</p>","PeriodicalId":20842,"journal":{"name":"Radiologia Brasileira","volume":"58 ","pages":"e20240093"},"PeriodicalIF":0.0000,"publicationDate":"2025-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12142743/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Radiologia Brasileira","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1590/0100-3984.2024.0093","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q3","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0
Abstract
Objective: To develop a deep learning system to classify non-small cell lung cancer (NSCLC) by histologic subtype-adenocarcinoma or squamous cell carcinoma (SCC)-from computed tomography (CT) images in which the tumor regions were segmented, comparing our results with those of similar studies conducted in other countries and evaluating the accuracy of automated classification by using data from the Instituto Nacional de Câncer, Brazil.
Materials and methods: To develop the classification system, we employed a 2D U-Net neural network for semantic segmentation, with data augmentation and preprocessing steps. It was pretrained on 28,506 CT images from The Cancer Image Archive, a private database, and validated on 2,015 of those images. To develop the classification algorithm, we used a VGG16-based network, modified for better performance, with 3,080 images of adenocarcinoma and SCC from the Instituto Nacional de Câncer database.
Results: The algorithm achieved an accuracy of 84.5% for detecting adenocarcinoma and 89.6% for detecting SCC, with sensitivities of 91.7% and 90.4%, respectively, which are considered satisfactory when compared with the values obtained in similar studies.
Conclusion: The system developed appears to provide accurate automated detection, as well as tumor segmentation and classification of NSCLC subtypes of a local population using deep learning networks trained using public image data sets. This method could assist oncological radiologists by improving the efficiency of preliminary diagnoses.
目的:开发一个深度学习系统,根据肿瘤区域被分割的计算机断层扫描(CT)图像的组织学亚型(腺癌或鳞状细胞癌(SCC))对非小细胞肺癌(NSCLC)进行分类,将我们的结果与其他国家进行的类似研究结果进行比较,并使用巴西国立癌症研究所(Instituto Nacional de ncer)的数据评估自动分类的准确性。材料和方法:为了开发分类系统,我们采用二维U-Net神经网络进行语义分割,并进行数据增强和预处理步骤。它在来自癌症图像档案(一个私人数据库)的28506张CT图像上进行了预训练,并在其中的2015张图像上进行了验证。为了开发分类算法,我们使用了一个基于vgg16的网络,并对其进行了改进以获得更好的性能,该网络使用了来自Instituto national de cnc数据库的3080张腺癌和SCC图像。结果:该算法对腺癌的检测准确率为84.5%,对SCC的检测准确率为89.6%,灵敏度分别为91.7%和90.4%,与同类研究的结果相比,这是令人满意的。结论:开发的系统似乎可以提供准确的自动检测,以及使用使用公共图像数据集训练的深度学习网络对当地人群的NSCLC亚型进行肿瘤分割和分类。该方法可以帮助肿瘤放射科医师提高初步诊断的效率。