Indifference subspace of deep features for lung nodule classification from CT images

IF 7.5 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Expert Systems with Applications Pub Date : 2024-10-28 DOI:10.1016/j.eswa.2024.125571

Mohamad M.A. Ashames , Ahmet Demir , Mehmet Koc , Mehmet Fidan , Semih Ergin , Mehmet Bilginer Gulmezoglu , Atalay Barkana , Omer Nezih Gerek

{"title":"Indifference subspace of deep features for lung nodule classification from CT images","authors":"Mohamad M.A. Ashames , Ahmet Demir , Mehmet Koc , Mehmet Fidan , Semih Ergin , Mehmet Bilginer Gulmezoglu , Atalay Barkana , Omer Nezih Gerek","doi":"10.1016/j.eswa.2024.125571","DOIUrl":null,"url":null,"abstract":"<div><div>Deep learning (DL) has made substantial contributions to automated diagnoses in biomedical imaging, with various architectures extensively used for critical classifications such as lung nodule detection from CT scans. Despite satisfactory results from basic DL implementations, understanding DL’s inner mechanisms and parameter evolution remains understudied. DL layers typically favor nodes with larger activation values, facilitating a softmax-type decision post-training. This aligns with various alternative final-layer replacements like support vector machines (SVM), random forest, naive Bayes, and k-nearest neighbor (k-NN). However, replacing the decision layer with a classifier that operates in the so-called indifference subspace, like the common vector approach (CVA), may disrupt the standard paradigm, as it requires commonality in feature node magnitudes rather than large feature values. This study investigates the feasibility of adapting standard DL architectures to generate feature nodes with common magnitudes conducive to CVA fine-tuning. Surprisingly, we find that DL networks, even without explicit design for this purpose, can achieve remarkable classification accuracies through CVA, effectively on par with state-of-the-art results. The intriguing high classification accuracy is examined through the relationship between “indifference subspace” and “node value,” scrutinized via an expansive suite of DL architectures, with and without ImageNet pre-training. Although the aim of the study is limited to the possibility of subspace alignment in the feature layers of convolutional neural networks (CNNs), the results demonstrate that CVA fine-tuning not only challenges the prevailing paradigms within DL classifications but also unveils a novel pathway for possibly enhancing classification performance in biomedical imaging, particularly for lung nodule detection.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"262 ","pages":"Article 125571"},"PeriodicalIF":7.5000,"publicationDate":"2024-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417424024382","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Deep learning (DL) has made substantial contributions to automated diagnoses in biomedical imaging, with various architectures extensively used for critical classifications such as lung nodule detection from CT scans. Despite satisfactory results from basic DL implementations, understanding DL’s inner mechanisms and parameter evolution remains understudied. DL layers typically favor nodes with larger activation values, facilitating a softmax-type decision post-training. This aligns with various alternative final-layer replacements like support vector machines (SVM), random forest, naive Bayes, and k-nearest neighbor (k-NN). However, replacing the decision layer with a classifier that operates in the so-called indifference subspace, like the common vector approach (CVA), may disrupt the standard paradigm, as it requires commonality in feature node magnitudes rather than large feature values. This study investigates the feasibility of adapting standard DL architectures to generate feature nodes with common magnitudes conducive to CVA fine-tuning. Surprisingly, we find that DL networks, even without explicit design for this purpose, can achieve remarkable classification accuracies through CVA, effectively on par with state-of-the-art results. The intriguing high classification accuracy is examined through the relationship between “indifference subspace” and “node value,” scrutinized via an expansive suite of DL architectures, with and without ImageNet pre-training. Although the aim of the study is limited to the possibility of subspace alignment in the feature layers of convolutional neural networks (CNNs), the results demonstrate that CVA fine-tuning not only challenges the prevailing paradigms within DL classifications but also unveils a novel pathway for possibly enhancing classification performance in biomedical imaging, particularly for lung nodule detection.

查看原文本刊更多论文

用于 CT 图像肺结节分类的深度特征无差异子空间

深度学习（DL）为生物医学成像领域的自动诊断做出了巨大贡献，各种架构被广泛用于关键分类，如从 CT 扫描中检测肺结节。尽管基本的 DL 实现取得了令人满意的结果，但人们对 DL 的内在机制和参数演化的了解仍然不足。DL 层通常偏向于激活值较大的节点，有利于训练后的软最大决策。这与支持向量机（SVM）、随机森林、天真贝叶斯和 k 近邻（k-NN）等各种最终层替代方法一致。然而，用在所谓的无差异子空间中运行的分类器（如公共向量方法（CVA））来替换决策层可能会破坏标准范式，因为它需要特征节点大小的共性，而不是大的特征值。本研究探讨了调整标准 DL 架构以生成具有有利于 CVA 微调的共同大小的特征节点的可行性。令人惊讶的是，我们发现即使没有为此目的进行明确的设计，DL 网络也能通过 CVA 获得显著的分类准确性，与最先进的结果不相上下。我们通过 "无偏好子空间 "和 "节点值 "之间的关系，对令人费解的高分类准确性进行了研究，并通过广泛的 DL 架构套件，在进行和不进行 ImageNet 预训练的情况下进行了仔细研究。虽然这项研究的目的仅限于卷积神经网络（CNN）特征层中子空间排列的可能性，但研究结果表明，CVA 微调不仅挑战了 DL 分类中的主流范式，还揭示了一种可能提高生物医学成像分类性能的新途径，尤其是在肺结节检测方面。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Expert Systems with Applications 工程技术-工程：电子与电气

CiteScore

13.80

自引率

10.60%

发文量

2045

审稿时长

8.7 months

期刊介绍： Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.