Mohamad M.A. Ashames, Ahmet Demir, Mehmet Koc, Mehmet Fidan, Semih Ergin, Mehmet Bilginer Gulmezoglu, Atalay Barkana, Omer Nezih Gerek
Title: Indifference subspace of deep features for lung nodule classification from CT images
DOI: 10.1016/j.eswa.2024.125571
Journal: Expert Systems with Applications, volume 262, Article 125571
Publication date: 2024-10-28
Publication type: Journal Article
Impact factor: 7.5 (JCR Q1, Computer Science, Artificial Intelligence)
Open access: no
URL: https://www.sciencedirect.com/science/article/pii/S0957417424024382
Citation count: 0
Abstract
Deep learning (DL) has made substantial contributions to automated diagnoses in biomedical imaging, with various architectures extensively used for critical classifications such as lung nodule detection from CT scans. Despite satisfactory results from basic DL implementations, DL’s inner mechanisms and parameter evolution remain understudied. DL layers typically favor nodes with larger activation values, which facilitates a softmax-type decision after training. This behavior is compatible with alternative final-layer replacements such as support vector machines (SVM), random forest, naive Bayes, and k-nearest neighbor (k-NN). However, replacing the decision layer with a classifier that operates in the so-called indifference subspace, such as the common vector approach (CVA), may disrupt the standard paradigm, because it requires commonality in feature node magnitudes rather than large feature values. This study investigates the feasibility of adapting standard DL architectures to generate feature nodes with common magnitudes conducive to CVA fine-tuning. Surprisingly, we find that DL networks, even without explicit design for this purpose, can achieve remarkable classification accuracies through CVA, effectively on par with state-of-the-art results. This intriguingly high classification accuracy is examined through the relationship between “indifference subspace” and “node value,” scrutinized via an expansive suite of DL architectures, with and without ImageNet pre-training. Although the aim of the study is limited to the possibility of subspace alignment in the feature layers of convolutional neural networks (CNNs), the results demonstrate that CVA fine-tuning not only challenges the prevailing paradigms within DL classifications but also unveils a novel pathway for possibly enhancing classification performance in biomedical imaging, particularly for lung nodule detection.
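To make the idea of classifying in an indifference subspace concrete, the following is a minimal NumPy sketch of a common-vector-style classifier applied to feature vectors. It is an illustration under stated assumptions, not the authors' implementation: the features are assumed to be CNN feature-layer activations extracted beforehand, the function names `fit_cva` and `predict_cva` are hypothetical, and the `energy` threshold used to split each class's directions into a difference subspace and an indifference subspace is an assumed design choice that the abstract does not specify.

```python
import numpy as np


def fit_cva(features, labels, energy=0.95):
    """Fit one (mean, difference-subspace basis) pair per class.

    `energy` is a hypothetical knob: the fraction of within-class variance
    assigned to the difference subspace; the remaining directions are
    treated as the indifference subspace.
    """
    models = {}
    for c in np.unique(labels):
        X = features[labels == c]
        mean = X.mean(axis=0)
        # Right singular vectors of the centered samples, ordered by
        # decreasing within-class variance.
        _, s, vt = np.linalg.svd(X - mean, full_matrices=False)
        var = s ** 2
        total = var.sum()
        if total > 0:
            k = int(np.searchsorted(np.cumsum(var) / total, energy)) + 1
            k = min(k, len(s))
        else:
            k = 0  # all samples identical: no difference directions
        models[c] = (mean, vt[:k])
    return models


def predict_cva(models, x):
    """Return the class whose common vector is closest to x, measured
    inside that class's indifference subspace."""
    best_class, best_dist = None, np.inf
    for c, (mean, basis) in models.items():
        d = x - mean
        # Remove the part of d lying in the difference subspace; what is
        # left is its projection onto the indifference subspace.
        residual = d - basis.T @ (basis @ d)
        dist = float(residual @ residual)
        if dist < best_dist:
            best_class, best_dist = c, dist
    return best_class
```

Under these assumptions, a class is recognized by how well a sample matches the values that stay constant across that class's training features, not by which feature nodes are largest. Calling `fit_cva(train_features, train_labels)` once and then `predict_cva(models, feature_vector)` per test sample would stand in for the softmax decision layer.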
About the journal:
Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.