MIAFEx: An attention-based feature extraction method for medical image classification

IF 7.6 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Knowledge-Based Systems Pub Date : 2025-09-15 DOI:10.1016/j.knosys.2025.114468

Oscar Ramos-Soto , Jorge Ramos-Frutos , Ezequiel Pérez-Zarate , Diego Oliva , Sandra E. Balderas-Mata

{"title":"MIAFEx: An attention-based feature extraction method for medical image classification","authors":"Oscar Ramos-Soto , Jorge Ramos-Frutos , Ezequiel Pérez-Zarate , Diego Oliva , Sandra E. Balderas-Mata","doi":"10.1016/j.knosys.2025.114468","DOIUrl":null,"url":null,"abstract":"<div><div>Feature extraction techniques are crucial in medical image classification; however, classical feature extractors, in addition to traditional machine learning classifiers, often exhibit significant limitations in providing sufficient discriminative information for complex image sets. While Convolutional Neural Networks (CNNs) and Vision Transformer (ViT) have shown promise in feature extraction, they are prone to overfitting due to the inherent characteristics of medical imaging data, including small sample sizes or high intra-class variance. In this work, the Medical Image Attention-based Feature Extractor (MIAFEx) is proposed, a novel method that employs a learnable refinement mechanism to enhance the classification token within the Transformer encoder architecture. This mechanism adjusts the token based on learned weights, improving the extraction of salient features and enhancing the model’s adaptability to the challenges presented by medical imaging data. The MIAFEx output feature quality is compared against classical feature extractors using traditional and hybrid classifiers. Also, the performance of these features is compared against modern CNN and ViT models in classification tasks, demonstrating their superiority in accuracy and robustness across multiple complex medical imaging datasets. This advantage is particularly pronounced in scenarios with limited training data, where traditional and modern models often struggle to generalize effectively. The source code of this proposal can be found at <span><span>github.com/Oscar-RamosS/Medical-Image-Attention-based-Feature-Extractor-MIAFEx</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"330 ","pages":"Article 114468"},"PeriodicalIF":7.6000,"publicationDate":"2025-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950705125015072","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Feature extraction techniques are crucial in medical image classification; however, classical feature extractors, in addition to traditional machine learning classifiers, often exhibit significant limitations in providing sufficient discriminative information for complex image sets. While Convolutional Neural Networks (CNNs) and Vision Transformer (ViT) have shown promise in feature extraction, they are prone to overfitting due to the inherent characteristics of medical imaging data, including small sample sizes or high intra-class variance. In this work, the Medical Image Attention-based Feature Extractor (MIAFEx) is proposed, a novel method that employs a learnable refinement mechanism to enhance the classification token within the Transformer encoder architecture. This mechanism adjusts the token based on learned weights, improving the extraction of salient features and enhancing the model’s adaptability to the challenges presented by medical imaging data. The MIAFEx output feature quality is compared against classical feature extractors using traditional and hybrid classifiers. Also, the performance of these features is compared against modern CNN and ViT models in classification tasks, demonstrating their superiority in accuracy and robustness across multiple complex medical imaging datasets. This advantage is particularly pronounced in scenarios with limited training data, where traditional and modern models often struggle to generalize effectively. The source code of this proposal can be found at github.com/Oscar-RamosS/Medical-Image-Attention-based-Feature-Extractor-MIAFEx.

查看原文本刊更多论文

MIAFEx：一种基于注意力的医学图像分类特征提取方法

特征提取技术是医学图像分类的关键技术。然而，除了传统的机器学习分类器之外，经典的特征提取器在为复杂图像集提供足够的判别信息方面经常表现出明显的局限性。虽然卷积神经网络（cnn）和视觉变换（ViT）在特征提取方面表现出了良好的前景，但由于医学成像数据的固有特征，包括小样本量或高类内方差，它们容易过度拟合。在这项工作中，提出了基于医学图像注意力的特征提取器（MIAFEx），这是一种采用可学习的改进机制来增强Transformer编码器架构中的分类令牌的新方法。该机制根据学习到的权重调整token，提高了显著特征的提取，增强了模型对医学影像数据挑战的适应性。将MIAFEx输出的特征质量与使用传统分类器和混合分类器的经典特征提取器进行比较。此外，将这些特征与现代CNN和ViT模型在分类任务中的性能进行了比较，证明了它们在多个复杂医学成像数据集上的准确性和鲁棒性优势。这种优势在训练数据有限的情况下尤其明显，传统和现代模型往往难以有效地进行泛化。该提案的源代码可以在github.com/Oscar-RamosS/Medical-Image-Attention-based-Feature-Extractor-MIAFEx上找到。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Knowledge-Based Systems 工程技术-计算机：人工智能

CiteScore

14.80

自引率

12.50%

发文量

1245

审稿时长

7.8 months

期刊介绍： Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.