EffiCAT:通过多数据集融合和关注机制实现皮肤病分类的协同方法

IF 4.9 2区 医学 Q1 ENGINEERING, BIOMEDICAL
A. Sasithradevi , S. Kanimozhi , Parasa Sasidhar , Pavan Kumar Pulipati , Elavarthi Sruthi , P. Prakash
{"title":"EffiCAT:通过多数据集融合和关注机制实现皮肤病分类的协同方法","authors":"A. Sasithradevi ,&nbsp;S. Kanimozhi ,&nbsp;Parasa Sasidhar ,&nbsp;Pavan Kumar Pulipati ,&nbsp;Elavarthi Sruthi ,&nbsp;P. Prakash","doi":"10.1016/j.bspc.2024.107141","DOIUrl":null,"url":null,"abstract":"<div><div>Early and accurate diagnosis of skin diseases is essential for their efficient treatment and effective management. Conventional approaches typically depend on the use of a single dataset, which can introduce biases and limit the generalizability of the models due to dataset-specific idiosyncrasies. This study presents a novel hybrid model, named EffiCAT (EfficientNet Concatenation Attention Technology), for the categorization of skin diseases, specifically focusing on four classes named Actinic Keratosis (ACK), Basal Cell Carcinoma (BCC), Melanoma (MEL), and Melanocytic Nevus (NEV). EffiCAT enhances traditional approaches by integrating features from two different convolutional neural networks, EfficientNet B0 and EfficientNet B4, through feature concatenation. This is followed by applying advanced attention modules, specifically a Dual Channel Attention Layer applied twice and a Convolutional Block Attention Module (CBAM), to refine feature representation and focus on relevant patterns more effectively. Our method is evaluated on a combined dataset composed of HAM10000 and PAD-UFES-20, which enhances the diversity and volume of training samples to improve generalization across various skin types and conditions. The inclusion of multiple datasets helps mitigate the biases associated with single-dataset training and enhances the robustness of the model. EffiCAT attained a test accuracy of 94.48%, with precision, recall, and F1 score all closely aligned at 94.48%. These metrics not only illustrate the efficacy of our method but also underscore its superiority in handling varied and complex skin disease presentations through refined attention-driven feature concatenation. Additionally, external validation was performed on the ISIC 2018 dataset, where the model achieved a test accuracy of 92.08%, with precision of 92.45%, recall of 92.08%, and an F1 score of 92.15%, further confirming its robustness and generalizability. The model’s architecture efficiently leverages concatenated features enriched with attention mechanisms, setting a new standard for image-based diagnostic models.</div></div>","PeriodicalId":55362,"journal":{"name":"Biomedical Signal Processing and Control","volume":"100 ","pages":"Article 107141"},"PeriodicalIF":4.9000,"publicationDate":"2024-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"EffiCAT: A synergistic approach to skin disease classification through multi-dataset fusion and attention mechanisms\",\"authors\":\"A. Sasithradevi ,&nbsp;S. Kanimozhi ,&nbsp;Parasa Sasidhar ,&nbsp;Pavan Kumar Pulipati ,&nbsp;Elavarthi Sruthi ,&nbsp;P. Prakash\",\"doi\":\"10.1016/j.bspc.2024.107141\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Early and accurate diagnosis of skin diseases is essential for their efficient treatment and effective management. Conventional approaches typically depend on the use of a single dataset, which can introduce biases and limit the generalizability of the models due to dataset-specific idiosyncrasies. This study presents a novel hybrid model, named EffiCAT (EfficientNet Concatenation Attention Technology), for the categorization of skin diseases, specifically focusing on four classes named Actinic Keratosis (ACK), Basal Cell Carcinoma (BCC), Melanoma (MEL), and Melanocytic Nevus (NEV). EffiCAT enhances traditional approaches by integrating features from two different convolutional neural networks, EfficientNet B0 and EfficientNet B4, through feature concatenation. This is followed by applying advanced attention modules, specifically a Dual Channel Attention Layer applied twice and a Convolutional Block Attention Module (CBAM), to refine feature representation and focus on relevant patterns more effectively. Our method is evaluated on a combined dataset composed of HAM10000 and PAD-UFES-20, which enhances the diversity and volume of training samples to improve generalization across various skin types and conditions. The inclusion of multiple datasets helps mitigate the biases associated with single-dataset training and enhances the robustness of the model. EffiCAT attained a test accuracy of 94.48%, with precision, recall, and F1 score all closely aligned at 94.48%. These metrics not only illustrate the efficacy of our method but also underscore its superiority in handling varied and complex skin disease presentations through refined attention-driven feature concatenation. Additionally, external validation was performed on the ISIC 2018 dataset, where the model achieved a test accuracy of 92.08%, with precision of 92.45%, recall of 92.08%, and an F1 score of 92.15%, further confirming its robustness and generalizability. The model’s architecture efficiently leverages concatenated features enriched with attention mechanisms, setting a new standard for image-based diagnostic models.</div></div>\",\"PeriodicalId\":55362,\"journal\":{\"name\":\"Biomedical Signal Processing and Control\",\"volume\":\"100 \",\"pages\":\"Article 107141\"},\"PeriodicalIF\":4.9000,\"publicationDate\":\"2024-11-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Biomedical Signal Processing and Control\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1746809424011996\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, BIOMEDICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomedical Signal Processing and Control","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1746809424011996","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0

摘要

皮肤病的早期准确诊断对于高效治疗和有效管理至关重要。传统方法通常依赖于使用单一数据集,这可能会引入偏差,并因数据集的特异性而限制模型的普适性。本研究提出了一种名为 EffiCAT(EfficientNet Concatenation Attention Technology)的新型混合模型,用于对皮肤病进行分类,尤其侧重于四类皮肤病,分别是角化性皮肤病(ACK)、基底细胞癌(BCC)、黑色素瘤(MEL)和黑素细胞痣(NEV)。EffiCAT 通过特征串联将来自两个不同卷积神经网络(EfficientNet B0 和 EfficientNet B4)的特征整合在一起,从而改进了传统方法。然后再应用高级注意力模块,特别是应用两次的双通道注意力层和卷积块注意力模块(CBAM),以完善特征表示并更有效地关注相关模式。我们的方法是在由 HAM10000 和 PAD-UFES-20 组成的组合数据集上进行评估的,该数据集增强了训练样本的多样性和数量,从而提高了在各种皮肤类型和条件下的泛化能力。包含多个数据集有助于减轻与单一数据集训练相关的偏差,并增强模型的鲁棒性。EffiCAT 的测试准确率达到 94.48%,精确度、召回率和 F1 分数均接近 94.48%。这些指标不仅说明了我们的方法的有效性,还强调了它在通过精炼的注意力驱动特征串联处理各种复杂的皮肤病表现方面的优越性。此外,我们还在 ISIC 2018 数据集上进行了外部验证,该模型的测试准确率为 92.08%,精确率为 92.45%,召回率为 92.08%,F1 得分为 92.15%,进一步证实了其稳健性和普适性。该模型的架构有效利用了富含注意力机制的串联特征,为基于图像的诊断模型设定了新标准。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
EffiCAT: A synergistic approach to skin disease classification through multi-dataset fusion and attention mechanisms
Early and accurate diagnosis of skin diseases is essential for their efficient treatment and effective management. Conventional approaches typically depend on the use of a single dataset, which can introduce biases and limit the generalizability of the models due to dataset-specific idiosyncrasies. This study presents a novel hybrid model, named EffiCAT (EfficientNet Concatenation Attention Technology), for the categorization of skin diseases, specifically focusing on four classes named Actinic Keratosis (ACK), Basal Cell Carcinoma (BCC), Melanoma (MEL), and Melanocytic Nevus (NEV). EffiCAT enhances traditional approaches by integrating features from two different convolutional neural networks, EfficientNet B0 and EfficientNet B4, through feature concatenation. This is followed by applying advanced attention modules, specifically a Dual Channel Attention Layer applied twice and a Convolutional Block Attention Module (CBAM), to refine feature representation and focus on relevant patterns more effectively. Our method is evaluated on a combined dataset composed of HAM10000 and PAD-UFES-20, which enhances the diversity and volume of training samples to improve generalization across various skin types and conditions. The inclusion of multiple datasets helps mitigate the biases associated with single-dataset training and enhances the robustness of the model. EffiCAT attained a test accuracy of 94.48%, with precision, recall, and F1 score all closely aligned at 94.48%. These metrics not only illustrate the efficacy of our method but also underscore its superiority in handling varied and complex skin disease presentations through refined attention-driven feature concatenation. Additionally, external validation was performed on the ISIC 2018 dataset, where the model achieved a test accuracy of 92.08%, with precision of 92.45%, recall of 92.08%, and an F1 score of 92.15%, further confirming its robustness and generalizability. The model’s architecture efficiently leverages concatenated features enriched with attention mechanisms, setting a new standard for image-based diagnostic models.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Biomedical Signal Processing and Control
Biomedical Signal Processing and Control 工程技术-工程:生物医学
CiteScore
9.80
自引率
13.70%
发文量
822
审稿时长
4 months
期刊介绍: Biomedical Signal Processing and Control aims to provide a cross-disciplinary international forum for the interchange of information on research in the measurement and analysis of signals and images in clinical medicine and the biological sciences. Emphasis is placed on contributions dealing with the practical, applications-led research on the use of methods and devices in clinical diagnosis, patient monitoring and management. Biomedical Signal Processing and Control reflects the main areas in which these methods are being used and developed at the interface of both engineering and clinical science. The scope of the journal is defined to include relevant review papers, technical notes, short communications and letters. Tutorial papers and special issues will also be published.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信