A. Sasithradevi, S. Kanimozhi, Parasa Sasidhar, Pavan Kumar Pulipati, Elavarthi Sruthi, P. Prakash
Biomedical Signal Processing and Control, Volume 100, Article 107141. Published 2024-11-11. DOI: 10.1016/j.bspc.2024.107141
Impact factor: 4.9. JCR: Q1 (Engineering, Biomedical).
https://www.sciencedirect.com/science/article/pii/S1746809424011996
EffiCAT: A synergistic approach to skin disease classification through multi-dataset fusion and attention mechanisms
Early and accurate diagnosis of skin diseases is essential for their efficient treatment and effective management. Conventional approaches typically depend on a single dataset, which can introduce biases and limit the generalizability of the models due to dataset-specific idiosyncrasies. This study presents a novel hybrid model, named EffiCAT (EfficientNet Concatenation Attention Technology), for the categorization of skin diseases, specifically focusing on four classes: Actinic Keratosis (ACK), Basal Cell Carcinoma (BCC), Melanoma (MEL), and Melanocytic Nevus (NEV). EffiCAT enhances traditional approaches by integrating features from two different convolutional neural networks, EfficientNet B0 and EfficientNet B4, through feature concatenation. This is followed by advanced attention modules, specifically a Dual Channel Attention Layer applied twice and a Convolutional Block Attention Module (CBAM), which refine the feature representation and focus on relevant patterns more effectively. Our method is evaluated on a combined dataset composed of HAM10000 and PAD-UFES-20, which enhances the diversity and volume of training samples to improve generalization across various skin types and conditions. The inclusion of multiple datasets helps mitigate the biases associated with single-dataset training and enhances the robustness of the model. EffiCAT attained a test accuracy of 94.48%, with precision, recall, and F1 score all closely aligned at 94.48%. These metrics not only illustrate the efficacy of our method but also underscore its superiority in handling varied and complex skin disease presentations through refined attention-driven feature concatenation. Additionally, external validation was performed on the ISIC 2018 dataset, where the model achieved a test accuracy of 92.08%, with precision of 92.45%, recall of 92.08%, and an F1 score of 92.15%, further confirming its robustness and generalizability. The model’s architecture efficiently leverages concatenated features enriched with attention mechanisms, setting a new standard for image-based diagnostic models.
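The abstract reports accuracy, precision, recall, and F1 for a four-class problem but does not say how the per-class scores were averaged. The sketch below is only an illustration of two common averaging schemes, not the authors' evaluation code. It assumes single-label predictions over the four classes, and the label lists are made-up toy data; the function names `confusion_matrix` and `summarize` are hypothetical. It shows two general properties that may help when reading the reported figures: micro-averaged precision, recall, and F1 all equal accuracy, and support-weighted recall equals accuracy.

```python
# Toy evaluation sketch (hypothetical data, not the paper's results).
CLASSES = ["ACK", "BCC", "MEL", "NEV"]  # class order is an assumption


def confusion_matrix(y_true, y_pred, num_classes):
    """Rows are true classes, columns are predicted classes."""
    cm = [[0] * num_classes for _ in range(num_classes)]
    for t, p in zip(y_true, y_pred):
        cm[t][p] += 1
    return cm


def summarize(cm):
    """Return accuracy plus micro- and support-weighted averages."""
    n = len(cm)
    total = sum(sum(row) for row in cm)
    correct = sum(cm[i][i] for i in range(n))

    tp = fp = fn = 0
    w_prec = w_rec = w_f1 = 0.0
    for i in range(n):
        support = sum(cm[i])                        # true samples of class i
        predicted = sum(cm[r][i] for r in range(n))  # samples predicted as i
        tp += cm[i][i]
        fp += predicted - cm[i][i]
        fn += support - cm[i][i]

        prec = cm[i][i] / predicted if predicted else 0.0
        rec = cm[i][i] / support if support else 0.0
        f1 = 2 * prec * rec / (prec + rec) if (prec + rec) else 0.0

        weight = support / total
        w_prec += weight * prec
        w_rec += weight * rec
        w_f1 += weight * f1

    micro_prec = tp / (tp + fp)
    micro_rec = tp / (tp + fn)
    micro_f1 = 2 * micro_prec * micro_rec / (micro_prec + micro_rec)
    return {
        "accuracy": correct / total,
        "micro": (micro_prec, micro_rec, micro_f1),
        "weighted": (w_prec, w_rec, w_f1),
    }


# Hypothetical labels: indices into CLASSES.
y_true = [0, 0, 1, 1, 1, 2, 2, 3, 3, 3]
y_pred = [0, 1, 1, 1, 2, 2, 2, 3, 3, 0]

cm = confusion_matrix(y_true, y_pred, len(CLASSES))
report = summarize(cm)
for name, value in report.items():
    print(name, value)
```

Under these assumptions, weighted precision and weighted F1 can differ from accuracy while weighted recall cannot, which is the pattern seen in the ISIC 2018 figures. Whether the authors used this scheme is not stated in the abstract.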
About the journal:
Biomedical Signal Processing and Control aims to provide a cross-disciplinary international forum for the interchange of information on research in the measurement and analysis of signals and images in clinical medicine and the biological sciences. Emphasis is placed on contributions dealing with the practical, applications-led research on the use of methods and devices in clinical diagnosis, patient monitoring and management.
Biomedical Signal Processing and Control reflects the main areas in which these methods are being used and developed at the interface of both engineering and clinical science. The scope of the journal is defined to include relevant review papers, technical notes, short communications and letters. Tutorial papers and special issues will also be published.