Rabul Saikia, Sakshi Gupta, Anupam Sarma, Ngangbam Herojit Singh, Deepak Gupta, Muhammad Attique Khan, Ashit Kumar Dutta, Salam Shuleenda Devi
{"title":"mhals - net:一个基于正交SoftMax层的全局上下文感知的无注意力变压器网络,用于检测急性淋巴细胞白血病亚型","authors":"Rabul Saikia, Sakshi Gupta, Anupam Sarma, Ngangbam Herojit Singh, Deepak Gupta, Muhammad Attique Khan, Ashit Kumar Dutta, Salam Shuleenda Devi","doi":"10.1002/ima.70189","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Recent advancements in Deep Learning (DL) have enabled the development of Computer-Aided Diagnosis (CAD) systems for detecting Acute Lymphoblastic Leukemia (ALL) and its subtypes. However, this field faces challenges, particularly due to limited annotated datasets and a lack of efficient modalities. To address these issues, we propose MHAOSL-net, a novel hybrid DL model specifically designed for the accurate classification of B-ALL, T-ALL, and normal cells in blood smear images. The key contributions lie in the integration of four primary components: (1) lightweight MobileNetV2 for backbone feature extraction, (2) a Global Context Information Convolutional Block Attention Module (GCI-CBAM) for refined local representation using contextual information, (3) an Attention-Free Transformer (AFT) that captures global dependencies replacing traditional self-attention, and (4) an Orthogonal SoftMax Layer (OSL) that improves class separability by enforcing orthogonality in the decision space. This unified architecture not only reduces computational overhead but also improves classification performance and generalizability. To the best of our knowledge, this is the first framework that combines an AFT with an OSL in the context of leukemia subtype detection. The performance analysis of the proposed 3-class classification scheme has been assessed on two novel datasets, namely <i>BBCI_B&T_ALL_2024</i> and heterogeneous datasets. The experimental results show that the proposed scheme provides better performance, with 99.52% accuracy, 99.36% average precision, and 99.36% average F1-score on the <i>BBCI_B&T_ALL_2024</i>. Similarly, it achieves better performance with 99.55% accuracy, 99.40% average precision, and 99.40% average F1-score on the heterogeneous dataset. The qualitative investigation using Gradient-weighted Class Activation Mapping (Grad-CAM) visualization also confirms the efficacy of the proposed model for detecting B-ALL, T-ALL, and normal cells. The comparative studies establish the superiority of the proposed scheme over other state-of-the-art approaches. These findings indicate that MHAOSL-Net offers a promising and efficient solution for reliable ALL subtype detection in clinical settings.</p>\n </div>","PeriodicalId":14027,"journal":{"name":"International Journal of Imaging Systems and Technology","volume":"35 5","pages":""},"PeriodicalIF":2.5000,"publicationDate":"2025-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"MHAOSL-Net: A Global Context-Aware Attention Free Transformer Network With Orthogonal SoftMax Layer to Detect Subtypes of Acute Lymphoblastic Leukemia\",\"authors\":\"Rabul Saikia, Sakshi Gupta, Anupam Sarma, Ngangbam Herojit Singh, Deepak Gupta, Muhammad Attique Khan, Ashit Kumar Dutta, Salam Shuleenda Devi\",\"doi\":\"10.1002/ima.70189\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n <p>Recent advancements in Deep Learning (DL) have enabled the development of Computer-Aided Diagnosis (CAD) systems for detecting Acute Lymphoblastic Leukemia (ALL) and its subtypes. However, this field faces challenges, particularly due to limited annotated datasets and a lack of efficient modalities. To address these issues, we propose MHAOSL-net, a novel hybrid DL model specifically designed for the accurate classification of B-ALL, T-ALL, and normal cells in blood smear images. The key contributions lie in the integration of four primary components: (1) lightweight MobileNetV2 for backbone feature extraction, (2) a Global Context Information Convolutional Block Attention Module (GCI-CBAM) for refined local representation using contextual information, (3) an Attention-Free Transformer (AFT) that captures global dependencies replacing traditional self-attention, and (4) an Orthogonal SoftMax Layer (OSL) that improves class separability by enforcing orthogonality in the decision space. This unified architecture not only reduces computational overhead but also improves classification performance and generalizability. To the best of our knowledge, this is the first framework that combines an AFT with an OSL in the context of leukemia subtype detection. The performance analysis of the proposed 3-class classification scheme has been assessed on two novel datasets, namely <i>BBCI_B&T_ALL_2024</i> and heterogeneous datasets. The experimental results show that the proposed scheme provides better performance, with 99.52% accuracy, 99.36% average precision, and 99.36% average F1-score on the <i>BBCI_B&T_ALL_2024</i>. Similarly, it achieves better performance with 99.55% accuracy, 99.40% average precision, and 99.40% average F1-score on the heterogeneous dataset. The qualitative investigation using Gradient-weighted Class Activation Mapping (Grad-CAM) visualization also confirms the efficacy of the proposed model for detecting B-ALL, T-ALL, and normal cells. The comparative studies establish the superiority of the proposed scheme over other state-of-the-art approaches. These findings indicate that MHAOSL-Net offers a promising and efficient solution for reliable ALL subtype detection in clinical settings.</p>\\n </div>\",\"PeriodicalId\":14027,\"journal\":{\"name\":\"International Journal of Imaging Systems and Technology\",\"volume\":\"35 5\",\"pages\":\"\"},\"PeriodicalIF\":2.5000,\"publicationDate\":\"2025-08-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Imaging Systems and Technology\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/ima.70189\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Imaging Systems and Technology","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/ima.70189","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
MHAOSL-Net: A Global Context-Aware Attention Free Transformer Network With Orthogonal SoftMax Layer to Detect Subtypes of Acute Lymphoblastic Leukemia
Recent advancements in Deep Learning (DL) have enabled the development of Computer-Aided Diagnosis (CAD) systems for detecting Acute Lymphoblastic Leukemia (ALL) and its subtypes. However, this field faces challenges, particularly due to limited annotated datasets and a lack of efficient modalities. To address these issues, we propose MHAOSL-net, a novel hybrid DL model specifically designed for the accurate classification of B-ALL, T-ALL, and normal cells in blood smear images. The key contributions lie in the integration of four primary components: (1) lightweight MobileNetV2 for backbone feature extraction, (2) a Global Context Information Convolutional Block Attention Module (GCI-CBAM) for refined local representation using contextual information, (3) an Attention-Free Transformer (AFT) that captures global dependencies replacing traditional self-attention, and (4) an Orthogonal SoftMax Layer (OSL) that improves class separability by enforcing orthogonality in the decision space. This unified architecture not only reduces computational overhead but also improves classification performance and generalizability. To the best of our knowledge, this is the first framework that combines an AFT with an OSL in the context of leukemia subtype detection. The performance analysis of the proposed 3-class classification scheme has been assessed on two novel datasets, namely BBCI_B&T_ALL_2024 and heterogeneous datasets. The experimental results show that the proposed scheme provides better performance, with 99.52% accuracy, 99.36% average precision, and 99.36% average F1-score on the BBCI_B&T_ALL_2024. Similarly, it achieves better performance with 99.55% accuracy, 99.40% average precision, and 99.40% average F1-score on the heterogeneous dataset. The qualitative investigation using Gradient-weighted Class Activation Mapping (Grad-CAM) visualization also confirms the efficacy of the proposed model for detecting B-ALL, T-ALL, and normal cells. The comparative studies establish the superiority of the proposed scheme over other state-of-the-art approaches. These findings indicate that MHAOSL-Net offers a promising and efficient solution for reliable ALL subtype detection in clinical settings.
期刊介绍:
The International Journal of Imaging Systems and Technology (IMA) is a forum for the exchange of ideas and results relevant to imaging systems, including imaging physics and informatics. The journal covers all imaging modalities in humans and animals.
IMA accepts technically sound and scientifically rigorous research in the interdisciplinary field of imaging, including relevant algorithmic research and hardware and software development, and their applications relevant to medical research. The journal provides a platform to publish original research in structural and functional imaging.
The journal is also open to imaging studies of the human body and on animals that describe novel diagnostic imaging and analyses methods. Technical, theoretical, and clinical research in both normal and clinical populations is encouraged. Submissions describing methods, software, databases, replication studies as well as negative results are also considered.
The scope of the journal includes, but is not limited to, the following in the context of biomedical research:
Imaging and neuro-imaging modalities: structural MRI, functional MRI, PET, SPECT, CT, ultrasound, EEG, MEG, NIRS etc.;
Neuromodulation and brain stimulation techniques such as TMS and tDCS;
Software and hardware for imaging, especially related to human and animal health;
Image segmentation in normal and clinical populations;
Pattern analysis and classification using machine learning techniques;
Computational modeling and analysis;
Brain connectivity and connectomics;
Systems-level characterization of brain function;
Neural networks and neurorobotics;
Computer vision, based on human/animal physiology;
Brain-computer interface (BCI) technology;
Big data, databasing and data mining.