Aswathy Elma Aby , S. Salaji , K.K. Anilkumar , Tintu Rajan
{"title":"Classification of acute myeloid leukemia by pre-trained deep neural networks: A comparison with different activation functions","authors":"Aswathy Elma Aby , S. Salaji , K.K. Anilkumar , Tintu Rajan","doi":"10.1016/j.medengphy.2024.104277","DOIUrl":null,"url":null,"abstract":"<div><div>Acute Myeloid Leukemia(AML) is a rapidly progressing cancer affecting blood and bone marrow, marked by the swift proliferation of abnormal myeloid cells. Effective treatment requires precise classification of AML subtypes. Conventional classification methods rely on manual microscopic analysis, which is time-consuming and variable, while traditional machine learning approaches often struggle with feature extraction and generalization. This underscores the need for automated detection methods which enhance this process by minimizing manual effort. Activation functions are critical in neural networks, introducing non-linearity that influences training convergence, computational efficiency, and model performance. This study evaluates the effectiveness of CNN architectures, Sequential (VGG16), Directed Acyclic Graph (InceptionV3), and Residual (ResNet50v2) in distinguishing between AML subtypes: AML without maturation, Acute Monoblastic Leukemia, and Pure Erythroid Leukemia, using peripheral blood smear images, while also investigating the impact of different activation functions on model accuracy and training time. The results show that ResNet50v2 achieves the shortest training time, while InceptionV3 takes the longest due to its complex architecture. GELU delivers the highest accuracy, reaching 94.02 % in InceptionV3 and 92.53 % in ResNet50v2, while SELU achieves the highest accuracy for VGG16 at 92.83 %. Mish provides competitive accuracy with lower training time than GELU, while Softplus and Softsign consistently perform poorly. This research demonstrates the potential of CNNs for automating AML subtype classification and identifies GELU as the most effective activation function. Future work could explore data augmentation, optimized activation functions, and attention mechanisms to improve classification performance.</div></div>","PeriodicalId":49836,"journal":{"name":"Medical Engineering & Physics","volume":"135 ","pages":"Article 104277"},"PeriodicalIF":1.7000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical Engineering & Physics","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1350453324001772","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0
Abstract
Acute Myeloid Leukemia(AML) is a rapidly progressing cancer affecting blood and bone marrow, marked by the swift proliferation of abnormal myeloid cells. Effective treatment requires precise classification of AML subtypes. Conventional classification methods rely on manual microscopic analysis, which is time-consuming and variable, while traditional machine learning approaches often struggle with feature extraction and generalization. This underscores the need for automated detection methods which enhance this process by minimizing manual effort. Activation functions are critical in neural networks, introducing non-linearity that influences training convergence, computational efficiency, and model performance. This study evaluates the effectiveness of CNN architectures, Sequential (VGG16), Directed Acyclic Graph (InceptionV3), and Residual (ResNet50v2) in distinguishing between AML subtypes: AML without maturation, Acute Monoblastic Leukemia, and Pure Erythroid Leukemia, using peripheral blood smear images, while also investigating the impact of different activation functions on model accuracy and training time. The results show that ResNet50v2 achieves the shortest training time, while InceptionV3 takes the longest due to its complex architecture. GELU delivers the highest accuracy, reaching 94.02 % in InceptionV3 and 92.53 % in ResNet50v2, while SELU achieves the highest accuracy for VGG16 at 92.83 %. Mish provides competitive accuracy with lower training time than GELU, while Softplus and Softsign consistently perform poorly. This research demonstrates the potential of CNNs for automating AML subtype classification and identifies GELU as the most effective activation function. Future work could explore data augmentation, optimized activation functions, and attention mechanisms to improve classification performance.
期刊介绍:
Medical Engineering & Physics provides a forum for the publication of the latest developments in biomedical engineering, and reflects the essential multidisciplinary nature of the subject. The journal publishes in-depth critical reviews, scientific papers and technical notes. Our focus encompasses the application of the basic principles of physics and engineering to the development of medical devices and technology, with the ultimate aim of producing improvements in the quality of health care.Topics covered include biomechanics, biomaterials, mechanobiology, rehabilitation engineering, biomedical signal processing and medical device development. Medical Engineering & Physics aims to keep both engineers and clinicians abreast of the latest applications of technology to health care.