Multiclass blood cancer classification using deep CNN with optimized features
Wahidur Rahman, Mohammad Gazi Golam Faruque, Kaniz Roksana, A H M Saifullah Sadi, Mohammad Motiur Rahman, Mir Mohammad Azad
Array, vol. 18, Article 100292 (published 2023-07-01)
DOI: 10.1016/j.array.2023.100292
URL: https://www.sciencedirect.com/science/article/pii/S2590005623000176
Citations: 3
Abstract
Cancer is a group of diseases in which cells proliferate uncontrollably within the body; breast cancer, lung cancer, skin cancer, and blood malignancies such as leukemia and lymphoma are only a few examples. Acute lymphoblastic leukemia is one of the significant forms of malignancy. Diagnosing blood cancer manually is time-consuming, and hematologists frequently make errors in the process. This research therefore presents a novel method for classifying leukemia with the aid of Machine Learning and Deep Learning. The proposed pipeline consists of several interconnected stages: dataset building, feature extraction from individual blood cell images with pre-trained Convolutional Neural Network (CNN) architectures, and classification with conventional classifiers. The dataset is first divided into two categories, Benign and Malignant, and then reorganized into four classes: Benign and the three malignant subtypes Early Pre-B, Pre-B, and Pro-B. The study first extracts features from the individual images with the CNN models and then passes them to feature selection methods, namely Principal Component Analysis (PCA), Linear Discriminant Analysis (LDA), and an SVC feature selector, along with two nature-inspired algorithms, Particle Swarm Optimization (PSO) and Cat Swarm Optimization (CSO). Seven Machine Learning classifiers are then applied to perform the multi-class classification. To assess the efficacy of the proposed architecture, a set of experimental results is reported and interpreted. Using only pre-trained CNNs and classifiers, the study obtained a maximum accuracy of 98.43%. After incorporating PSO and CSO, the proposed model achieved its highest accuracy of 99.84% by combining the ResNet50 CNN architecture, the SVC feature selector, and the LR classifier. Although the model attains high accuracy, it has some drawbacks. Nevertheless, the proposed model may be helpful for real-world blood cancer classification.
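The abstract names PSO as one of the feature selection steps but does not state which variant, fitness function, or parameters were used. As a rough illustration of the general idea only, the sketch below runs a generic binary PSO over feature masks. Everything in it is an assumption made for the example: the synthetic vectors stand in for CNN features, a nearest-centroid rule stands in for the paper's classifiers, and all function names and parameter values are hypothetical.

```python
import math
import random


def make_toy_features(n_per_class=30, n_features=12, n_informative=3,
                      n_classes=4, seed=0):
    """Synthetic stand-in for CNN feature vectors (NOT the paper's data).
    Only the first n_informative columns carry class signal."""
    rng = random.Random(seed)
    X, y = [], []
    for label in range(n_classes):
        for _ in range(n_per_class):
            row = [rng.gauss(0.0, 1.0) for _ in range(n_features)]
            for j in range(n_informative):
                row[j] += 2.0 * label  # shift the class mean
            X.append(row)
            y.append(label)
    return X, y


def centroid_accuracy(X_train, y_train, X_test, y_test, mask):
    """Accuracy of a nearest-centroid rule restricted to the masked columns."""
    cols = [j for j, keep in enumerate(mask) if keep]
    if not cols:
        return 0.0
    sums, counts = {}, {}
    for row, label in zip(X_train, y_train):
        acc = sums.setdefault(label, [0.0] * len(cols))
        for k, j in enumerate(cols):
            acc[k] += row[j]
        counts[label] = counts.get(label, 0) + 1
    centroids = {c: [s / counts[c] for s in sums[c]] for c in sums}
    correct = 0
    for row, label in zip(X_test, y_test):
        pred = min(centroids, key=lambda c: sum(
            (row[j] - centroids[c][k]) ** 2 for k, j in enumerate(cols)))
        correct += pred == label
    return correct / len(y_test)


def binary_pso(fitness, n_features, n_particles=10, n_iters=20,
               w=0.7, c1=1.5, c2=1.5, v_max=4.0, seed=0):
    """Generic binary PSO: real-valued velocities, bits sampled via a sigmoid."""
    rng = random.Random(seed)
    pos = [[rng.randint(0, 1) for _ in range(n_features)]
           for _ in range(n_particles)]
    vel = [[rng.uniform(-1.0, 1.0) for _ in range(n_features)]
           for _ in range(n_particles)]
    pbest = [p[:] for p in pos]
    pbest_fit = [fitness(p) for p in pos]
    g = max(range(n_particles), key=lambda i: pbest_fit[i])
    gbest, gbest_fit = pbest[g][:], pbest_fit[g]
    for _ in range(n_iters):
        for i in range(n_particles):
            for j in range(n_features):
                r1, r2 = rng.random(), rng.random()
                v = (w * vel[i][j]
                     + c1 * r1 * (pbest[i][j] - pos[i][j])
                     + c2 * r2 * (gbest[j] - pos[i][j]))
                vel[i][j] = max(-v_max, min(v_max, v))  # clamp velocity
                prob_one = 1.0 / (1.0 + math.exp(-vel[i][j]))
                pos[i][j] = 1 if rng.random() < prob_one else 0
            fit = fitness(pos[i])
            if fit > pbest_fit[i]:
                pbest[i], pbest_fit[i] = pos[i][:], fit
                if fit > gbest_fit:
                    gbest, gbest_fit = pos[i][:], fit
    return gbest, gbest_fit


X, y = make_toy_features()
# Even-indexed rows train the centroids, odd-indexed rows score the mask.
X_train, y_train = X[0::2], y[0::2]
X_test, y_test = X[1::2], y[1::2]


def fitness(mask):
    # Hypothetical objective: held-out accuracy minus a small size penalty.
    return centroid_accuracy(X_train, y_train, X_test, y_test, mask) - 0.01 * sum(mask)


best_mask, best_fit = binary_pso(fitness, n_features=len(X[0]))
print("selected feature indices:", [j for j, b in enumerate(best_mask) if b])
print("penalised fitness:", round(best_fit, 4))
```

In a real pipeline the toy vectors would be replaced by features extracted with a pre-trained CNN such as ResNet50, and the fitness would be computed with a properly cross-validated classifier. The size penalty of 0.01 per feature is an arbitrary choice made here to favour compact masks.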