Syed Shakir Hameed Shah , Norziana Jamil , Atta ur Rehman Khan , Lariyah Mohd Sidek , Nazik Alturki , Zuhaira Muhammad Zain
{"title":"MalRed: An innovative approach for detecting malware using the red channel analysis of color images","authors":"Syed Shakir Hameed Shah , Norziana Jamil , Atta ur Rehman Khan , Lariyah Mohd Sidek , Nazik Alturki , Zuhaira Muhammad Zain","doi":"10.1016/j.eij.2024.100478","DOIUrl":null,"url":null,"abstract":"<div><p>Technological advancements have significantly progressed and innovated across various industries. However, these advancements have also led to an increase in cyberattacks using malware. Researchers have developed diverse techniques to detect and classify malware, including a visualization-based approach that converts suspicious files into color or grayscale images, eliminating the need for domain-specific expertise. Nonetheless, converting files to color and grayscale images may result in the loss of texture details due to noise, adversely affecting the performance of machine learning models. The aim of this study is to present to assess the texture features and noise contributions of the red, green, and blue channels in color images. We propose a novel method to enhance model performance in terms of accuracy, precision, recall, f1-score, memory utilization, and computing cost during testing and training. This study introduces an approach involves separating color channels into individual red, green, and blue datasets and using various Discrete Wavelet Transform levels to reduce dimensions and remove noise. The extracted texture characteristics are then used as input for machine learning classifiers for training and prediction. Through comprehensive evaluation, we compare the performance of grayscale images with that of the red, green, and blue datasets. The results show that grayscale images significantly lose critical textural artifacts and perform worse than the color channels. Notably, employing extra tree classifiers yielded the best results, achieving an accuracy of 98.37%, precision of 98.64%, recall of 97.60%, and an f1-score of 98.04% with the red channel of color dataset.</p></div>","PeriodicalId":56010,"journal":{"name":"Egyptian Informatics Journal","volume":null,"pages":null},"PeriodicalIF":5.0000,"publicationDate":"2024-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1110866524000410/pdfft?md5=45302b24c7f2d0c312cdb11107ac452a&pid=1-s2.0-S1110866524000410-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Egyptian Informatics Journal","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1110866524000410","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Technological advancements have significantly progressed and innovated across various industries. However, these advancements have also led to an increase in cyberattacks using malware. Researchers have developed diverse techniques to detect and classify malware, including a visualization-based approach that converts suspicious files into color or grayscale images, eliminating the need for domain-specific expertise. Nonetheless, converting files to color and grayscale images may result in the loss of texture details due to noise, adversely affecting the performance of machine learning models. The aim of this study is to present to assess the texture features and noise contributions of the red, green, and blue channels in color images. We propose a novel method to enhance model performance in terms of accuracy, precision, recall, f1-score, memory utilization, and computing cost during testing and training. This study introduces an approach involves separating color channels into individual red, green, and blue datasets and using various Discrete Wavelet Transform levels to reduce dimensions and remove noise. The extracted texture characteristics are then used as input for machine learning classifiers for training and prediction. Through comprehensive evaluation, we compare the performance of grayscale images with that of the red, green, and blue datasets. The results show that grayscale images significantly lose critical textural artifacts and perform worse than the color channels. Notably, employing extra tree classifiers yielded the best results, achieving an accuracy of 98.37%, precision of 98.64%, recall of 97.60%, and an f1-score of 98.04% with the red channel of color dataset.
期刊介绍:
The Egyptian Informatics Journal is published by the Faculty of Computers and Artificial Intelligence, Cairo University. This Journal provides a forum for the state-of-the-art research and development in the fields of computing, including computer sciences, information technologies, information systems, operations research and decision support. Innovative and not-previously-published work in subjects covered by the Journal is encouraged to be submitted, whether from academic, research or commercial sources.