Convolutional Neural Network Image Classification Based on Different Color Spaces

IF 6.6 1区计算机科学 Q1 Multidisciplinary

Tsinghua Science and Technology Pub Date : 2024-09-11 DOI:10.26599/TST.2024.9010001

Zixiang Xian;Rubing Huang;Dave Towey;Chuan Yue

{"title":"Convolutional Neural Network Image Classification Based on Different Color Spaces","authors":"Zixiang Xian;Rubing Huang;Dave Towey;Chuan Yue","doi":"10.26599/TST.2024.9010001","DOIUrl":null,"url":null,"abstract":"Although Convolutional Neural Networks (CNNs) have achieved remarkable success in image classification, most CNNs use image datasets in the Red-Green-Blue (RGB) color space (one of the most commonly used color spaces). The existing literature regarding the influence of color space use on the performance of CNNs is limited. This paper explores the impact of different color spaces on image classification using CNNs. We compare the performance of five CNN models with different convolution operations and numbers of layers on four image datasets, each converted to nine color spaces. We find that color space selection can significantly affect classification accuracy, and that some classes are more sensitive to color space changes than others. Different color spaces may have different expression abilities for different image features, such as brightness, saturation, hue, etc. To leverage the complementary information from different color spaces, we propose a pseudo-Siamese network that fuses two color spaces without modifying the network architecture. Our experiments show that our proposed model can outperform the single-color-space models on most datasets. We also find that our method is simple, flexible, and compatible with any CNN and image dataset.","PeriodicalId":48690,"journal":{"name":"Tsinghua Science and Technology","volume":"30 1","pages":"402-417"},"PeriodicalIF":6.6000,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10676405","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Tsinghua Science and Technology","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10676405/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Multidisciplinary","Score":null,"Total":0}

引用次数: 0

Abstract

Although Convolutional Neural Networks (CNNs) have achieved remarkable success in image classification, most CNNs use image datasets in the Red-Green-Blue (RGB) color space (one of the most commonly used color spaces). The existing literature regarding the influence of color space use on the performance of CNNs is limited. This paper explores the impact of different color spaces on image classification using CNNs. We compare the performance of five CNN models with different convolution operations and numbers of layers on four image datasets, each converted to nine color spaces. We find that color space selection can significantly affect classification accuracy, and that some classes are more sensitive to color space changes than others. Different color spaces may have different expression abilities for different image features, such as brightness, saturation, hue, etc. To leverage the complementary information from different color spaces, we propose a pseudo-Siamese network that fuses two color spaces without modifying the network architecture. Our experiments show that our proposed model can outperform the single-color-space models on most datasets. We also find that our method is simple, flexible, and compatible with any CNN and image dataset.

查看原文本刊更多论文

基于不同色彩空间的卷积神经网络图像分类

虽然卷积神经网络（CNN）在图像分类方面取得了显著成就，但大多数 CNN 使用的图像数据集都是红绿蓝（RGB）色彩空间（最常用的色彩空间之一）。关于色彩空间的使用对 CNN 性能影响的现有文献十分有限。本文探讨了不同色彩空间对使用 CNN 进行图像分类的影响。我们在四个图像数据集上比较了具有不同卷积操作和层数的五个 CNN 模型的性能，每个数据集都转换为九种色彩空间。我们发现，色彩空间的选择会显著影响分类准确性，而且某些类别对色彩空间的变化比其他类别更敏感。对于不同的图像特征，如亮度、饱和度、色调等，不同的色彩空间可能有不同的表达能力。为了充分利用不同色彩空间的互补信息，我们提出了一种伪暹罗网络（pseudo-Siamese network），它能在不修改网络架构的情况下融合两种色彩空间。实验表明，我们提出的模型在大多数数据集上都优于单一色彩空间模型。我们还发现，我们的方法简单、灵活，可与任何 CNN 和图像数据集兼容。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Tsinghua Science and Technology COMPUTER SCIENCE, INFORMATION SYSTEMSCOMPU-COMPUTER SCIENCE, SOFTWARE ENGINEERING

CiteScore

10.20

自引率

10.60%

发文量

2340

期刊介绍： Tsinghua Science and Technology (Tsinghua Sci Technol) started publication in 1996. It is an international academic journal sponsored by Tsinghua University and is published bimonthly. This journal aims at presenting the up-to-date scientific achievements in computer science, electronic engineering, and other IT fields. Contributions all over the world are welcome.