基于多尺度卷积神经网络的图像分类方法

IF 0.9 4区 工程技术 Q4 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE
Shaobo Du, Jing Li
{"title":"基于多尺度卷积神经网络的图像分类方法","authors":"Shaobo Du, Jing Li","doi":"10.1142/s021812662450186x","DOIUrl":null,"url":null,"abstract":"<p>Traditional convolutional neural networks (CNNs) typically use fixed scale convolutional kernels for feature extraction when processing image classification tasks, while ignoring the multi-scale information present in the image. To overcome this limitation, we propose an algorithm based on multi-scale CNNs, which capture features at different levels by introducing convolutional kernels of different scales into the convolutional layer. In this study, we first designed a multi-scale convolutional layer consisting of multiple convolutional kernels of different scales to extract multi-scale features of the image. To further enhance classification performance, we introduced a multi-scale feature fusion module that can effectively fuse features of different scales and classify them through a fully connected layer. Then we conducted extensive experiments on several commonly used image classification datasets. The experimental results show that this network can not only effectively identify and locate hyperspectral image targets in different scenarios, but also reduce missed detections and false positives during the detection process. The average accuracy of the improved model has been improved, and the recognition accuracy of some small markers affected by external factors such as occlusion and lighting has also been improved. In addition, by comparing the detection effect of a single image, the progressiveness and anti-leakage ability of the improved model are proved. The image classification method based on multi-scale CNNs has broad application prospects in image recognition and feature extraction, and can provide valuable reference and reference for research in related fields.</p>","PeriodicalId":54866,"journal":{"name":"Journal of Circuits Systems and Computers","volume":"13 1","pages":""},"PeriodicalIF":0.9000,"publicationDate":"2024-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Image Classification Method Based on Multi-Scale Convolutional Neural Network\",\"authors\":\"Shaobo Du, Jing Li\",\"doi\":\"10.1142/s021812662450186x\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Traditional convolutional neural networks (CNNs) typically use fixed scale convolutional kernels for feature extraction when processing image classification tasks, while ignoring the multi-scale information present in the image. To overcome this limitation, we propose an algorithm based on multi-scale CNNs, which capture features at different levels by introducing convolutional kernels of different scales into the convolutional layer. In this study, we first designed a multi-scale convolutional layer consisting of multiple convolutional kernels of different scales to extract multi-scale features of the image. To further enhance classification performance, we introduced a multi-scale feature fusion module that can effectively fuse features of different scales and classify them through a fully connected layer. Then we conducted extensive experiments on several commonly used image classification datasets. The experimental results show that this network can not only effectively identify and locate hyperspectral image targets in different scenarios, but also reduce missed detections and false positives during the detection process. The average accuracy of the improved model has been improved, and the recognition accuracy of some small markers affected by external factors such as occlusion and lighting has also been improved. In addition, by comparing the detection effect of a single image, the progressiveness and anti-leakage ability of the improved model are proved. The image classification method based on multi-scale CNNs has broad application prospects in image recognition and feature extraction, and can provide valuable reference and reference for research in related fields.</p>\",\"PeriodicalId\":54866,\"journal\":{\"name\":\"Journal of Circuits Systems and Computers\",\"volume\":\"13 1\",\"pages\":\"\"},\"PeriodicalIF\":0.9000,\"publicationDate\":\"2024-03-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Circuits Systems and Computers\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1142/s021812662450186x\",\"RegionNum\":4,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Circuits Systems and Computers","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1142/s021812662450186x","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
引用次数: 0

摘要

传统的卷积神经网络(CNN)在处理图像分类任务时,通常使用固定尺度的卷积核进行特征提取,而忽略了图像中存在的多尺度信息。为了克服这一局限,我们提出了一种基于多尺度 CNN 的算法,通过在卷积层中引入不同尺度的卷积核来捕捉不同层次的特征。在这项研究中,我们首先设计了一个由多个不同尺度卷积核组成的多尺度卷积层,以提取图像的多尺度特征。为了进一步提高分类性能,我们引入了多尺度特征融合模块,该模块能有效融合不同尺度的特征,并通过全连接层进行分类。然后,我们在几个常用的图像分类数据集上进行了大量实验。实验结果表明,该网络不仅能有效识别和定位不同场景下的高光谱图像目标,还能减少检测过程中的漏检和误报。改进后模型的平均准确率得到了提高,一些受遮挡和光照等外部因素影响的小标记的识别准确率也得到了提高。此外,通过对比单幅图像的检测效果,证明了改进模型的渐进性和抗泄漏能力。基于多尺度 CNN 的图像分类方法在图像识别和特征提取方面具有广阔的应用前景,可为相关领域的研究提供有价值的参考和借鉴。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Image Classification Method Based on Multi-Scale Convolutional Neural Network

Traditional convolutional neural networks (CNNs) typically use fixed scale convolutional kernels for feature extraction when processing image classification tasks, while ignoring the multi-scale information present in the image. To overcome this limitation, we propose an algorithm based on multi-scale CNNs, which capture features at different levels by introducing convolutional kernels of different scales into the convolutional layer. In this study, we first designed a multi-scale convolutional layer consisting of multiple convolutional kernels of different scales to extract multi-scale features of the image. To further enhance classification performance, we introduced a multi-scale feature fusion module that can effectively fuse features of different scales and classify them through a fully connected layer. Then we conducted extensive experiments on several commonly used image classification datasets. The experimental results show that this network can not only effectively identify and locate hyperspectral image targets in different scenarios, but also reduce missed detections and false positives during the detection process. The average accuracy of the improved model has been improved, and the recognition accuracy of some small markers affected by external factors such as occlusion and lighting has also been improved. In addition, by comparing the detection effect of a single image, the progressiveness and anti-leakage ability of the improved model are proved. The image classification method based on multi-scale CNNs has broad application prospects in image recognition and feature extraction, and can provide valuable reference and reference for research in related fields.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Circuits Systems and Computers
Journal of Circuits Systems and Computers 工程技术-工程:电子与电气
CiteScore
2.80
自引率
26.70%
发文量
350
审稿时长
5.4 months
期刊介绍: Journal of Circuits, Systems, and Computers covers a wide scope, ranging from mathematical foundations to practical engineering design in the general areas of circuits, systems, and computers with focus on their circuit aspects. Although primary emphasis will be on research papers, survey, expository and tutorial papers are also welcome. The journal consists of two sections: Papers - Contributions in this section may be of a research or tutorial nature. Research papers must be original and must not duplicate descriptions or derivations available elsewhere. The author should limit paper length whenever this can be done without impairing quality. Letters - This section provides a vehicle for speedy publication of new results and information of current interest in circuits, systems, and computers. Focus will be directed to practical design- and applications-oriented contributions, but publication in this section will not be restricted to this material. These letters are to concentrate on reporting the results obtained, their significance and the conclusions, while including only the minimum of supporting details required to understand the contribution. Publication of a manuscript in this manner does not preclude a later publication with a fully developed version.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信