YOLOv8-MAH: Multi-attribute recognition model for Vehicles

IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Yazhou Zhao , Hongdong Zhao , Jianfeng Shi
{"title":"YOLOv8-MAH: Multi-attribute recognition model for Vehicles","authors":"Yazhou Zhao ,&nbsp;Hongdong Zhao ,&nbsp;Jianfeng Shi","doi":"10.1016/j.patcog.2025.111849","DOIUrl":null,"url":null,"abstract":"<div><div>Vehicle multi-attribute recognition tasks have been increasingly used in intelligent traffic management, but the intra-class variability and inter-class similarity among vehicles bring great difficulties to vehicle multi-attribute recognition. To address this challenge, this paper proposes an improved model named YOLOv8-MAH (YOLOv8 Multi-Attribute-Head), which aims to enhance the performance of multi-attribute recognition. In order to utilize the ability of transformer encoder to accurately obtain detailed information, the C2f (CSP Bottleneck 2 Convolution) module in the backbone network is replaced by the global channel module of MobileViT, at the same time the C2f-E module based on the EMA architecture (Efficient Multi-scale Attention) is designed to improve the ability of the network to recognize different attributes, and we also add an additional detection layer to better extract information from the detailed part of the image to identify more attributes. Furthermore, our self-made dataset is labeled in three perspectives: vehicle brand, color, and direction, and is divided into 144 categories. The experiment results show that the YOLOv8-MAH significantly achieves good performance in the vehicle multi-recognition task.</div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"167 ","pages":"Article 111849"},"PeriodicalIF":7.5000,"publicationDate":"2025-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0031320325005096","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Vehicle multi-attribute recognition tasks have been increasingly used in intelligent traffic management, but the intra-class variability and inter-class similarity among vehicles bring great difficulties to vehicle multi-attribute recognition. To address this challenge, this paper proposes an improved model named YOLOv8-MAH (YOLOv8 Multi-Attribute-Head), which aims to enhance the performance of multi-attribute recognition. In order to utilize the ability of transformer encoder to accurately obtain detailed information, the C2f (CSP Bottleneck 2 Convolution) module in the backbone network is replaced by the global channel module of MobileViT, at the same time the C2f-E module based on the EMA architecture (Efficient Multi-scale Attention) is designed to improve the ability of the network to recognize different attributes, and we also add an additional detection layer to better extract information from the detailed part of the image to identify more attributes. Furthermore, our self-made dataset is labeled in three perspectives: vehicle brand, color, and direction, and is divided into 144 categories. The experiment results show that the YOLOv8-MAH significantly achieves good performance in the vehicle multi-recognition task.
YOLOv8-MAH:车辆多属性识别模型
车辆多属性识别任务在智能交通管理中的应用越来越广泛,但车辆类内变异性和类间相似性给车辆多属性识别带来了很大的困难。针对这一挑战,本文提出了一种改进的YOLOv8- mah (YOLOv8 multi-attribute - head)模型,旨在提高多属性识别的性能。为了利用变压器编码器准确获取详细信息的能力,将骨干网中的C2f (CSP瓶颈2卷积)模块替换为MobileViT的全局信道模块,同时设计基于EMA (Efficient Multi-scale Attention)架构的C2f- e模块,提高网络对不同属性的识别能力。我们还增加了一个额外的检测层,以便更好地从图像的细节部分提取信息,以识别更多的属性。此外,我们的自制数据集从三个角度进行了标记:汽车品牌、颜色和方向,并分为144个类别。实验结果表明,YOLOv8-MAH在车辆多识别任务中取得了较好的效果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Pattern Recognition
Pattern Recognition 工程技术-工程:电子与电气
CiteScore
14.40
自引率
16.20%
发文量
683
审稿时长
5.6 months
期刊介绍: The field of Pattern Recognition is both mature and rapidly evolving, playing a crucial role in various related fields such as computer vision, image processing, text analysis, and neural networks. It closely intersects with machine learning and is being applied in emerging areas like biometrics, bioinformatics, multimedia data analysis, and data science. The journal Pattern Recognition, established half a century ago during the early days of computer science, has since grown significantly in scope and influence.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信