YOLOv8-MAH: Multi-attribute recognition model for Vehicles

IF 7.5 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Pattern Recognition Pub Date : 2025-05-13 DOI:10.1016/j.patcog.2025.111849

Yazhou Zhao , Hongdong Zhao , Jianfeng Shi

{"title":"YOLOv8-MAH: Multi-attribute recognition model for Vehicles","authors":"Yazhou Zhao , Hongdong Zhao , Jianfeng Shi","doi":"10.1016/j.patcog.2025.111849","DOIUrl":null,"url":null,"abstract":"<div><div>Vehicle multi-attribute recognition tasks have been increasingly used in intelligent traffic management, but the intra-class variability and inter-class similarity among vehicles bring great difficulties to vehicle multi-attribute recognition. To address this challenge, this paper proposes an improved model named YOLOv8-MAH (YOLOv8 Multi-Attribute-Head), which aims to enhance the performance of multi-attribute recognition. In order to utilize the ability of transformer encoder to accurately obtain detailed information, the C2f (CSP Bottleneck 2 Convolution) module in the backbone network is replaced by the global channel module of MobileViT, at the same time the C2f-E module based on the EMA architecture (Efficient Multi-scale Attention) is designed to improve the ability of the network to recognize different attributes, and we also add an additional detection layer to better extract information from the detailed part of the image to identify more attributes. Furthermore, our self-made dataset is labeled in three perspectives: vehicle brand, color, and direction, and is divided into 144 categories. The experiment results show that the YOLOv8-MAH significantly achieves good performance in the vehicle multi-recognition task.</div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"167 ","pages":"Article 111849"},"PeriodicalIF":7.5000,"publicationDate":"2025-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0031320325005096","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Vehicle multi-attribute recognition tasks have been increasingly used in intelligent traffic management, but the intra-class variability and inter-class similarity among vehicles bring great difficulties to vehicle multi-attribute recognition. To address this challenge, this paper proposes an improved model named YOLOv8-MAH (YOLOv8 Multi-Attribute-Head), which aims to enhance the performance of multi-attribute recognition. In order to utilize the ability of transformer encoder to accurately obtain detailed information, the C2f (CSP Bottleneck 2 Convolution) module in the backbone network is replaced by the global channel module of MobileViT, at the same time the C2f-E module based on the EMA architecture (Efficient Multi-scale Attention) is designed to improve the ability of the network to recognize different attributes, and we also add an additional detection layer to better extract information from the detailed part of the image to identify more attributes. Furthermore, our self-made dataset is labeled in three perspectives: vehicle brand, color, and direction, and is divided into 144 categories. The experiment results show that the YOLOv8-MAH significantly achieves good performance in the vehicle multi-recognition task.

查看原文本刊更多论文

YOLOv8-MAH：车辆多属性识别模型

车辆多属性识别任务在智能交通管理中的应用越来越广泛，但车辆类内变异性和类间相似性给车辆多属性识别带来了很大的困难。针对这一挑战，本文提出了一种改进的YOLOv8- mah （YOLOv8 multi-attribute - head）模型，旨在提高多属性识别的性能。为了利用变压器编码器准确获取详细信息的能力，将骨干网中的C2f （CSP瓶颈2卷积）模块替换为MobileViT的全局信道模块，同时设计基于EMA （Efficient Multi-scale Attention）架构的C2f- e模块，提高网络对不同属性的识别能力。我们还增加了一个额外的检测层，以便更好地从图像的细节部分提取信息，以识别更多的属性。此外，我们的自制数据集从三个角度进行了标记：汽车品牌、颜色和方向，并分为144个类别。实验结果表明，YOLOv8-MAH在车辆多识别任务中取得了较好的效果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Pattern Recognition 工程技术-工程：电子与电气

CiteScore

14.40

自引率

16.20%

发文量

683

审稿时长

5.6 months

期刊介绍： The field of Pattern Recognition is both mature and rapidly evolving, playing a crucial role in various related fields such as computer vision, image processing, text analysis, and neural networks. It closely intersects with machine learning and is being applied in emerging areas like biometrics, bioinformatics, multimedia data analysis, and data science. The journal Pattern Recognition, established half a century ago during the early days of computer science, has since grown significantly in scope and influence.