Longyan Xu , Peilong Li , Qiang Peng , Yifan Zhao , Lan Zhu , Sanhong Yuan
{"title":"基于多尺度特征融合的Yolov5道路车辆检测","authors":"Longyan Xu , Peilong Li , Qiang Peng , Yifan Zhao , Lan Zhu , Sanhong Yuan","doi":"10.1016/j.array.2025.100522","DOIUrl":null,"url":null,"abstract":"<div><div>This study focuses on Yolov5-based multi-head multi-scale adaptive feature fusion for vehicle detection to enhance the intelligence and refinement of road traffic safety management. As urbanization accelerates, road traffic problems are becoming increasingly serious. Accurate vehicle detection is crucial for traffic management to detect violations, monitor traffic flow, and prevent accidents in a timely manner. This paper proposes an improved Yolov5s-G model, which enhances the detection performance for small objects and improves the retention of feature information by introducing a small-object detection layer and a Weighted Cross-scale Fusion module (WCF), and an Adaptively Spatial Feature Fusion4(ASFF4) module. These enhancements enable the model to improve detection accuracy while maintaining moderate computational complexity. Specifically, the new small-object detection layer captures positional information of small objects more effectively, while the WCF module prevents the loss of small-object information during convolution through bidirectional cross-scale link feature fusion. Additionally, the ASFF4 module utilizes adaptive spatial feature fusion to further enhance the processing capability of feature information. Experimental results demonstrate that the improved Yolov5s-G model performs well on the vehicle detection dataset, with a mAP improvement of 9.3% compared to the original Yolov5 model. Furthermore, by introducing the knowledge distillation technique, the model has been significantly enhanced in terms of lightweighting.</div></div>","PeriodicalId":8417,"journal":{"name":"Array","volume":"28 ","pages":"Article 100522"},"PeriodicalIF":4.5000,"publicationDate":"2025-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Vehicle detection on roads based on Yolov5 with multi-scale feature fusion\",\"authors\":\"Longyan Xu , Peilong Li , Qiang Peng , Yifan Zhao , Lan Zhu , Sanhong Yuan\",\"doi\":\"10.1016/j.array.2025.100522\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>This study focuses on Yolov5-based multi-head multi-scale adaptive feature fusion for vehicle detection to enhance the intelligence and refinement of road traffic safety management. As urbanization accelerates, road traffic problems are becoming increasingly serious. Accurate vehicle detection is crucial for traffic management to detect violations, monitor traffic flow, and prevent accidents in a timely manner. This paper proposes an improved Yolov5s-G model, which enhances the detection performance for small objects and improves the retention of feature information by introducing a small-object detection layer and a Weighted Cross-scale Fusion module (WCF), and an Adaptively Spatial Feature Fusion4(ASFF4) module. These enhancements enable the model to improve detection accuracy while maintaining moderate computational complexity. Specifically, the new small-object detection layer captures positional information of small objects more effectively, while the WCF module prevents the loss of small-object information during convolution through bidirectional cross-scale link feature fusion. Additionally, the ASFF4 module utilizes adaptive spatial feature fusion to further enhance the processing capability of feature information. Experimental results demonstrate that the improved Yolov5s-G model performs well on the vehicle detection dataset, with a mAP improvement of 9.3% compared to the original Yolov5 model. Furthermore, by introducing the knowledge distillation technique, the model has been significantly enhanced in terms of lightweighting.</div></div>\",\"PeriodicalId\":8417,\"journal\":{\"name\":\"Array\",\"volume\":\"28 \",\"pages\":\"Article 100522\"},\"PeriodicalIF\":4.5000,\"publicationDate\":\"2025-09-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Array\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2590005625001493\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, THEORY & METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Array","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2590005625001493","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
Vehicle detection on roads based on Yolov5 with multi-scale feature fusion
This study focuses on Yolov5-based multi-head multi-scale adaptive feature fusion for vehicle detection to enhance the intelligence and refinement of road traffic safety management. As urbanization accelerates, road traffic problems are becoming increasingly serious. Accurate vehicle detection is crucial for traffic management to detect violations, monitor traffic flow, and prevent accidents in a timely manner. This paper proposes an improved Yolov5s-G model, which enhances the detection performance for small objects and improves the retention of feature information by introducing a small-object detection layer and a Weighted Cross-scale Fusion module (WCF), and an Adaptively Spatial Feature Fusion4(ASFF4) module. These enhancements enable the model to improve detection accuracy while maintaining moderate computational complexity. Specifically, the new small-object detection layer captures positional information of small objects more effectively, while the WCF module prevents the loss of small-object information during convolution through bidirectional cross-scale link feature fusion. Additionally, the ASFF4 module utilizes adaptive spatial feature fusion to further enhance the processing capability of feature information. Experimental results demonstrate that the improved Yolov5s-G model performs well on the vehicle detection dataset, with a mAP improvement of 9.3% compared to the original Yolov5 model. Furthermore, by introducing the knowledge distillation technique, the model has been significantly enhanced in terms of lightweighting.