基于YOLOv5的实时定向检测器

X. Li, Zhenhua Cai, Xi Zhao
{"title":"基于YOLOv5的实时定向检测器","authors":"X. Li, Zhenhua Cai, Xi Zhao","doi":"10.1109/icccs55155.2022.9846234","DOIUrl":null,"url":null,"abstract":"Current state-of-the-art oriented detectors have time-consuming feature extraction backbones, oriented proposals generation methods or additional special branches. These tricks increase the computational cost of oriented detectors and prevent them from some practical applications. YOLOv5 is one of the best real-time (inference time⩽33.3ms, or FPS⩾30) detectors in the field of general target detection that can be applied in various real tasks. However, YOLOv5 does not output the angular prediction that is crucial to reflect attitudes and shapes of the targets. We propose Oriented-YOLOv5 that can output angular prediction of rotated target and achieve real-time detection in aerial images. Specifically, we integrate Circular Smooth Label (CSL) into YOLOv5 (v5.0), so it inherits both the fast and lightweight features of YOLOv5 and the high performance of CSL for angle detection. Experimental results indicate that Oriented-YOLOv5s achieves an accuracy of 68.24% mAP and an efficiency of 11.8ms with aerial images of 1024×1024 input size. It can be further improved (68.86% mAP and 10.8ms) using Conv instead of Focus. We also integrate CSL into YOLOv5m and YOLOv5l, and they achieve improved accuracies (70.03% mAP and 71.2% mAP, respectively) and retain real-time inference speed. Overall, Oriented-YOLOv5 is well suited for detection tasks that have vertical view and require angular prediction with highly real-time and limited memory, so it can be used for applications deployed to satellites, UAVs, or robotic arms. Source code: https://github.com/Leesanjin/Oriented-YOLOv5","PeriodicalId":121713,"journal":{"name":"2022 7th International Conference on Computer and Communication Systems (ICCCS)","volume":"102 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Oriented-YOLOv5: A Real-time Oriented Detector Based on YOLOv5\",\"authors\":\"X. Li, Zhenhua Cai, Xi Zhao\",\"doi\":\"10.1109/icccs55155.2022.9846234\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Current state-of-the-art oriented detectors have time-consuming feature extraction backbones, oriented proposals generation methods or additional special branches. These tricks increase the computational cost of oriented detectors and prevent them from some practical applications. YOLOv5 is one of the best real-time (inference time⩽33.3ms, or FPS⩾30) detectors in the field of general target detection that can be applied in various real tasks. However, YOLOv5 does not output the angular prediction that is crucial to reflect attitudes and shapes of the targets. We propose Oriented-YOLOv5 that can output angular prediction of rotated target and achieve real-time detection in aerial images. Specifically, we integrate Circular Smooth Label (CSL) into YOLOv5 (v5.0), so it inherits both the fast and lightweight features of YOLOv5 and the high performance of CSL for angle detection. Experimental results indicate that Oriented-YOLOv5s achieves an accuracy of 68.24% mAP and an efficiency of 11.8ms with aerial images of 1024×1024 input size. It can be further improved (68.86% mAP and 10.8ms) using Conv instead of Focus. We also integrate CSL into YOLOv5m and YOLOv5l, and they achieve improved accuracies (70.03% mAP and 71.2% mAP, respectively) and retain real-time inference speed. Overall, Oriented-YOLOv5 is well suited for detection tasks that have vertical view and require angular prediction with highly real-time and limited memory, so it can be used for applications deployed to satellites, UAVs, or robotic arms. Source code: https://github.com/Leesanjin/Oriented-YOLOv5\",\"PeriodicalId\":121713,\"journal\":{\"name\":\"2022 7th International Conference on Computer and Communication Systems (ICCCS)\",\"volume\":\"102 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-04-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 7th International Conference on Computer and Communication Systems (ICCCS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/icccs55155.2022.9846234\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 7th International Conference on Computer and Communication Systems (ICCCS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/icccs55155.2022.9846234","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

当前最先进的面向检测器存在耗时的特征提取主干、面向提议生成方法或额外的特殊分支。这些技巧增加了定向检测器的计算成本,阻碍了定向检测器的实际应用。YOLOv5是一般目标检测领域中最好的实时(推断时间≥33.3ms,或FPS小于或等于30)探测器之一,可应用于各种实际任务。然而,YOLOv5不输出对反映目标的姿态和形状至关重要的角度预测。我们提出了一种能够输出旋转目标的角度预测并实现航拍图像实时检测的Oriented-YOLOv5。具体来说,我们将圆形平滑标签(CSL)集成到YOLOv5 (v5.0)中,因此它既继承了YOLOv5快速轻量级的特点,又继承了CSL在角度检测方面的高性能。实验结果表明,当航测图像输入尺寸为1024×1024时,orient - yolov5s的mAP识别精度为68.24%,效率为11.8ms。使用Conv而不是Focus可以进一步提高(68.86% mAP和10.8ms)。我们还将CSL集成到YOLOv5m和YOLOv5l中,它们获得了更高的精度(分别为70.03% mAP和71.2% mAP),并保持了实时推理速度。总体而言,Oriented-YOLOv5非常适合具有垂直视图和需要高度实时和有限内存的角度预测的探测任务,因此它可用于部署到卫星,无人机或机械臂的应用。源代码:https://github.com/Leesanjin/Oriented-YOLOv5
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Oriented-YOLOv5: A Real-time Oriented Detector Based on YOLOv5
Current state-of-the-art oriented detectors have time-consuming feature extraction backbones, oriented proposals generation methods or additional special branches. These tricks increase the computational cost of oriented detectors and prevent them from some practical applications. YOLOv5 is one of the best real-time (inference time⩽33.3ms, or FPS⩾30) detectors in the field of general target detection that can be applied in various real tasks. However, YOLOv5 does not output the angular prediction that is crucial to reflect attitudes and shapes of the targets. We propose Oriented-YOLOv5 that can output angular prediction of rotated target and achieve real-time detection in aerial images. Specifically, we integrate Circular Smooth Label (CSL) into YOLOv5 (v5.0), so it inherits both the fast and lightweight features of YOLOv5 and the high performance of CSL for angle detection. Experimental results indicate that Oriented-YOLOv5s achieves an accuracy of 68.24% mAP and an efficiency of 11.8ms with aerial images of 1024×1024 input size. It can be further improved (68.86% mAP and 10.8ms) using Conv instead of Focus. We also integrate CSL into YOLOv5m and YOLOv5l, and they achieve improved accuracies (70.03% mAP and 71.2% mAP, respectively) and retain real-time inference speed. Overall, Oriented-YOLOv5 is well suited for detection tasks that have vertical view and require angular prediction with highly real-time and limited memory, so it can be used for applications deployed to satellites, UAVs, or robotic arms. Source code: https://github.com/Leesanjin/Oriented-YOLOv5
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信