在缺乏丰富训练数据集的情况下提高行人检测性能:一个英国案例研究

Juliana Negrini de Araujo, V. Palade, Tabassom Sedighi, A. Daneshkhah
{"title":"在缺乏丰富训练数据集的情况下提高行人检测性能:一个英国案例研究","authors":"Juliana Negrini de Araujo, V. Palade, Tabassom Sedighi, A. Daneshkhah","doi":"10.54364/aaiml.2022.1121","DOIUrl":null,"url":null,"abstract":"The World Health Organization estimates that well in excess of one million of lives are lost each year due to road traffic accidents. Since the human factor is the preeminent cause behind the traffic accidents, the development of reliable Advanced Driver Assistance Systems (ADASs) and Autonomous Vehicles (AVs) is seen by many as a possible solution to improve road safety. ADASs rely on the car perception system input that consists of camera(s), LIDAR and/or radar to detect pedestrians and other objects on the road. Hardware improvements as well as advances done in employing Deep Learning techniques for object detection popularized the Convolutional Neural Networks in the area of autonomous driving research and applications. However, the availability of quality and large datasets continues to be a most important contributor to the Deep Learning based model’s performance. With this in mind, this work analyses how a YOLO-based object detection architecture responded to limited data available for training and containing low-quality images. The work focused on pedestrian detection, since vulnerable road user’s safety is a major concern within AV and ADAS research communities. The proposed model was trained and tested on data gathered from Coventry, United Kingdom, city streets. The results show that the original YOLOv3 implementation reaches a 42.18% average precision (AP) and the main challenge was in detecting small objects. Network modifications were made and our final model, based on the original YOLOv3 implementation, achieved 51.6% AP. It is also demonstrated that the employed data augmentation approach is responsible for doubling the average precision of the final model.","PeriodicalId":373878,"journal":{"name":"Adv. Artif. Intell. Mach. Learn.","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Improving the Pedestrian Detection Performance in the Absence of Rich Training Datasets: A UK Case Study\",\"authors\":\"Juliana Negrini de Araujo, V. Palade, Tabassom Sedighi, A. Daneshkhah\",\"doi\":\"10.54364/aaiml.2022.1121\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The World Health Organization estimates that well in excess of one million of lives are lost each year due to road traffic accidents. Since the human factor is the preeminent cause behind the traffic accidents, the development of reliable Advanced Driver Assistance Systems (ADASs) and Autonomous Vehicles (AVs) is seen by many as a possible solution to improve road safety. ADASs rely on the car perception system input that consists of camera(s), LIDAR and/or radar to detect pedestrians and other objects on the road. Hardware improvements as well as advances done in employing Deep Learning techniques for object detection popularized the Convolutional Neural Networks in the area of autonomous driving research and applications. However, the availability of quality and large datasets continues to be a most important contributor to the Deep Learning based model’s performance. With this in mind, this work analyses how a YOLO-based object detection architecture responded to limited data available for training and containing low-quality images. The work focused on pedestrian detection, since vulnerable road user’s safety is a major concern within AV and ADAS research communities. The proposed model was trained and tested on data gathered from Coventry, United Kingdom, city streets. The results show that the original YOLOv3 implementation reaches a 42.18% average precision (AP) and the main challenge was in detecting small objects. Network modifications were made and our final model, based on the original YOLOv3 implementation, achieved 51.6% AP. It is also demonstrated that the employed data augmentation approach is responsible for doubling the average precision of the final model.\",\"PeriodicalId\":373878,\"journal\":{\"name\":\"Adv. Artif. Intell. Mach. Learn.\",\"volume\":\"22 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Adv. Artif. Intell. Mach. Learn.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.54364/aaiml.2022.1121\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Adv. Artif. Intell. Mach. Learn.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.54364/aaiml.2022.1121","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

世界卫生组织估计,每年因道路交通事故而丧生的人数远远超过100万人。由于人为因素是导致交通事故的主要原因,因此开发可靠的高级驾驶辅助系统(ADASs)和自动驾驶汽车(AVs)被许多人视为改善道路安全的可能解决方案。ADASs依靠由摄像头、激光雷达和/或雷达组成的汽车感知系统输入来检测道路上的行人和其他物体。硬件的改进以及使用深度学习技术进行目标检测的进步使卷积神经网络在自动驾驶研究和应用领域得到普及。然而,高质量和大型数据集的可用性仍然是基于深度学习的模型性能的最重要贡献者。考虑到这一点,本工作分析了基于yolo的目标检测架构如何响应用于训练和包含低质量图像的有限数据。这项工作的重点是行人检测,因为弱势道路使用者的安全是自动驾驶和ADAS研究界关注的主要问题。所提出的模型在英国考文垂城市街道收集的数据上进行了训练和测试。结果表明,原始的YOLOv3实现达到42.18%的平均精度(AP),主要挑战是检测小目标。我们对网络进行了修改,最终模型在原始的YOLOv3实现的基础上实现了51.6%的AP。还表明,采用数据增强方法可以使最终模型的平均精度提高一倍。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Improving the Pedestrian Detection Performance in the Absence of Rich Training Datasets: A UK Case Study
The World Health Organization estimates that well in excess of one million of lives are lost each year due to road traffic accidents. Since the human factor is the preeminent cause behind the traffic accidents, the development of reliable Advanced Driver Assistance Systems (ADASs) and Autonomous Vehicles (AVs) is seen by many as a possible solution to improve road safety. ADASs rely on the car perception system input that consists of camera(s), LIDAR and/or radar to detect pedestrians and other objects on the road. Hardware improvements as well as advances done in employing Deep Learning techniques for object detection popularized the Convolutional Neural Networks in the area of autonomous driving research and applications. However, the availability of quality and large datasets continues to be a most important contributor to the Deep Learning based model’s performance. With this in mind, this work analyses how a YOLO-based object detection architecture responded to limited data available for training and containing low-quality images. The work focused on pedestrian detection, since vulnerable road user’s safety is a major concern within AV and ADAS research communities. The proposed model was trained and tested on data gathered from Coventry, United Kingdom, city streets. The results show that the original YOLOv3 implementation reaches a 42.18% average precision (AP) and the main challenge was in detecting small objects. Network modifications were made and our final model, based on the original YOLOv3 implementation, achieved 51.6% AP. It is also demonstrated that the employed data augmentation approach is responsible for doubling the average precision of the final model.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信