在缺乏丰富训练数据集的情况下提高行人检测性能:一个英国案例研究

Adv. Artif. Intell. Mach. Learn. Pub Date : 1900-01-01 DOI:10.54364/aaiml.2022.1121

Juliana Negrini de Araujo, V. Palade, Tabassom Sedighi, A. Daneshkhah

{"title":"在缺乏丰富训练数据集的情况下提高行人检测性能:一个英国案例研究","authors":"Juliana Negrini de Araujo, V. Palade, Tabassom Sedighi, A. Daneshkhah","doi":"10.54364/aaiml.2022.1121","DOIUrl":null,"url":null,"abstract":"The World Health Organization estimates that well in excess of one million of lives are lost each year due to road traffic accidents. Since the human factor is the preeminent cause behind the traffic accidents, the development of reliable Advanced Driver Assistance Systems (ADASs) and Autonomous Vehicles (AVs) is seen by many as a possible solution to improve road safety. ADASs rely on the car perception system input that consists of camera(s), LIDAR and/or radar to detect pedestrians and other objects on the road. Hardware improvements as well as advances done in employing Deep Learning techniques for object detection popularized the Convolutional Neural Networks in the area of autonomous driving research and applications. However, the availability of quality and large datasets continues to be a most important contributor to the Deep Learning based model’s performance. With this in mind, this work analyses how a YOLO-based object detection architecture responded to limited data available for training and containing low-quality images. The work focused on pedestrian detection, since vulnerable road user’s safety is a major concern within AV and ADAS research communities. The proposed model was trained and tested on data gathered from Coventry, United Kingdom, city streets. The results show that the original YOLOv3 implementation reaches a 42.18% average precision (AP) and the main challenge was in detecting small objects. Network modifications were made and our final model, based on the original YOLOv3 implementation, achieved 51.6% AP. It is also demonstrated that the employed data augmentation approach is responsible for doubling the average precision of the final model.","PeriodicalId":373878,"journal":{"name":"Adv. Artif. Intell. Mach. Learn.","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Improving the Pedestrian Detection Performance in the Absence of Rich Training Datasets: A UK Case Study\",\"authors\":\"Juliana Negrini de Araujo, V. Palade, Tabassom Sedighi, A. Daneshkhah\",\"doi\":\"10.54364/aaiml.2022.1121\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The World Health Organization estimates that well in excess of one million of lives are lost each year due to road traffic accidents. Since the human factor is the preeminent cause behind the traffic accidents, the development of reliable Advanced Driver Assistance Systems (ADASs) and Autonomous Vehicles (AVs) is seen by many as a possible solution to improve road safety. ADASs rely on the car perception system input that consists of camera(s), LIDAR and/or radar to detect pedestrians and other objects on the road. Hardware improvements as well as advances done in employing Deep Learning techniques for object detection popularized the Convolutional Neural Networks in the area of autonomous driving research and applications. However, the availability of quality and large datasets continues to be a most important contributor to the Deep Learning based model’s performance. With this in mind, this work analyses how a YOLO-based object detection architecture responded to limited data available for training and containing low-quality images. The work focused on pedestrian detection, since vulnerable road user’s safety is a major concern within AV and ADAS research communities. The proposed model was trained and tested on data gathered from Coventry, United Kingdom, city streets. The results show that the original YOLOv3 implementation reaches a 42.18% average precision (AP) and the main challenge was in detecting small objects. Network modifications were made and our final model, based on the original YOLOv3 implementation, achieved 51.6% AP. It is also demonstrated that the employed data augmentation approach is responsible for doubling the average precision of the final model.\",\"PeriodicalId\":373878,\"journal\":{\"name\":\"Adv. Artif. Intell. Mach. Learn.\",\"volume\":\"22 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Adv. Artif. Intell. Mach. Learn.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.54364/aaiml.2022.1121\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Adv. Artif. Intell. Mach. Learn.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.54364/aaiml.2022.1121","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

世界卫生组织估计，每年因道路交通事故而丧生的人数远远超过100万人。由于人为因素是导致交通事故的主要原因，因此开发可靠的高级驾驶辅助系统(ADASs)和自动驾驶汽车(AVs)被许多人视为改善道路安全的可能解决方案。ADASs依靠由摄像头、激光雷达和/或雷达组成的汽车感知系统输入来检测道路上的行人和其他物体。硬件的改进以及使用深度学习技术进行目标检测的进步使卷积神经网络在自动驾驶研究和应用领域得到普及。然而，高质量和大型数据集的可用性仍然是基于深度学习的模型性能的最重要贡献者。考虑到这一点，本工作分析了基于yolo的目标检测架构如何响应用于训练和包含低质量图像的有限数据。这项工作的重点是行人检测，因为弱势道路使用者的安全是自动驾驶和ADAS研究界关注的主要问题。所提出的模型在英国考文垂城市街道收集的数据上进行了训练和测试。结果表明，原始的YOLOv3实现达到42.18%的平均精度(AP)，主要挑战是检测小目标。我们对网络进行了修改，最终模型在原始的YOLOv3实现的基础上实现了51.6%的AP。还表明，采用数据增强方法可以使最终模型的平均精度提高一倍。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Improving the Pedestrian Detection Performance in the Absence of Rich Training Datasets: A UK Case Study

The World Health Organization estimates that well in excess of one million of lives are lost each year due to road traffic accidents. Since the human factor is the preeminent cause behind the traffic accidents, the development of reliable Advanced Driver Assistance Systems (ADASs) and Autonomous Vehicles (AVs) is seen by many as a possible solution to improve road safety. ADASs rely on the car perception system input that consists of camera(s), LIDAR and/or radar to detect pedestrians and other objects on the road. Hardware improvements as well as advances done in employing Deep Learning techniques for object detection popularized the Convolutional Neural Networks in the area of autonomous driving research and applications. However, the availability of quality and large datasets continues to be a most important contributor to the Deep Learning based model’s performance. With this in mind, this work analyses how a YOLO-based object detection architecture responded to limited data available for training and containing low-quality images. The work focused on pedestrian detection, since vulnerable road user’s safety is a major concern within AV and ADAS research communities. The proposed model was trained and tested on data gathered from Coventry, United Kingdom, city streets. The results show that the original YOLOv3 implementation reaches a 42.18% average precision (AP) and the main challenge was in detecting small objects. Network modifications were made and our final model, based on the original YOLOv3 implementation, achieved 51.6% AP. It is also demonstrated that the employed data augmentation approach is responsible for doubling the average precision of the final model.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Adv. Artif. Intell. Mach. Learn.

自引率

0.00%

发文量