Zhihao Chen, R. Khemmar, B. Decoux, A. Atahouet, J. Ertaud
{"title":"Real Time Object Detection, Tracking, and Distance and Motion Estimation based on Deep Learning: Application to Smart Mobility","authors":"Zhihao Chen, R. Khemmar, B. Decoux, A. Atahouet, J. Ertaud","doi":"10.1109/EST.2019.8806222","DOIUrl":null,"url":null,"abstract":"In this paper, we will introduce our object detection, localization and tracking system for smart mobility applications like traffic road and railway environment. Firstly, an object detection and tracking approach was firstly carried out within two deep learning approaches: You Only Look Once (YOLO) V3 and Single Shot Detector (SSD). A comparison between the two methods allows us to identify their applicability in the traffic environment. Both the performances in road and in railway environments were evaluated. Secondly, object distance estimation based on Monodepth algorithm was developed. This model is trained on stereo images dataset but its inference uses monocular images. As the output data, we have a disparity map that we combine with the output of object detection. To validate our approach, we have tested two models with different backbones including VGG and ResNet used with two datasets: Cityscape and KITTI. As the last step of our approach, we have developed a new method-based SSD to analyse the behavior of pedestrian and vehicle by tracking their movements even in case of no detection on some images of a sequence. We have developed an algorithm based on the coordinates of the output bounding boxes of the SSD algorithm. The objective is to determine if the trajectory of a pedestrian or vehicle can lead to a dangerous situations. The whole of development is tested in real vehicle traffic conditions in Rouen city center, and with videos taken by embedded cameras along the Rouen tramway.","PeriodicalId":102238,"journal":{"name":"2019 Eighth International Conference on Emerging Security Technologies (EST)","volume":"103 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"41","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 Eighth International Conference on Emerging Security Technologies (EST)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/EST.2019.8806222","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 41
Abstract
In this paper, we will introduce our object detection, localization and tracking system for smart mobility applications like traffic road and railway environment. Firstly, an object detection and tracking approach was firstly carried out within two deep learning approaches: You Only Look Once (YOLO) V3 and Single Shot Detector (SSD). A comparison between the two methods allows us to identify their applicability in the traffic environment. Both the performances in road and in railway environments were evaluated. Secondly, object distance estimation based on Monodepth algorithm was developed. This model is trained on stereo images dataset but its inference uses monocular images. As the output data, we have a disparity map that we combine with the output of object detection. To validate our approach, we have tested two models with different backbones including VGG and ResNet used with two datasets: Cityscape and KITTI. As the last step of our approach, we have developed a new method-based SSD to analyse the behavior of pedestrian and vehicle by tracking their movements even in case of no detection on some images of a sequence. We have developed an algorithm based on the coordinates of the output bounding boxes of the SSD algorithm. The objective is to determine if the trajectory of a pedestrian or vehicle can lead to a dangerous situations. The whole of development is tested in real vehicle traffic conditions in Rouen city center, and with videos taken by embedded cameras along the Rouen tramway.
在本文中,我们将介绍我们的目标检测,定位和跟踪系统,用于智能移动应用,如交通道路和铁路环境。首先,在两种深度学习方法:You Only Look Once (YOLO) V3和Single Shot Detector (SSD)中首先进行了目标检测和跟踪方法。通过对两种方法的比较,我们可以确定它们在交通环境中的适用性。对其在公路和铁路环境下的性能进行了评价。其次,提出了基于Monodepth算法的目标距离估计方法。该模型是在立体图像数据集上训练的,但其推理使用的是单眼图像。作为输出数据,我们有一个视差图,我们结合输出的目标检测。为了验证我们的方法,我们测试了两个具有不同主干的模型,包括VGG和ResNet,使用两个数据集:Cityscape和KITTI。作为我们方法的最后一步,我们开发了一种基于SSD的新方法,即使在序列的某些图像没有检测的情况下,也可以通过跟踪行人和车辆的运动来分析他们的行为。我们开发了一种基于SSD算法的输出边界框坐标的算法。目的是确定行人或车辆的轨迹是否会导致危险情况。整个开发过程在鲁昂市中心的真实车辆交通条件下进行了测试,并通过鲁昂有轨电车沿线的嵌入式摄像头拍摄了视频。