Metro Pedestrian Detection Based on Mask R-CNN and Spatial-temporal Feature

2020 IEEE 3rd International Conference on Information Communication and Signal Processing (ICICSP) Pub Date : 2020-09-01 DOI:10.1109/ICICSP50920.2020.9232096

Guochen Shen, Faezeh Jamshidi, Decun Dong, Rei ZhG

引用次数: 0

Abstract

In this paper, we introduce the Mask R-CNN, an object detection method based on deep learning networks, to detect the number of pedestrians from surveillance video in the metro train carriage and on the metro station platform, and introduce the fusion of the multi-frame processing result to reduce the detection error. In order to apply and analyze the detection result, we establish a spatial-temporal model of the number of pedestrians in the carriage and on the platform. The experiment shows the efficient result of our method. The average accuracy of the single-frame detection is 73.43%. By fusing the detection result of frames in time series, the average accuracy is 88.85%, which increases 21%. The data of pedestrians’ numbers produced by our method can be helpful for metro management, pedestrian guidance, emergency management and so on.

查看原文本刊更多论文

基于掩模R-CNN和时空特征的地铁行人检测

本文介绍了一种基于深度学习网络的目标检测方法Mask R-CNN，对地铁车厢和地铁站站台的监控视频中行人数量进行检测，并引入多帧处理结果的融合来降低检测误差。为了应用和分析检测结果，我们建立了车厢内和站台上行人数量的时空模型。实验结果表明了该方法的有效性。单帧检测的平均准确率为73.43%。通过对时间序列中帧的检测结果进行融合，平均准确率为88.85%，提高了21%。该方法生成的行人数量数据可以为地铁管理、行人引导、应急管理等提供帮助。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2020 IEEE 3rd International Conference on Information Communication and Signal Processing (ICICSP)

自引率

0.00%

发文量