{"title":"A multi-view annotation tool for people detection evaluation","authors":"Á. Utasi, C. Benedek","doi":"10.1145/2304496.2304499","DOIUrl":null,"url":null,"abstract":"In this paper we introduce a novel multi-view annotation tool for generating 3D ground truth data of the real location of people in the scene. The proposed tool allows the user to accurately select the ground occupancy of people by aligning an oriented rectangle on the ground plane. In addition, the height of the people can also be adjusted. In order to achieve precise ground truth data the user is aided by the video frames of multiple synchronized and calibrated cameras. Finally, the 3D annotation data can be easily converted to 2D image positions using the available calibration matrices. One key advantage of the proposed technique is that different methods can be compared against each other, whether they estimate the real world ground position of people or the 2D position on the camera images. Therefore, we defined two different error metrics, which quantitatively evaluate the estimated positions. We used the proposed tool to annotate two publicly available datasets, and evaluated the metrics on two state of the art algorithms.","PeriodicalId":196376,"journal":{"name":"International Workshop on Video and Image Ground Truth in Computer Vision Applications","volume":"162 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-05-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Workshop on Video and Image Ground Truth in Computer Vision Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2304496.2304499","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 13
Abstract
In this paper we introduce a novel multi-view annotation tool for generating 3D ground truth data describing the real locations of people in a scene. The proposed tool allows the user to accurately mark the ground occupancy of each person by aligning an oriented rectangle on the ground plane; the person's height can also be adjusted. To produce precise ground truth, the user is aided by the video frames of multiple synchronized and calibrated cameras. Finally, the 3D annotation data can easily be converted to 2D image positions using the available calibration matrices. One key advantage of the proposed technique is that different methods can be compared against each other, whether they estimate the real-world ground position of people or the 2D position on the camera images. To this end, we define two error metrics that quantitatively evaluate the estimated positions. We used the proposed tool to annotate two publicly available datasets, and evaluated the metrics on two state-of-the-art algorithms.
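
The 3D-to-2D conversion mentioned in the abstract corresponds, in standard pinhole-camera terms, to projection with a 3x4 calibration matrix. Below is a minimal sketch of that step, not the authors' released tool: the function name project_to_image, the matrix values, and the example point are illustrative assumptions, presuming a projection matrix of the form P = K[R | t].

import numpy as np

def project_to_image(P, world_point):
    """Project a 3D world point (X, Y, Z) to pixel coordinates (u, v).

    P is a 3x4 projection matrix P = K [R | t] from camera calibration.
    """
    X = np.append(np.asarray(world_point, dtype=float), 1.0)  # homogeneous world point
    u, v, w = P @ X                                            # homogeneous image point
    return np.array([u / w, v / w])                            # divide out the scale

# Placeholder calibration: simple intrinsics, identity rotation, and the camera
# translated 10 m along its optical axis so the example point has positive depth.
K = np.array([[1000.0,    0.0, 640.0],
              [   0.0, 1000.0, 360.0],
              [   0.0,    0.0,   1.0]])
Rt = np.hstack([np.eye(3), np.array([[0.0], [0.0], [10.0]])])
P_cam = K @ Rt

# Project one hypothetical annotated world point (coordinates are placeholders,
# not taken from the datasets used in the paper).
uv = project_to_image(P_cam, [2.0, 3.0, 0.0])
print(uv)  # pixel coordinates in the placeholder camera

With per-camera projection matrices like P_cam for every synchronized view, the same annotated 3D position can be projected into each image, which is what allows both ground-plane and image-plane detectors to be evaluated against one annotation.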