Shuang Wu, Shibao Zheng, Hua Yang, Yawen Fan, Longfei Liang, Hang Su
{"title":"SAGTA:人群场景中的半自动地面真相标注","authors":"Shuang Wu, Shibao Zheng, Hua Yang, Yawen Fan, Longfei Liang, Hang Su","doi":"10.1109/ICMEW.2014.6890539","DOIUrl":null,"url":null,"abstract":"Ground truth is crucial in the performance evaluation of algorithms. Nevertheless, it is a tedious and time-consuming task to annotate ground truth manually, especially in crowd scenes. In this paper, we propose a novel semi-automatic tool called SAGTA (Semi-automatic Ground Truth Annotation Tool), which can assist researchers to annotate pedestrians easily and quickly in crowd scenes. Firstly, users label pedestrians manually in a few key frames by drawing bounding boxes through the friendly GUI of SAGTA. Then, the annotations in the rest frames are coarsely estimated by automatically interpolating based on 3D linear motion assumption. Moreover, our tool refines the estimated annotations through using ORB feature matching. This coarse-to-fine method facilitates the annotation process efficiently. Afterwards, the refined annotations are manually verified and corrected to guarantee the accuracy of annotations. In addition, some extra information (such as density, trajectory and occlusion relationships) can be inferred automatically and visualized vividly. The proposed tool has been tested on PETS and real surveillance data sets. Experimental results demonstrate that SAGTA achieves superior performance in time cost than ViPER-GT, which is the widely used annotation tool.","PeriodicalId":178700,"journal":{"name":"2014 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)","volume":"328 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"SAGTA: Semi-automatic Ground Truth Annotation in crowd scenes\",\"authors\":\"Shuang Wu, Shibao Zheng, Hua Yang, Yawen Fan, Longfei Liang, Hang Su\",\"doi\":\"10.1109/ICMEW.2014.6890539\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Ground truth is crucial in the performance evaluation of algorithms. Nevertheless, it is a tedious and time-consuming task to annotate ground truth manually, especially in crowd scenes. In this paper, we propose a novel semi-automatic tool called SAGTA (Semi-automatic Ground Truth Annotation Tool), which can assist researchers to annotate pedestrians easily and quickly in crowd scenes. Firstly, users label pedestrians manually in a few key frames by drawing bounding boxes through the friendly GUI of SAGTA. Then, the annotations in the rest frames are coarsely estimated by automatically interpolating based on 3D linear motion assumption. Moreover, our tool refines the estimated annotations through using ORB feature matching. This coarse-to-fine method facilitates the annotation process efficiently. Afterwards, the refined annotations are manually verified and corrected to guarantee the accuracy of annotations. In addition, some extra information (such as density, trajectory and occlusion relationships) can be inferred automatically and visualized vividly. The proposed tool has been tested on PETS and real surveillance data sets. Experimental results demonstrate that SAGTA achieves superior performance in time cost than ViPER-GT, which is the widely used annotation tool.\",\"PeriodicalId\":178700,\"journal\":{\"name\":\"2014 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)\",\"volume\":\"328 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-07-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2014 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICMEW.2014.6890539\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMEW.2014.6890539","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
摘要
在算法的性能评估中,基础真值是至关重要的。然而,手动标注地面真相是一项繁琐且耗时的任务,特别是在人群场景中。在本文中,我们提出了一种新的半自动工具SAGTA (semi-automatic Ground Truth Annotation tool),它可以帮助研究人员在人群场景中轻松快速地对行人进行标注。首先,用户通过SAGTA友好的GUI绘制边界框,在几个关键帧中手动标记行人。然后,基于三维线性运动假设,通过自动插值对剩余帧中的注释进行粗估计。此外,我们的工具通过使用ORB特征匹配来细化估计的注释。这种从粗到精的方法简化了标注过程。然后,对改进后的标注进行人工验证和校正,保证标注的准确性。此外,一些额外的信息(如密度、轨迹和遮挡关系)可以自动推断并生动地可视化。所提出的工具已在pet和实际监测数据集上进行了测试。实验结果表明,SAGTA在时间成本上优于目前广泛使用的标注工具ViPER-GT。
SAGTA: Semi-automatic Ground Truth Annotation in crowd scenes
Ground truth is crucial in the performance evaluation of algorithms. Nevertheless, it is a tedious and time-consuming task to annotate ground truth manually, especially in crowd scenes. In this paper, we propose a novel semi-automatic tool called SAGTA (Semi-automatic Ground Truth Annotation Tool), which can assist researchers to annotate pedestrians easily and quickly in crowd scenes. Firstly, users label pedestrians manually in a few key frames by drawing bounding boxes through the friendly GUI of SAGTA. Then, the annotations in the rest frames are coarsely estimated by automatically interpolating based on 3D linear motion assumption. Moreover, our tool refines the estimated annotations through using ORB feature matching. This coarse-to-fine method facilitates the annotation process efficiently. Afterwards, the refined annotations are manually verified and corrected to guarantee the accuracy of annotations. In addition, some extra information (such as density, trajectory and occlusion relationships) can be inferred automatically and visualized vividly. The proposed tool has been tested on PETS and real surveillance data sets. Experimental results demonstrate that SAGTA achieves superior performance in time cost than ViPER-GT, which is the widely used annotation tool.