Kinect-supported dataset creation for human pose estimation

Kamil Behún, A. Herout, A. Páldy
{"title":"Kinect-supported dataset creation for human pose estimation","authors":"Kamil Behún, A. Herout, A. Páldy","doi":"10.1145/2643188.2643195","DOIUrl":null,"url":null,"abstract":"Training and evaluation datasets for specific tasks of human pose estimation are hard to find. This paper presents an approach for rapid construction of a precisely annotated training dataset for human pose estimation of a sitting subject, intended especially for aeronautic cockpit. We propose to use Kinect as a tool for collecting ground truth to a purely visual dataset (for reasons defined by the application, use of Kinect or similar structured light-based approaches is impossible). Since Kinect annotation of individual joints might be imprecise at certain moments, manual post-processing of the acquired data is necessary and we propose a scheme for efficient and reliable manual post-annotation. We produced a dataset of 6,322 annotated frames, involving 11 human subjects recorded in various lighting conditions, different clothing, and varying background. Each frame contains one seated person in frontal view with annotation of pose and optical flow data. We used detectors of body parts based on Random Forest on the produced dataset in order to verify its usability. These preliminary results show that the detector can be trained successfully on the developed dataset and that the optical flow contributes to the detection accuracy considerably. The dataset and the intermediary data used during its creation is made publicly available. By this, we intend to support further research and evaluation in the specific topic of human pose estimation focused on a sitting subject in a cockpit scenario.","PeriodicalId":115384,"journal":{"name":"Proceedings of the 30th Spring Conference on Computer Graphics","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 30th Spring Conference on Computer Graphics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2643188.2643195","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

Abstract

Training and evaluation datasets for specific tasks of human pose estimation are hard to find. This paper presents an approach for rapid construction of a precisely annotated training dataset for human pose estimation of a sitting subject, intended especially for aeronautic cockpit. We propose to use Kinect as a tool for collecting ground truth to a purely visual dataset (for reasons defined by the application, use of Kinect or similar structured light-based approaches is impossible). Since Kinect annotation of individual joints might be imprecise at certain moments, manual post-processing of the acquired data is necessary and we propose a scheme for efficient and reliable manual post-annotation. We produced a dataset of 6,322 annotated frames, involving 11 human subjects recorded in various lighting conditions, different clothing, and varying background. Each frame contains one seated person in frontal view with annotation of pose and optical flow data. We used detectors of body parts based on Random Forest on the produced dataset in order to verify its usability. These preliminary results show that the detector can be trained successfully on the developed dataset and that the optical flow contributes to the detection accuracy considerably. The dataset and the intermediary data used during its creation is made publicly available. By this, we intend to support further research and evaluation in the specific topic of human pose estimation focused on a sitting subject in a cockpit scenario.
kinect支持的人体姿态估计数据集创建
用于人体姿态估计的特定任务的训练和评估数据集很难找到。本文提出了一种快速构建精确注释训练数据集的方法,用于对坐着的受试者进行姿态估计,特别是针对航空驾驶舱。我们建议使用Kinect作为收集纯视觉数据集的工具(由于应用程序定义的原因,使用Kinect或类似的基于结构光的方法是不可能的)。由于Kinect对单个关节的标注在某些时刻可能不精确,因此需要对采集到的数据进行人工后处理,我们提出了一种高效可靠的人工后处理方案。我们制作了一个包含6322个带注释的帧的数据集,涉及11个在不同照明条件下、不同服装和不同背景下记录的人类受试者。每一帧都包含一个正面视图中坐着的人,并注释了姿态和光流数据。我们在生成的数据集上使用基于随机森林的身体部位检测器来验证其可用性。这些初步结果表明,检测器可以成功地在开发的数据集上进行训练,并且光流对检测精度有很大的影响。数据集及其创建过程中使用的中间数据是公开可用的。通过这一点,我们打算支持进一步的研究和评估人类姿态估计的具体主题,重点是座舱场景中坐着的受试者。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信