OpenThermalPose2:用更多的数据、主题和姿势扩展开源注释热人体姿势数据集

IF 5
Askat Kuzdeuov;Miras Zakaryanov;Alim Tleuliyev;Huseyin Atakan Varol
{"title":"OpenThermalPose2:用更多的数据、主题和姿势扩展开源注释热人体姿势数据集","authors":"Askat Kuzdeuov;Miras Zakaryanov;Alim Tleuliyev;Huseyin Atakan Varol","doi":"10.1109/TBIOM.2025.3575499","DOIUrl":null,"url":null,"abstract":"Human pose estimation has many applications in action recognition, human-robot interaction, motion capture, augmented reality, sports analytics, and healthcare. Numerous datasets and deep learning models have been developed for human pose estimation within the visible domain. However, poor lighting conditions and privacy issues persist. These challenges can be addressed using thermal cameras; however, there is a limited number of annotated thermal human pose datasets for training deep learning models. Previously, we presented the OpenThermalPose dataset with 6,090 thermal images of 31 subjects and 14,315 annotated human instances. In this work, we extend OpenThermalPose with more thermal images, human instances, and poses. The extended dataset, OpenThermalPose2, contains 21,125 elaborately annotated human instances within 11,391 thermal images of 170 subjects. To show the efficacy of OpenThermalPose2, we trained the YOLOv8-pose and YOLO11-pose models on the dataset. The experimental results showed that models trained with OpenThermalPose2 outperformed the previous YOLOv8-pose models trained with OpenThermalPose. Additionally, we optimized the YOLO11-pose models trained on OpenThermalPose2 by converting their checkpoints from PyTorch to TensorRT formats. We deployed the PyTorch and TensorRT models on an NVIDIA Jetson AGX Orin 64GB and measured their inference time and accuracy. The TensorRT models using half-precision floating-point (FP16) achieved the best balance between speed and accuracy, making them suitable for real-time applications. We have made the dataset, source code, and pre-trained models publicly available at <uri>https://github.com/IS2AI/OpenThermalPose</uri> to bolster research in this field.","PeriodicalId":73307,"journal":{"name":"IEEE transactions on biometrics, behavior, and identity science","volume":"7 4","pages":"902-913"},"PeriodicalIF":5.0000,"publicationDate":"2025-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"OpenThermalPose2: Extending the Open-Source Annotated Thermal Human Pose Dataset With More Data, Subjects, and Poses\",\"authors\":\"Askat Kuzdeuov;Miras Zakaryanov;Alim Tleuliyev;Huseyin Atakan Varol\",\"doi\":\"10.1109/TBIOM.2025.3575499\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Human pose estimation has many applications in action recognition, human-robot interaction, motion capture, augmented reality, sports analytics, and healthcare. Numerous datasets and deep learning models have been developed for human pose estimation within the visible domain. However, poor lighting conditions and privacy issues persist. These challenges can be addressed using thermal cameras; however, there is a limited number of annotated thermal human pose datasets for training deep learning models. Previously, we presented the OpenThermalPose dataset with 6,090 thermal images of 31 subjects and 14,315 annotated human instances. In this work, we extend OpenThermalPose with more thermal images, human instances, and poses. The extended dataset, OpenThermalPose2, contains 21,125 elaborately annotated human instances within 11,391 thermal images of 170 subjects. To show the efficacy of OpenThermalPose2, we trained the YOLOv8-pose and YOLO11-pose models on the dataset. The experimental results showed that models trained with OpenThermalPose2 outperformed the previous YOLOv8-pose models trained with OpenThermalPose. Additionally, we optimized the YOLO11-pose models trained on OpenThermalPose2 by converting their checkpoints from PyTorch to TensorRT formats. We deployed the PyTorch and TensorRT models on an NVIDIA Jetson AGX Orin 64GB and measured their inference time and accuracy. The TensorRT models using half-precision floating-point (FP16) achieved the best balance between speed and accuracy, making them suitable for real-time applications. We have made the dataset, source code, and pre-trained models publicly available at <uri>https://github.com/IS2AI/OpenThermalPose</uri> to bolster research in this field.\",\"PeriodicalId\":73307,\"journal\":{\"name\":\"IEEE transactions on biometrics, behavior, and identity science\",\"volume\":\"7 4\",\"pages\":\"902-913\"},\"PeriodicalIF\":5.0000,\"publicationDate\":\"2025-06-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE transactions on biometrics, behavior, and identity science\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/11020744/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on biometrics, behavior, and identity science","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/11020744/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

人体姿势估计在动作识别、人机交互、动作捕捉、增强现实、运动分析和医疗保健中有许多应用。许多数据集和深度学习模型已经开发用于在可见域内的人体姿态估计。然而,恶劣的照明条件和隐私问题仍然存在。这些挑战可以通过热像仪来解决;然而,用于训练深度学习模型的带注释的热人体姿势数据集数量有限。之前,我们提出了OpenThermalPose数据集,其中包含31个受试者的6090张热图像和14,315个注释的人类实例。在这项工作中,我们扩展了OpenThermalPose,增加了更多的热图像、人体实例和姿势。扩展数据集OpenThermalPose2包含170个主题的11,391张热图像中的21,125个精心注释的人类实例。为了证明OpenThermalPose2的有效性,我们在数据集上训练了YOLOv8-pose和YOLO11-pose模型。实验结果表明,使用OpenThermalPose2训练的模型优于之前使用OpenThermalPose训练的YOLOv8-pose模型。此外,我们优化了在OpenThermalPose2上训练的YOLO11-pose模型,将它们的检查点从PyTorch转换为TensorRT格式。我们将PyTorch和TensorRT模型部署在NVIDIA Jetson AGX Orin 64GB上,并测量了它们的推理时间和准确性。使用半精度浮点(FP16)的TensorRT模型实现了速度和精度之间的最佳平衡,使其适合实时应用。我们已经在https://github.com/IS2AI/OpenThermalPose上公开了数据集、源代码和预训练模型,以支持该领域的研究。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
OpenThermalPose2: Extending the Open-Source Annotated Thermal Human Pose Dataset With More Data, Subjects, and Poses
Human pose estimation has many applications in action recognition, human-robot interaction, motion capture, augmented reality, sports analytics, and healthcare. Numerous datasets and deep learning models have been developed for human pose estimation within the visible domain. However, poor lighting conditions and privacy issues persist. These challenges can be addressed using thermal cameras; however, there is a limited number of annotated thermal human pose datasets for training deep learning models. Previously, we presented the OpenThermalPose dataset with 6,090 thermal images of 31 subjects and 14,315 annotated human instances. In this work, we extend OpenThermalPose with more thermal images, human instances, and poses. The extended dataset, OpenThermalPose2, contains 21,125 elaborately annotated human instances within 11,391 thermal images of 170 subjects. To show the efficacy of OpenThermalPose2, we trained the YOLOv8-pose and YOLO11-pose models on the dataset. The experimental results showed that models trained with OpenThermalPose2 outperformed the previous YOLOv8-pose models trained with OpenThermalPose. Additionally, we optimized the YOLO11-pose models trained on OpenThermalPose2 by converting their checkpoints from PyTorch to TensorRT formats. We deployed the PyTorch and TensorRT models on an NVIDIA Jetson AGX Orin 64GB and measured their inference time and accuracy. The TensorRT models using half-precision floating-point (FP16) achieved the best balance between speed and accuracy, making them suitable for real-time applications. We have made the dataset, source code, and pre-trained models publicly available at https://github.com/IS2AI/OpenThermalPose to bolster research in this field.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
10.90
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信