Synthetic Data Meets Real-World Challenges: Transfer Learning for Driver Pose Estimation Using RAMSIS

IF 2.5 4区计算机科学 Q3 TELECOMMUNICATIONS

Transactions on Emerging Telecommunications Technologies Pub Date : 2025-09-23 DOI:10.1002/ett.70240

Junjie Gou, Xian Wu, Jianwang Shao

{"title":"Synthetic Data Meets Real-World Challenges: Transfer Learning for Driver Pose Estimation Using RAMSIS","authors":"Junjie Gou, Xian Wu, Jianwang Shao","doi":"10.1002/ett.70240","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Driver posture recognition holds significant application value in intelligent driving and human-machine interaction. However, the difficulty in acquiring driving posture data and the high labeling costs result in large errors in posture recognition models, which severely limit the development of related algorithm research. This paper presents a driving posture transfer learning method based on the RAMSIS synthetic dataset, which can effectively reduce algorithm development costs and significantly improve model performance. First, we constructed a synthetic dataset based on RAMSIS, which includes images of 21 typical driving postures from 54 human body models and corresponding 3D keypoint ground truth labels. The data generation method efficiently acquires sample data, saving substantial labeling efforts, while also incorporating constraints from the automotive ergonomics environment to enhance the realism of the synthetic data—a crucial aspect in dataset construction. Next, we applied a pre-trained model to generate keypoint pseudo-labels for real vehicle driving images, which were then combined with the synthetic data to form a mixed dataset. Finally, we employed a fine-tuning strategy to perform transfer learning on the pre-trained model using the mixed dataset, and explored the impact of synthetic data proportion on model performance. Experimental results show that the proposed method achieves an optimal MPJPE of 30.4 mm, significantly outperforming other models. This provides new data support for driving posture estimation, with excellent model performance in posture recognition, thus offering potential for quantitative posture evaluation. It lays the foundation for broader applications in vehicle human-machine interaction and demonstrates promising engineering prospects.</p>\n </div>","PeriodicalId":23282,"journal":{"name":"Transactions on Emerging Telecommunications Technologies","volume":"36 10","pages":""},"PeriodicalIF":2.5000,"publicationDate":"2025-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Transactions on Emerging Telecommunications Technologies","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/ett.70240","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"TELECOMMUNICATIONS","Score":null,"Total":0}

引用次数: 0

Abstract

Driver posture recognition holds significant application value in intelligent driving and human-machine interaction. However, the difficulty in acquiring driving posture data and the high labeling costs result in large errors in posture recognition models, which severely limit the development of related algorithm research. This paper presents a driving posture transfer learning method based on the RAMSIS synthetic dataset, which can effectively reduce algorithm development costs and significantly improve model performance. First, we constructed a synthetic dataset based on RAMSIS, which includes images of 21 typical driving postures from 54 human body models and corresponding 3D keypoint ground truth labels. The data generation method efficiently acquires sample data, saving substantial labeling efforts, while also incorporating constraints from the automotive ergonomics environment to enhance the realism of the synthetic data—a crucial aspect in dataset construction. Next, we applied a pre-trained model to generate keypoint pseudo-labels for real vehicle driving images, which were then combined with the synthetic data to form a mixed dataset. Finally, we employed a fine-tuning strategy to perform transfer learning on the pre-trained model using the mixed dataset, and explored the impact of synthetic data proportion on model performance. Experimental results show that the proposed method achieves an optimal MPJPE of 30.4 mm, significantly outperforming other models. This provides new data support for driving posture estimation, with excellent model performance in posture recognition, thus offering potential for quantitative posture evaluation. It lays the foundation for broader applications in vehicle human-machine interaction and demonstrates promising engineering prospects.

Abstract Image

查看原文本刊更多论文

合成数据满足现实世界的挑战：使用RAMSIS进行驾驶员姿态估计的迁移学习

驾驶员姿态识别在智能驾驶和人机交互中具有重要的应用价值。然而，由于获取驾驶姿态数据困难、标注成本高，导致姿态识别模型误差较大，严重限制了相关算法研究的发展。本文提出了一种基于RAMSIS合成数据集的驾驶姿态迁移学习方法，可以有效降低算法开发成本，显著提高模型性能。首先，我们构建了一个基于RAMSIS的合成数据集，该数据集包括来自54个人体模型的21种典型驾驶姿势图像和相应的三维关键点地面真值标签。数据生成方法有效地获取样本数据，节省了大量的标注工作，同时还结合了汽车工效学环境的约束，以增强合成数据的真实感——这是数据集构建的关键方面。接下来，我们应用预训练模型生成真实车辆驾驶图像的关键点伪标签，然后将其与合成数据结合形成混合数据集。最后，我们采用微调策略对混合数据集的预训练模型进行迁移学习，并探讨合成数据比例对模型性能的影响。实验结果表明，该方法的最佳MPJPE为30.4 mm，明显优于其他模型。这为驾驶姿态估计提供了新的数据支持，在姿态识别方面具有优异的模型性能，从而为姿态定量评估提供了潜力。为汽车人机交互的广泛应用奠定了基础，具有良好的工程应用前景。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Transactions on Emerging Telecommunications Technologies TELECOMMUNICATIONS-

CiteScore

8.90

自引率

13.90%

发文量

249

期刊介绍： ransactions on Emerging Telecommunications Technologies (ETT), formerly known as European Transactions on Telecommunications (ETT), has the following aims: - to attract cutting-edge publications from leading researchers and research groups around the world - to become a highly cited source of timely research findings in emerging fields of telecommunications - to limit revision and publication cycles to a few months and thus significantly increase attractiveness to publish - to become the leading journal for publishing the latest developments in telecommunications