{"title":"Synthetic Data Meets Real-World Challenges: Transfer Learning for Driver Pose Estimation Using RAMSIS","authors":"Junjie Gou, Xian Wu, Jianwang Shao","doi":"10.1002/ett.70240","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Driver posture recognition holds significant application value in intelligent driving and human-machine interaction. However, the difficulty in acquiring driving posture data and the high labeling costs result in large errors in posture recognition models, which severely limit the development of related algorithm research. This paper presents a driving posture transfer learning method based on the RAMSIS synthetic dataset, which can effectively reduce algorithm development costs and significantly improve model performance. First, we constructed a synthetic dataset based on RAMSIS, which includes images of 21 typical driving postures from 54 human body models and corresponding 3D keypoint ground truth labels. The data generation method efficiently acquires sample data, saving substantial labeling efforts, while also incorporating constraints from the automotive ergonomics environment to enhance the realism of the synthetic data—a crucial aspect in dataset construction. Next, we applied a pre-trained model to generate keypoint pseudo-labels for real vehicle driving images, which were then combined with the synthetic data to form a mixed dataset. Finally, we employed a fine-tuning strategy to perform transfer learning on the pre-trained model using the mixed dataset, and explored the impact of synthetic data proportion on model performance. Experimental results show that the proposed method achieves an optimal MPJPE of 30.4 mm, significantly outperforming other models. This provides new data support for driving posture estimation, with excellent model performance in posture recognition, thus offering potential for quantitative posture evaluation. It lays the foundation for broader applications in vehicle human-machine interaction and demonstrates promising engineering prospects.</p>\n </div>","PeriodicalId":23282,"journal":{"name":"Transactions on Emerging Telecommunications Technologies","volume":"36 10","pages":""},"PeriodicalIF":2.5000,"publicationDate":"2025-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Transactions on Emerging Telecommunications Technologies","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/ett.70240","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"TELECOMMUNICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Driver posture recognition holds significant application value in intelligent driving and human-machine interaction. However, the difficulty in acquiring driving posture data and the high labeling costs result in large errors in posture recognition models, which severely limit the development of related algorithm research. This paper presents a driving posture transfer learning method based on the RAMSIS synthetic dataset, which can effectively reduce algorithm development costs and significantly improve model performance. First, we constructed a synthetic dataset based on RAMSIS, which includes images of 21 typical driving postures from 54 human body models and corresponding 3D keypoint ground truth labels. The data generation method efficiently acquires sample data, saving substantial labeling efforts, while also incorporating constraints from the automotive ergonomics environment to enhance the realism of the synthetic data—a crucial aspect in dataset construction. Next, we applied a pre-trained model to generate keypoint pseudo-labels for real vehicle driving images, which were then combined with the synthetic data to form a mixed dataset. Finally, we employed a fine-tuning strategy to perform transfer learning on the pre-trained model using the mixed dataset, and explored the impact of synthetic data proportion on model performance. Experimental results show that the proposed method achieves an optimal MPJPE of 30.4 mm, significantly outperforming other models. This provides new data support for driving posture estimation, with excellent model performance in posture recognition, thus offering potential for quantitative posture evaluation. It lays the foundation for broader applications in vehicle human-machine interaction and demonstrates promising engineering prospects.
期刊介绍:
ransactions on Emerging Telecommunications Technologies (ETT), formerly known as European Transactions on Telecommunications (ETT), has the following aims:
- to attract cutting-edge publications from leading researchers and research groups around the world
- to become a highly cited source of timely research findings in emerging fields of telecommunications
- to limit revision and publication cycles to a few months and thus significantly increase attractiveness to publish
- to become the leading journal for publishing the latest developments in telecommunications