{"title":"Predicting occluded skeletal joints via tracking-based feature extraction","authors":"Khac-Anh Phu , Van-Dung Hoang , Van-Tuong-Lan Le","doi":"10.1016/j.neucom.2025.131004","DOIUrl":null,"url":null,"abstract":"<div><div>A rapidly growing research area in computer vision is the recognition of human poses, which imposes strong occlusion problems in subsequent tracking. Therefore, tracking mechanisms should contain algorithms for handling occlusions such that the skeletal data is continuous between frames. This paper introduces a new skeletal tracking approach where a deep-learning-based model for a Skeleton Feature Extractor is embedded into the tracking algorithm. This differs from the common tracking methods that, in the process of image feature extraction, consider feature extraction at joints and the spatial relation between them, thus making occlusion scenarios highly detectable. We further make a comparison with our model and MotioNet, which is a broadly applied model for 3D motion reconstruction. MotioNet can interpolate the missing joints based on the information both spatial and temporal. It, however, does not work when actual joints are occluded for some frames. Our model predicts the skeletal joints that are missing. Experiments on the JHMDB and Penn_Action dataset were meant to show that the method improves the accuracy of forecasting occluded skeletal joint positions by the same PCK metric.</div></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":"651 ","pages":"Article 131004"},"PeriodicalIF":5.5000,"publicationDate":"2025-07-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neurocomputing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0925231225016765","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
A rapidly growing research area in computer vision is the recognition of human poses, which imposes strong occlusion problems in subsequent tracking. Therefore, tracking mechanisms should contain algorithms for handling occlusions such that the skeletal data is continuous between frames. This paper introduces a new skeletal tracking approach where a deep-learning-based model for a Skeleton Feature Extractor is embedded into the tracking algorithm. This differs from the common tracking methods that, in the process of image feature extraction, consider feature extraction at joints and the spatial relation between them, thus making occlusion scenarios highly detectable. We further make a comparison with our model and MotioNet, which is a broadly applied model for 3D motion reconstruction. MotioNet can interpolate the missing joints based on the information both spatial and temporal. It, however, does not work when actual joints are occluded for some frames. Our model predicts the skeletal joints that are missing. Experiments on the JHMDB and Penn_Action dataset were meant to show that the method improves the accuracy of forecasting occluded skeletal joint positions by the same PCK metric.
期刊介绍:
Neurocomputing publishes articles describing recent fundamental contributions in the field of neurocomputing. Neurocomputing theory, practice and applications are the essential topics being covered.