{"title":"A Multi-view 3D Human Pose Estimation Algorithm Based On Positional Attention","authors":"Dandan Sun, ChangAn Zhang","doi":"10.1109/ICSP54964.2022.9778615","DOIUrl":null,"url":null,"abstract":"With the development of CNNs, the human pose estimation research has made great progress, but there is still a problem: the relationships of the human each joint location are not well exploited in previous CNNs-based methods. Considering the order of global spatial information and human body location information, we propose a multi-view 3D human pose estimation algorithm based on position attention. In 2D detection stage, position coding is adopted to rebuild the image in the global space position relation. The attention mechanism can model the relationship between various channels and capture feature maps the dependencies between the horizontal and vertical direction, and the details are mined from the feature location relationship to generate high-quality feature maps. In the last stage of feature extraction, adjacent view features are used to enhance the spatial expression ability of feature images, so as to better solve occlusion and oblique view. Experiments on the Human3.6M data set show that when using Resnet-50 as the backbone network and 256×256 of the image size, the average joint error of our algorithm is reduced to 25.2mm, which reaching the competitive result.","PeriodicalId":363766,"journal":{"name":"2022 7th International Conference on Intelligent Computing and Signal Processing (ICSP)","volume":"231 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 7th International Conference on Intelligent Computing and Signal Processing (ICSP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSP54964.2022.9778615","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
With the development of CNNs, the human pose estimation research has made great progress, but there is still a problem: the relationships of the human each joint location are not well exploited in previous CNNs-based methods. Considering the order of global spatial information and human body location information, we propose a multi-view 3D human pose estimation algorithm based on position attention. In 2D detection stage, position coding is adopted to rebuild the image in the global space position relation. The attention mechanism can model the relationship between various channels and capture feature maps the dependencies between the horizontal and vertical direction, and the details are mined from the feature location relationship to generate high-quality feature maps. In the last stage of feature extraction, adjacent view features are used to enhance the spatial expression ability of feature images, so as to better solve occlusion and oblique view. Experiments on the Human3.6M data set show that when using Resnet-50 as the backbone network and 256×256 of the image size, the average joint error of our algorithm is reduced to 25.2mm, which reaching the competitive result.