{"title":"Multistage attention network for human pose estimation","authors":"Jingyang Zhou, Guangzhao Wen, Yu Zhang, Xin Geng","doi":"10.1117/1.JEI.31.6.063001","DOIUrl":null,"url":null,"abstract":"Abstract. Human pose estimation is a fundamental yet challenging task in computer vision. Although many methods have achieved significant improvement, they are still insufficient for the fusion of feature maps at different stages, such as the stacked hourglass network (SHNet). The SHNet is a classic human pose estimation network that extracts multiscale features through stacked multistage downsampling and upsampling operations. We propose a multistage attention mechanism to fuse the multistage feature maps. Furthermore, we apply it in the SHNet to propose a multistage attention network (MANet). In the experiments, we demonstrated the effectiveness of MANet in human pose estimation on the common objects in context dataset and the MPII human pose dataset.","PeriodicalId":54843,"journal":{"name":"Journal of Electronic Imaging","volume":"15 1","pages":"063001 - 063001"},"PeriodicalIF":1.0000,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Electronic Imaging","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1117/1.JEI.31.6.063001","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
Abstract. Human pose estimation is a fundamental yet challenging task in computer vision. Although many methods have achieved significant improvement, they are still insufficient for the fusion of feature maps at different stages, such as the stacked hourglass network (SHNet). The SHNet is a classic human pose estimation network that extracts multiscale features through stacked multistage downsampling and upsampling operations. We propose a multistage attention mechanism to fuse the multistage feature maps. Furthermore, we apply it in the SHNet to propose a multistage attention network (MANet). In the experiments, we demonstrated the effectiveness of MANet in human pose estimation on the common objects in context dataset and the MPII human pose dataset.
期刊介绍:
The Journal of Electronic Imaging publishes peer-reviewed papers in all technology areas that make up the field of electronic imaging and are normally considered in the design, engineering, and applications of electronic imaging systems.