{"title":"资源受限头部姿态估计的注意引导软排序损失","authors":"Wenqi Xu, Tangzheng Lian, Wei Liu, Kaili Zhao","doi":"10.1109/IC-NIDC54101.2021.9660542","DOIUrl":null,"url":null,"abstract":"This paper presents a novel model for head-pose estimation from a single image with a compact model size. Previous state-of-the-art methods often rely on large training models and converge slowly on standard GPUs. In this paper, we introduce attention-guided soft ranking loss that reduces the size of the state-of-the-art method while increasing its performance. Specifically, we design an attention module to encourage learning on salient features. In addition, we propose a pair-wise soft ranking loss that supervises the model with paired samples and penalizes incorrect ordering of head-pose prediction. Considering the lack of large-pose data, we also introduce a minority head-pose oversampling algorithm to balance the distribution of yaw, pitch, and roll angles. Experiments on BIWI and AFLW2000 datasets demonstrate that our approach significantly outperforms the state-of-the-art methods. Extensive ablation studies further validate the effectiveness and robustness of the design of our framework. Code will be made availablel.","PeriodicalId":264468,"journal":{"name":"2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Attention-guided Soft Ranking Loss for Resource-constrained Head Pose Estimation\",\"authors\":\"Wenqi Xu, Tangzheng Lian, Wei Liu, Kaili Zhao\",\"doi\":\"10.1109/IC-NIDC54101.2021.9660542\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents a novel model for head-pose estimation from a single image with a compact model size. Previous state-of-the-art methods often rely on large training models and converge slowly on standard GPUs. In this paper, we introduce attention-guided soft ranking loss that reduces the size of the state-of-the-art method while increasing its performance. Specifically, we design an attention module to encourage learning on salient features. In addition, we propose a pair-wise soft ranking loss that supervises the model with paired samples and penalizes incorrect ordering of head-pose prediction. Considering the lack of large-pose data, we also introduce a minority head-pose oversampling algorithm to balance the distribution of yaw, pitch, and roll angles. Experiments on BIWI and AFLW2000 datasets demonstrate that our approach significantly outperforms the state-of-the-art methods. Extensive ablation studies further validate the effectiveness and robustness of the design of our framework. Code will be made availablel.\",\"PeriodicalId\":264468,\"journal\":{\"name\":\"2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC)\",\"volume\":\"16 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-11-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IC-NIDC54101.2021.9660542\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IC-NIDC54101.2021.9660542","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Attention-guided Soft Ranking Loss for Resource-constrained Head Pose Estimation
This paper presents a novel model for head-pose estimation from a single image with a compact model size. Previous state-of-the-art methods often rely on large training models and converge slowly on standard GPUs. In this paper, we introduce attention-guided soft ranking loss that reduces the size of the state-of-the-art method while increasing its performance. Specifically, we design an attention module to encourage learning on salient features. In addition, we propose a pair-wise soft ranking loss that supervises the model with paired samples and penalizes incorrect ordering of head-pose prediction. Considering the lack of large-pose data, we also introduce a minority head-pose oversampling algorithm to balance the distribution of yaw, pitch, and roll angles. Experiments on BIWI and AFLW2000 datasets demonstrate that our approach significantly outperforms the state-of-the-art methods. Extensive ablation studies further validate the effectiveness and robustness of the design of our framework. Code will be made availablel.