Attention-guided Soft Ranking Loss for Resource-constrained Head Pose Estimation

2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC) Pub Date : 2021-11-17 DOI:10.1109/IC-NIDC54101.2021.9660542

Wenqi Xu, Tangzheng Lian, Wei Liu, Kaili Zhao

{"title":"Attention-guided Soft Ranking Loss for Resource-constrained Head Pose Estimation","authors":"Wenqi Xu, Tangzheng Lian, Wei Liu, Kaili Zhao","doi":"10.1109/IC-NIDC54101.2021.9660542","DOIUrl":null,"url":null,"abstract":"This paper presents a novel model for head-pose estimation from a single image with a compact model size. Previous state-of-the-art methods often rely on large training models and converge slowly on standard GPUs. In this paper, we introduce attention-guided soft ranking loss that reduces the size of the state-of-the-art method while increasing its performance. Specifically, we design an attention module to encourage learning on salient features. In addition, we propose a pair-wise soft ranking loss that supervises the model with paired samples and penalizes incorrect ordering of head-pose prediction. Considering the lack of large-pose data, we also introduce a minority head-pose oversampling algorithm to balance the distribution of yaw, pitch, and roll angles. Experiments on BIWI and AFLW2000 datasets demonstrate that our approach significantly outperforms the state-of-the-art methods. Extensive ablation studies further validate the effectiveness and robustness of the design of our framework. Code will be made availablel.","PeriodicalId":264468,"journal":{"name":"2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IC-NIDC54101.2021.9660542","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

This paper presents a novel model for head-pose estimation from a single image with a compact model size. Previous state-of-the-art methods often rely on large training models and converge slowly on standard GPUs. In this paper, we introduce attention-guided soft ranking loss that reduces the size of the state-of-the-art method while increasing its performance. Specifically, we design an attention module to encourage learning on salient features. In addition, we propose a pair-wise soft ranking loss that supervises the model with paired samples and penalizes incorrect ordering of head-pose prediction. Considering the lack of large-pose data, we also introduce a minority head-pose oversampling algorithm to balance the distribution of yaw, pitch, and roll angles. Experiments on BIWI and AFLW2000 datasets demonstrate that our approach significantly outperforms the state-of-the-art methods. Extensive ablation studies further validate the effectiveness and robustness of the design of our framework. Code will be made availablel.

查看原文本刊更多论文

资源受限头部姿态估计的注意引导软排序损失

本文提出了一种基于单幅图像的头部姿态估计模型，该模型具有紧凑的模型尺寸。以前最先进的方法通常依赖于大型训练模型，并且在标准gpu上收敛缓慢。在本文中，我们引入了注意引导的软排名损失，减少了最先进的方法的大小，同时提高了其性能。具体来说，我们设计了一个注意力模块来鼓励对显著特征的学习。此外，我们提出了一种成对的软排序损失，它用成对的样本来监督模型，并惩罚错误的头姿预测顺序。考虑到缺乏大姿态数据，我们还引入了一种少数头姿态过采样算法来平衡偏航、俯仰角和滚转角的分布。在BIWI和AFLW2000数据集上的实验表明，我们的方法明显优于最先进的方法。广泛的消融研究进一步验证了我们的框架设计的有效性和鲁棒性。代码将提供。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC)

自引率

0.00%

发文量