资源受限头部姿态估计的注意引导软排序损失

2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC) Pub Date : 2021-11-17 DOI:10.1109/IC-NIDC54101.2021.9660542

Wenqi Xu, Tangzheng Lian, Wei Liu, Kaili Zhao

{"title":"资源受限头部姿态估计的注意引导软排序损失","authors":"Wenqi Xu, Tangzheng Lian, Wei Liu, Kaili Zhao","doi":"10.1109/IC-NIDC54101.2021.9660542","DOIUrl":null,"url":null,"abstract":"This paper presents a novel model for head-pose estimation from a single image with a compact model size. Previous state-of-the-art methods often rely on large training models and converge slowly on standard GPUs. In this paper, we introduce attention-guided soft ranking loss that reduces the size of the state-of-the-art method while increasing its performance. Specifically, we design an attention module to encourage learning on salient features. In addition, we propose a pair-wise soft ranking loss that supervises the model with paired samples and penalizes incorrect ordering of head-pose prediction. Considering the lack of large-pose data, we also introduce a minority head-pose oversampling algorithm to balance the distribution of yaw, pitch, and roll angles. Experiments on BIWI and AFLW2000 datasets demonstrate that our approach significantly outperforms the state-of-the-art methods. Extensive ablation studies further validate the effectiveness and robustness of the design of our framework. Code will be made availablel.","PeriodicalId":264468,"journal":{"name":"2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Attention-guided Soft Ranking Loss for Resource-constrained Head Pose Estimation\",\"authors\":\"Wenqi Xu, Tangzheng Lian, Wei Liu, Kaili Zhao\",\"doi\":\"10.1109/IC-NIDC54101.2021.9660542\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents a novel model for head-pose estimation from a single image with a compact model size. Previous state-of-the-art methods often rely on large training models and converge slowly on standard GPUs. In this paper, we introduce attention-guided soft ranking loss that reduces the size of the state-of-the-art method while increasing its performance. Specifically, we design an attention module to encourage learning on salient features. In addition, we propose a pair-wise soft ranking loss that supervises the model with paired samples and penalizes incorrect ordering of head-pose prediction. Considering the lack of large-pose data, we also introduce a minority head-pose oversampling algorithm to balance the distribution of yaw, pitch, and roll angles. Experiments on BIWI and AFLW2000 datasets demonstrate that our approach significantly outperforms the state-of-the-art methods. Extensive ablation studies further validate the effectiveness and robustness of the design of our framework. Code will be made availablel.\",\"PeriodicalId\":264468,\"journal\":{\"name\":\"2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC)\",\"volume\":\"16 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-11-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IC-NIDC54101.2021.9660542\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IC-NIDC54101.2021.9660542","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

本文提出了一种基于单幅图像的头部姿态估计模型，该模型具有紧凑的模型尺寸。以前最先进的方法通常依赖于大型训练模型，并且在标准gpu上收敛缓慢。在本文中，我们引入了注意引导的软排名损失，减少了最先进的方法的大小，同时提高了其性能。具体来说，我们设计了一个注意力模块来鼓励对显著特征的学习。此外，我们提出了一种成对的软排序损失，它用成对的样本来监督模型，并惩罚错误的头姿预测顺序。考虑到缺乏大姿态数据，我们还引入了一种少数头姿态过采样算法来平衡偏航、俯仰角和滚转角的分布。在BIWI和AFLW2000数据集上的实验表明，我们的方法明显优于最先进的方法。广泛的消融研究进一步验证了我们的框架设计的有效性和鲁棒性。代码将提供。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Attention-guided Soft Ranking Loss for Resource-constrained Head Pose Estimation

This paper presents a novel model for head-pose estimation from a single image with a compact model size. Previous state-of-the-art methods often rely on large training models and converge slowly on standard GPUs. In this paper, we introduce attention-guided soft ranking loss that reduces the size of the state-of-the-art method while increasing its performance. Specifically, we design an attention module to encourage learning on salient features. In addition, we propose a pair-wise soft ranking loss that supervises the model with paired samples and penalizes incorrect ordering of head-pose prediction. Considering the lack of large-pose data, we also introduce a minority head-pose oversampling algorithm to balance the distribution of yaw, pitch, and roll angles. Experiments on BIWI and AFLW2000 datasets demonstrate that our approach significantly outperforms the state-of-the-art methods. Extensive ablation studies further validate the effectiveness and robustness of the design of our framework. Code will be made availablel.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC)

自引率

0.00%

发文量