Deep learning and machine learning techniques for head pose estimation: a survey

IF 10.7 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Artificial Intelligence Review Pub Date : 2024-09-12 DOI:10.1007/s10462-024-10936-7

Redhwan Algabri, Ahmed Abdu, Sungon Lee

{"title":"Deep learning and machine learning techniques for head pose estimation: a survey","authors":"Redhwan Algabri, Ahmed Abdu, Sungon Lee","doi":"10.1007/s10462-024-10936-7","DOIUrl":null,"url":null,"abstract":"<div><p>Head pose estimation (HPE) has been extensively investigated over the past decade due to its wide range of applications across several domains of artificial intelligence (AI), resulting in progressive improvements in accuracy. The problem becomes more challenging when the application requires full-range angles, particularly in unconstrained environments, making HPE an active research topic. This paper presents a comprehensive survey of recent AI-based HPE tasks in digital images. We also propose a novel taxonomy based on the main steps to implement each method, broadly dividing these steps into eleven categories under four groups. Moreover, we provide the pros and cons of ten categories of the overall system. Finally, this survey sheds some light on the public datasets, available codes, and future research directions, aiding readers and aspiring researchers in identifying robust methods that exhibit a strong baseline within the subcategory for further exploration in this fascinating area. The review compared and analyzed 113 articles published between 2018 and 2024, distributing 70.5% deep learning, 24.1% machine learning, and 5.4% hybrid approaches. Furthermore, it included 101 articles related to datasets, definitions, and other elements for AI-based HPE systems published over the last two decades. To the best of our knowledge, this is the first paper that aims to survey HPE strategies based on artificial intelligence, with detailed explanations of the main steps to implement each method. A regularly updated project page is provided: (github).</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"57 10","pages":""},"PeriodicalIF":10.7000,"publicationDate":"2024-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-024-10936-7.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial Intelligence Review","FirstCategoryId":"94","ListUrlMain":"https://link.springer.com/article/10.1007/s10462-024-10936-7","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Head pose estimation (HPE) has been extensively investigated over the past decade due to its wide range of applications across several domains of artificial intelligence (AI), resulting in progressive improvements in accuracy. The problem becomes more challenging when the application requires full-range angles, particularly in unconstrained environments, making HPE an active research topic. This paper presents a comprehensive survey of recent AI-based HPE tasks in digital images. We also propose a novel taxonomy based on the main steps to implement each method, broadly dividing these steps into eleven categories under four groups. Moreover, we provide the pros and cons of ten categories of the overall system. Finally, this survey sheds some light on the public datasets, available codes, and future research directions, aiding readers and aspiring researchers in identifying robust methods that exhibit a strong baseline within the subcategory for further exploration in this fascinating area. The review compared and analyzed 113 articles published between 2018 and 2024, distributing 70.5% deep learning, 24.1% machine learning, and 5.4% hybrid approaches. Furthermore, it included 101 articles related to datasets, definitions, and other elements for AI-based HPE systems published over the last two decades. To the best of our knowledge, this is the first paper that aims to survey HPE strategies based on artificial intelligence, with detailed explanations of the main steps to implement each method. A regularly updated project page is provided: (github).

Abstract Image

查看原文本刊更多论文

用于头部姿态估计的深度学习和机器学习技术：一项调查

由于头部姿态估计（HPE）在人工智能（AI）多个领域的广泛应用，其准确性在过去十年间得到了广泛的研究。当应用需要全方位角度时，问题就变得更具挑战性，尤其是在无约束环境中，这使得 HPE 成为一个活跃的研究课题。本文全面介绍了近期基于人工智能的数字图像 HPE 任务。我们还根据实现每种方法的主要步骤提出了一种新的分类法，将这些步骤大致分为四组十一个类别。此外，我们还对整个系统的十个类别进行了利弊分析。最后，本调查报告对公共数据集、可用代码和未来研究方向作了一些说明，帮助读者和有志于此的研究人员确定在子类中表现出强大基线的稳健方法，以便在这一引人入胜的领域作进一步探索。该综述对比分析了 2018 年至 2024 年间发表的 113 篇文章，其中深度学习占 70.5%，机器学习占 24.1%，混合方法占 5.4%。此外，它还收录了过去二十年间发表的与基于人工智能的 HPE 系统的数据集、定义和其他要素相关的 101 篇文章。据我们所知，这是第一篇旨在调查基于人工智能的 HPE 策略的论文，其中详细解释了实施每种方法的主要步骤。我们提供了一个定期更新的项目页面：（github）。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Artificial Intelligence Review 工程技术-计算机：人工智能

CiteScore

22.00

自引率

3.30%

发文量

194

审稿时长

5.3 months

期刊介绍： Artificial Intelligence Review, a fully open access journal, publishes cutting-edge research in artificial intelligence and cognitive science. It features critical evaluations of applications, techniques, and algorithms, providing a platform for both researchers and application developers. The journal includes refereed survey and tutorial articles, along with reviews and commentary on significant developments in the field.