Multi-View Human Tracking and 3D Localization in Retail

Artificial Intelligence and Machine Learning Pub Date : 2022-07-23 DOI:10.5121/csit.2022.121214

Akash Jadhav

引用次数: 0

Abstract

In recent years, retail stores have seen traction in bringing online shopping experience to offline stores via autonomous checkouts. Autonomous checkouts is a computer vision-based technology that needs to understand three human elements within the store: who, where, and doing what. This paper addresses two of the three elements: who and where. It presents an approach to track and localize humans in a multi-view camera system. Traditional methods have limitations as they: (1) fail to overcome substantial occlusion of humans; (2) suffer a lengthy processing time; (3) require a planar homography constraint between camera frames; (4) suffer swapping of labels assigned to a human. The proposed method in this paper handles all the aforementioned limitations. The key idea is to use a hierarchical association model for tracking, which uses each human's clothing features, human pose orientation, and relative depth of joints, and runs at over 23fps.

查看原文本刊更多论文

零售业中的多视角人体跟踪和3D定位

近年来，零售商店已经看到了通过自动结账将在线购物体验带到线下商店的吸引力。自动结帐是一种基于计算机视觉的技术，需要了解商店中的三个人为因素:谁、在哪里、做什么。本文讨论了三个要素中的两个:谁和在哪里。提出了一种在多视点摄像机系统中对人进行跟踪和定位的方法。传统方法的局限性在于:(1)无法克服人类的大量遮挡;(2)处理时间长;(3)要求相机帧间的平面单应性约束;(4)忍受交换分配给人类的标签。本文提出的方法克服了上述所有限制。关键思想是使用分层关联模型进行跟踪，该模型利用每个人的服装特征、人体姿势方向和关节的相对深度，并以超过23fps的速度运行。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Artificial Intelligence and Machine Learning

自引率

0.00%

发文量