MART: Motion-Aware Recurrent Neural Network for Robust Visual Tracking

2021 IEEE Winter Conference on Applications of Computer Vision (WACV) Pub Date : 2021-01-01 DOI:10.1109/WACV48630.2021.00061

Heng Fan, Haibin Ling

引用次数: 0

Abstract

We introduce MART, Motion-Aware Recurrent neural network (MA-RNN) for Tracking, by modeling robust long-term spatial-temporal representation. In particular, we propose a simple, yet effective context-aware displacement attention (CADA) module to capture target motion in videos. By seamlessly integrating CADA into RNN, the proposed MA-RNN can spatially align and aggregate temporal information guided by motion from frame to frame, leading to more effective representation that benefits a tracker from motion when handling occlusion, deformation, viewpoint change etc. Moreover, to deal with scale change, we present a monotonic bounding box regression (mBBR) approach that iteratively predicts regression offsets for target object under the guidance of intersection-over-union (IoU) score, guaranteeing non-decreasing accuracy. In extensive experiments on five benchmarks, including GOT-10k, LaSOT, TC-128, OTB-15 and VOT-19, our tracker MART consistently achieves state-of-the-art results and runs in real-time.

查看原文本刊更多论文

用于鲁棒视觉跟踪的运动感知递归神经网络

我们通过建模鲁棒的长期时空表征，引入运动感知递归神经网络(MA-RNN)用于跟踪。特别是，我们提出了一个简单而有效的上下文感知位移注意(CADA)模块来捕捉视频中的目标运动。通过将CADA无缝集成到RNN中，所提出的MA-RNN可以在帧与帧之间对运动引导的时间信息进行空间对齐和聚合，从而在处理遮挡、变形、视点变化等时更有效地表示运动，从而使跟踪器受益。针对尺度变化，提出了一种单调边界盒回归(mBBR)方法，该方法在IoU分数的指导下迭代预测目标物体的回归偏移量，保证了精度的不下降。在包括GOT-10k, LaSOT, TC-128, OTB-15和vote -19在内的五个基准的广泛实验中，我们的跟踪器MART始终实现最先进的结果并实时运行。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 IEEE Winter Conference on Applications of Computer Vision (WACV)

自引率

0.00%

发文量