Learning Saccadic Gaze Control via Motion Prediction
Per-Erik Forssén
Fourth Canadian Conference on Computer and Robot Vision (CRV '07), 2007-05-28
DOI: 10.1109/CRV.2007.42
Citations: 18
Abstract
This paper describes a system that autonomously learns to perform saccadic gaze control on a stereo pan-tilt unit. Instead of learning a direct map from image positions to a centering action, the system first learns a forward model that predicts how image features move in the visual field as the gaze is shifted. Gaze control can then be performed by searching for the action that best centers a feature in both the left and the right image. By attacking the problem in this way we are able to collect many training examples from each action, and thus learning converges much faster. The learning is performed using image features obtained from the scale invariant feature transform (SIFT), detected and matched before and after a saccade, and thus requires no special environment during the training stage. We demonstrate that our system stabilises after only 300 saccades, which is more than 100 times fewer than the best current approaches.
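The two-stage idea in the abstract — first fit a forward model of how a pan/tilt action moves features in the image, then pick the action whose predicted post-saccade positions best center a feature in both views — can be illustrated with a minimal sketch. Everything below is a hypothetical simplification for illustration: the linear forward model, the least-squares fit, and the grid search over candidate actions are assumptions, not the paper's actual learning method or parameterisation.

```python
import numpy as np

def fit_forward_model(actions, displacements):
    """Fit a forward model mapping a (pan, tilt) action to the image-plane
    displacement of a tracked feature.

    Illustrative assumption: a single linear model shared by both cameras,
    fitted by least squares from (action, observed displacement) pairs
    gathered over many saccades. In practice each matched SIFT feature in a
    saccade contributes one training pair, which is why many examples can be
    collected per action.
    """
    # Solves actions @ A ≈ displacements in the least-squares sense.
    A, _, _, _ = np.linalg.lstsq(actions, displacements, rcond=None)
    return A

def best_centering_action(A, feat_left, feat_right, center, candidates):
    """Search candidate actions for the one whose predicted post-saccade
    feature positions lie closest to the image centre in BOTH views.

    This inverts the forward model by exhaustive search over a candidate
    set (a grid here); the paper's search strategy may differ.
    """
    best_action, best_cost = None, np.inf
    for a in candidates:
        d = a @ A  # predicted feature displacement for this action
        cost = (np.linalg.norm(feat_left + d - center)
                + np.linalg.norm(feat_right + d - center))
        if cost < best_cost:
            best_action, best_cost = a, cost
    return best_action
```

A usage sketch: collect (action, displacement) pairs during random saccades, fit the model, then center a feature by evaluating a grid of candidate actions and executing the minimiser. The key design point the abstract makes is that the forward model is learned from passive observation of feature motion, so no inverse map from pixel error to motor command needs to be supervised directly.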