Ecological Active Vision: Four Bioinspired Principles to Integrate Bottom–Up and Adaptive Top–Down Attention Tested With a Simple Camera-Arm Robot

IEEE Transactions on Autonomous Mental Development Pub Date : 2015-03-01 DOI:10.1109/TAMD.2014.2341351

D. Ognibene, G. Baldassarre

{"title":"Ecological Active Vision: Four Bioinspired Principles to Integrate Bottom–Up and Adaptive Top–Down Attention Tested With a Simple Camera-Arm Robot","authors":"D. Ognibene, G. Baldassarre","doi":"10.1109/TAMD.2014.2341351","DOIUrl":null,"url":null,"abstract":"Vision gives primates a wealth of information useful to manipulate the environment, but at the same time it can easily overwhelm their computational resources. Active vision is a key solution found by nature to solve this problem: a limited fovea actively displaced in space to collect only relevant information. Here we highlight that in ecological conditions this solution encounters four problems: 1) the agent needs to learn where to look based on its goals; 2) manipulation causes learning feedback in areas of space possibly outside the attention focus; 3) good visual actions are needed to guide manipulation actions, but only these can generate learning feedback; and 4) a limited fovea causes aliasing problems. We then propose a computational architecture (“BITPIC”) to overcome the four problems, integrating four bioinspired key ingredients: 1) reinforcement-learning fovea-based top-down attention; 2) a strong vision-manipulation coupling; 3) bottom-up periphery-based attention; and 4) a novel action-oriented memory. The system is tested with a simple simulated camera-arm robot solving a class of search-and-reach tasks involving color-blob “objects.” The results show that the architecture solves the problems, and hence the tasks, very efficiently, and highlight how the architecture principles can contribute to a full exploitation of the advantages of active vision in ecological conditions.","PeriodicalId":49193,"journal":{"name":"IEEE Transactions on Autonomous Mental Development","volume":"7 1","pages":"3-25"},"PeriodicalIF":0.0000,"publicationDate":"2015-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/TAMD.2014.2341351","citationCount":"67","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Autonomous Mental Development","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TAMD.2014.2341351","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 67

Abstract

Vision gives primates a wealth of information useful to manipulate the environment, but at the same time it can easily overwhelm their computational resources. Active vision is a key solution found by nature to solve this problem: a limited fovea actively displaced in space to collect only relevant information. Here we highlight that in ecological conditions this solution encounters four problems: 1) the agent needs to learn where to look based on its goals; 2) manipulation causes learning feedback in areas of space possibly outside the attention focus; 3) good visual actions are needed to guide manipulation actions, but only these can generate learning feedback; and 4) a limited fovea causes aliasing problems. We then propose a computational architecture (“BITPIC”) to overcome the four problems, integrating four bioinspired key ingredients: 1) reinforcement-learning fovea-based top-down attention; 2) a strong vision-manipulation coupling; 3) bottom-up periphery-based attention; and 4) a novel action-oriented memory. The system is tested with a simple simulated camera-arm robot solving a class of search-and-reach tasks involving color-blob “objects.” The results show that the architecture solves the problems, and hence the tasks, very efficiently, and highlight how the architecture principles can contribute to a full exploitation of the advantages of active vision in ecological conditions.

查看原文本刊更多论文

生态主动视觉:整合自底向上和自适应自顶向下注意力的四种生物启发原则，用一个简单的相机臂机器人测试

视觉为灵长类动物提供了大量有用的信息来操纵环境，但同时也很容易压倒它们的计算资源。主动视觉是自然界找到的解决这个问题的关键方案:一个有限的中央凹在空间上主动移位，只收集相关的信息。这里我们强调，在生态条件下，该解决方案遇到四个问题:1)智能体需要根据其目标学习到哪里;2)操作在可能在注意焦点之外的空间区域引起学习反馈;3)需要良好的视觉动作来指导操作动作，但只有这样才能产生学习反馈;有限的中央凹导致混叠问题。然后，我们提出了一个计算架构(“BITPIC”)来克服这四个问题，整合了四个生物启发的关键成分:1)基于强化学习的自上而下的中央凹注意力;2)强烈的视觉操纵耦合;3)自下而上的外围注意;4)一种新颖的动作导向记忆。该系统通过一个简单的模拟摄像臂机器人进行测试，该机器人解决了一类涉及彩色斑点“物体”的搜索和到达任务。结果表明，该架构非常有效地解决了问题，从而解决了任务，并突出了架构原则如何有助于充分利用生态条件下主动视觉的优势。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE Transactions on Autonomous Mental Development COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE-ROBOTICS

自引率

0.00%

发文量

审稿时长

3 months