{"title":"以对象为中心的动作策略的机器人多模态嵌入属性学习","authors":"Xiaohan Zhang, Saeid Amiri, Jivko Sinapov, Jesse Thomason, Peter Stone, Shiqi Zhang","doi":"10.1007/s10514-023-10098-5","DOIUrl":null,"url":null,"abstract":"<div><p>Robots frequently need to perceive object attributes, such as <span>red</span>, <span>heavy</span>, and <span>empty</span>, using multimodal exploratory behaviors, such as <i>look</i>, <i>lift</i>, and <i>shake</i>. One possible way for robots to do so is to learn a classifier for each perceivable attribute given an exploratory behavior. Once the attribute classifiers are learned, they can be used by robots to select actions and identify attributes of new objects, answering questions, such as “<i>Is this object</i> <span>red</span> <i> and</i> <span>empty</span> ?” In this article, we introduce a robot interactive perception problem, called <b>M</b>ultimodal <b>E</b>mbodied <b>A</b>ttribute <b>L</b>earning (<span>meal</span>), and explore solutions to this new problem. Under different assumptions, there are two classes of <span>meal</span> problems. <span>offline-meal</span> problems are defined in this article as learning attribute classifiers from pre-collected data, and sequencing actions towards attribute identification under the challenging trade-off between information gains and exploration action costs. For this purpose, we introduce <b>M</b>ixed <b>O</b>bservability <b>R</b>obot <b>C</b>ontrol (<span>morc</span>), an algorithm for <span>offline-meal</span> problems, that dynamically constructs both fully and partially observable components of the state for multimodal attribute identification of objects. We further investigate a more challenging class of <span>meal</span> problems, called <span>online-meal</span>, where the robot assumes no pre-collected data, and works on both attribute classification and attribute identification at the same time. Based on <span>morc</span>, we develop an algorithm called <b>I</b>nformation-<b>T</b>heoretic <b>R</b>eward <b>S</b>haping (<span>morc</span>-<span>itrs</span>) that actively addresses the trade-off between exploration and exploitation in <span>online-meal</span> problems. <span>morc</span> and <span>morc</span>-<span>itrs</span> are evaluated in comparison with competitive <span>meal</span> baselines, and results demonstrate the superiority of our methods in learning efficiency and identification accuracy.</p></div>","PeriodicalId":55409,"journal":{"name":"Autonomous Robots","volume":"47 5","pages":"505 - 528"},"PeriodicalIF":3.7000,"publicationDate":"2023-03-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Multimodal embodied attribute learning by robots for object-centric action policies\",\"authors\":\"Xiaohan Zhang, Saeid Amiri, Jivko Sinapov, Jesse Thomason, Peter Stone, Shiqi Zhang\",\"doi\":\"10.1007/s10514-023-10098-5\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Robots frequently need to perceive object attributes, such as <span>red</span>, <span>heavy</span>, and <span>empty</span>, using multimodal exploratory behaviors, such as <i>look</i>, <i>lift</i>, and <i>shake</i>. One possible way for robots to do so is to learn a classifier for each perceivable attribute given an exploratory behavior. Once the attribute classifiers are learned, they can be used by robots to select actions and identify attributes of new objects, answering questions, such as “<i>Is this object</i> <span>red</span> <i> and</i> <span>empty</span> ?” In this article, we introduce a robot interactive perception problem, called <b>M</b>ultimodal <b>E</b>mbodied <b>A</b>ttribute <b>L</b>earning (<span>meal</span>), and explore solutions to this new problem. Under different assumptions, there are two classes of <span>meal</span> problems. <span>offline-meal</span> problems are defined in this article as learning attribute classifiers from pre-collected data, and sequencing actions towards attribute identification under the challenging trade-off between information gains and exploration action costs. For this purpose, we introduce <b>M</b>ixed <b>O</b>bservability <b>R</b>obot <b>C</b>ontrol (<span>morc</span>), an algorithm for <span>offline-meal</span> problems, that dynamically constructs both fully and partially observable components of the state for multimodal attribute identification of objects. We further investigate a more challenging class of <span>meal</span> problems, called <span>online-meal</span>, where the robot assumes no pre-collected data, and works on both attribute classification and attribute identification at the same time. Based on <span>morc</span>, we develop an algorithm called <b>I</b>nformation-<b>T</b>heoretic <b>R</b>eward <b>S</b>haping (<span>morc</span>-<span>itrs</span>) that actively addresses the trade-off between exploration and exploitation in <span>online-meal</span> problems. <span>morc</span> and <span>morc</span>-<span>itrs</span> are evaluated in comparison with competitive <span>meal</span> baselines, and results demonstrate the superiority of our methods in learning efficiency and identification accuracy.</p></div>\",\"PeriodicalId\":55409,\"journal\":{\"name\":\"Autonomous Robots\",\"volume\":\"47 5\",\"pages\":\"505 - 528\"},\"PeriodicalIF\":3.7000,\"publicationDate\":\"2023-03-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Autonomous Robots\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s10514-023-10098-5\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Autonomous Robots","FirstCategoryId":"94","ListUrlMain":"https://link.springer.com/article/10.1007/s10514-023-10098-5","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Multimodal embodied attribute learning by robots for object-centric action policies
Robots frequently need to perceive object attributes, such as red, heavy, and empty, using multimodal exploratory behaviors, such as look, lift, and shake. One possible way for robots to do so is to learn a classifier for each perceivable attribute given an exploratory behavior. Once the attribute classifiers are learned, they can be used by robots to select actions and identify attributes of new objects, answering questions, such as “Is this objectred andempty ?” In this article, we introduce a robot interactive perception problem, called Multimodal Embodied Attribute Learning (meal), and explore solutions to this new problem. Under different assumptions, there are two classes of meal problems. offline-meal problems are defined in this article as learning attribute classifiers from pre-collected data, and sequencing actions towards attribute identification under the challenging trade-off between information gains and exploration action costs. For this purpose, we introduce Mixed Observability Robot Control (morc), an algorithm for offline-meal problems, that dynamically constructs both fully and partially observable components of the state for multimodal attribute identification of objects. We further investigate a more challenging class of meal problems, called online-meal, where the robot assumes no pre-collected data, and works on both attribute classification and attribute identification at the same time. Based on morc, we develop an algorithm called Information-Theoretic Reward Shaping (morc-itrs) that actively addresses the trade-off between exploration and exploitation in online-meal problems. morc and morc-itrs are evaluated in comparison with competitive meal baselines, and results demonstrate the superiority of our methods in learning efficiency and identification accuracy.
期刊介绍:
Autonomous Robots reports on the theory and applications of robotic systems capable of some degree of self-sufficiency. It features papers that include performance data on actual robots in the real world. Coverage includes: control of autonomous robots · real-time vision · autonomous wheeled and tracked vehicles · legged vehicles · computational architectures for autonomous systems · distributed architectures for learning, control and adaptation · studies of autonomous robot systems · sensor fusion · theory of autonomous systems · terrain mapping and recognition · self-calibration and self-repair for robots · self-reproducing intelligent structures · genetic algorithms as models for robot development.
The focus is on the ability to move and be self-sufficient, not on whether the system is an imitation of biology. Of course, biological models for robotic systems are of major interest to the journal since living systems are prototypes for autonomous behavior.