面向机器人任务规划的多层关注神经符号接地

2022 IEEE-RAS 21st International Conference on Humanoid Robots (Humanoids) Pub Date : 2022-11-28 DOI:10.1109/Humanoids53995.2022.10000074

Pinxin Lv, Li Ning, Hao Jiang, Yushuang Huang, Jing Liu, Zhao Wang

{"title":"面向机器人任务规划的多层关注神经符号接地","authors":"Pinxin Lv, Li Ning, Hao Jiang, Yushuang Huang, Jing Liu, Zhao Wang","doi":"10.1109/Humanoids53995.2022.10000074","DOIUrl":null,"url":null,"abstract":"The high-level symbolic representation has proven to be an effective expression for planning problems and is widely used in robot task planning. However, the grounding of symbol based on multimodal raw data in complex environment still remains a significant challenge. In this paper, we put forward a Multi-layer Attention Network for Multimodal Symbol Grounding (maMSG Net) where we combine the high level symbolic representation and multimodal perception effectively, improving the capability and accuracy of understanding complex environment and increasing the diversity of the symbol definition. Meanwhile, we introduce both the cross-modality attention and intra-modality attention in our neural network, which is demonstrated to improve the accuracy of symbol grounding. The maMSG Net takes multimodal raw data as input and estimates values of state symbols defined in given planning domain. We designed computer simulated experiments to evaluate the effectiveness of our method and verify its robustness against external interference.","PeriodicalId":180816,"journal":{"name":"2022 IEEE-RAS 21st International Conference on Humanoid Robots (Humanoids)","volume":"70 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Neural Symbol Grounding with Multi-Layer Attention for Robot Task Planning\",\"authors\":\"Pinxin Lv, Li Ning, Hao Jiang, Yushuang Huang, Jing Liu, Zhao Wang\",\"doi\":\"10.1109/Humanoids53995.2022.10000074\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The high-level symbolic representation has proven to be an effective expression for planning problems and is widely used in robot task planning. However, the grounding of symbol based on multimodal raw data in complex environment still remains a significant challenge. In this paper, we put forward a Multi-layer Attention Network for Multimodal Symbol Grounding (maMSG Net) where we combine the high level symbolic representation and multimodal perception effectively, improving the capability and accuracy of understanding complex environment and increasing the diversity of the symbol definition. Meanwhile, we introduce both the cross-modality attention and intra-modality attention in our neural network, which is demonstrated to improve the accuracy of symbol grounding. The maMSG Net takes multimodal raw data as input and estimates values of state symbols defined in given planning domain. We designed computer simulated experiments to evaluate the effectiveness of our method and verify its robustness against external interference.\",\"PeriodicalId\":180816,\"journal\":{\"name\":\"2022 IEEE-RAS 21st International Conference on Humanoid Robots (Humanoids)\",\"volume\":\"70 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE-RAS 21st International Conference on Humanoid Robots (Humanoids)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/Humanoids53995.2022.10000074\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE-RAS 21st International Conference on Humanoid Robots (Humanoids)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/Humanoids53995.2022.10000074","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

高级符号表示已被证明是一种有效的规划问题表达方式，在机器人任务规划中得到了广泛的应用。然而，复杂环境下基于多模态原始数据的符号接地仍然是一个重大挑战。本文提出了一种多模态符号接地的多层注意网络(maMSG Net)，将高层符号表示和多模态感知有效地结合起来，提高了理解复杂环境的能力和准确性，增加了符号定义的多样性。同时，我们在神经网络中引入了跨模态注意和模态内注意，证明了该方法可以提高符号定位的准确性。该网络以多模态原始数据作为输入，对给定规划域中定义的状态符号进行估计。我们设计了计算机模拟实验来评估我们的方法的有效性，并验证了它对外部干扰的鲁棒性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Neural Symbol Grounding with Multi-Layer Attention for Robot Task Planning

The high-level symbolic representation has proven to be an effective expression for planning problems and is widely used in robot task planning. However, the grounding of symbol based on multimodal raw data in complex environment still remains a significant challenge. In this paper, we put forward a Multi-layer Attention Network for Multimodal Symbol Grounding (maMSG Net) where we combine the high level symbolic representation and multimodal perception effectively, improving the capability and accuracy of understanding complex environment and increasing the diversity of the symbol definition. Meanwhile, we introduce both the cross-modality attention and intra-modality attention in our neural network, which is demonstrated to improve the accuracy of symbol grounding. The maMSG Net takes multimodal raw data as input and estimates values of state symbols defined in given planning domain. We designed computer simulated experiments to evaluate the effectiveness of our method and verify its robustness against external interference.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 IEEE-RAS 21st International Conference on Humanoid Robots (Humanoids)

自引率

0.00%

发文量