Pinxin Lv, Li Ning, Hao Jiang, Yushuang Huang, Jing Liu, Zhao Wang
{"title":"面向机器人任务规划的多层关注神经符号接地","authors":"Pinxin Lv, Li Ning, Hao Jiang, Yushuang Huang, Jing Liu, Zhao Wang","doi":"10.1109/Humanoids53995.2022.10000074","DOIUrl":null,"url":null,"abstract":"The high-level symbolic representation has proven to be an effective expression for planning problems and is widely used in robot task planning. However, the grounding of symbol based on multimodal raw data in complex environment still remains a significant challenge. In this paper, we put forward a Multi-layer Attention Network for Multimodal Symbol Grounding (maMSG Net) where we combine the high level symbolic representation and multimodal perception effectively, improving the capability and accuracy of understanding complex environment and increasing the diversity of the symbol definition. Meanwhile, we introduce both the cross-modality attention and intra-modality attention in our neural network, which is demonstrated to improve the accuracy of symbol grounding. The maMSG Net takes multimodal raw data as input and estimates values of state symbols defined in given planning domain. We designed computer simulated experiments to evaluate the effectiveness of our method and verify its robustness against external interference.","PeriodicalId":180816,"journal":{"name":"2022 IEEE-RAS 21st International Conference on Humanoid Robots (Humanoids)","volume":"70 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Neural Symbol Grounding with Multi-Layer Attention for Robot Task Planning\",\"authors\":\"Pinxin Lv, Li Ning, Hao Jiang, Yushuang Huang, Jing Liu, Zhao Wang\",\"doi\":\"10.1109/Humanoids53995.2022.10000074\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The high-level symbolic representation has proven to be an effective expression for planning problems and is widely used in robot task planning. However, the grounding of symbol based on multimodal raw data in complex environment still remains a significant challenge. In this paper, we put forward a Multi-layer Attention Network for Multimodal Symbol Grounding (maMSG Net) where we combine the high level symbolic representation and multimodal perception effectively, improving the capability and accuracy of understanding complex environment and increasing the diversity of the symbol definition. Meanwhile, we introduce both the cross-modality attention and intra-modality attention in our neural network, which is demonstrated to improve the accuracy of symbol grounding. The maMSG Net takes multimodal raw data as input and estimates values of state symbols defined in given planning domain. We designed computer simulated experiments to evaluate the effectiveness of our method and verify its robustness against external interference.\",\"PeriodicalId\":180816,\"journal\":{\"name\":\"2022 IEEE-RAS 21st International Conference on Humanoid Robots (Humanoids)\",\"volume\":\"70 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE-RAS 21st International Conference on Humanoid Robots (Humanoids)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/Humanoids53995.2022.10000074\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE-RAS 21st International Conference on Humanoid Robots (Humanoids)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/Humanoids53995.2022.10000074","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Neural Symbol Grounding with Multi-Layer Attention for Robot Task Planning
The high-level symbolic representation has proven to be an effective expression for planning problems and is widely used in robot task planning. However, the grounding of symbol based on multimodal raw data in complex environment still remains a significant challenge. In this paper, we put forward a Multi-layer Attention Network for Multimodal Symbol Grounding (maMSG Net) where we combine the high level symbolic representation and multimodal perception effectively, improving the capability and accuracy of understanding complex environment and increasing the diversity of the symbol definition. Meanwhile, we introduce both the cross-modality attention and intra-modality attention in our neural network, which is demonstrated to improve the accuracy of symbol grounding. The maMSG Net takes multimodal raw data as input and estimates values of state symbols defined in given planning domain. We designed computer simulated experiments to evaluate the effectiveness of our method and verify its robustness against external interference.