Ryo Yoshino, Toshiaki Takano, Hiroki Tanaka, T. Taniguchi
{"title":"基于多模态层次Dirichlet过程的无监督对象分类主动探索","authors":"Ryo Yoshino, Toshiaki Takano, Hiroki Tanaka, T. Taniguchi","doi":"10.1109/IEEECONF49454.2021.9382781","DOIUrl":null,"url":null,"abstract":"This paper describes an effective active exploration method for multimodal object categorization using a multimodal hierarchical Dirichlet process (MHDP). MHDP is a type of multimodal latent variable models, e.g., multimodal latent Dirichlet allocation and multimodal variational autoencoder, that enables a robot to perform unsupervised multimodal object categorization on the basis of different types of sensor information. The goal of the active exploration is to reduce the number of actions executed to collect multimodal sensor information from a variety of objects to acquire knowledge on object categories. The active exploration method employing the information gain (IG) criterion for MHDP is described by extending the IG-based active perception method. Exploiting the submodular property of IG in MHDP, greedy and lazy greedy algorithms with a certain theoretical guarantee of performance are proposed. The effectiveness of the proposed method is evaluated in a robot experiment. Results show that the proposed active exploration method with the greedy algorithm works well, and it significantly reduces the step for exploration. Further, the performance of the lazy greedy algorithm is found to deteriorate at times, due to the estimation error in the IG, differently from that of active perception.","PeriodicalId":395378,"journal":{"name":"2021 IEEE/SICE International Symposium on System Integration (SII)","volume":"293 4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-01-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Active Exploration for Unsupervised Object Categorization Based on Multimodal Hierarchical Dirichlet Process\",\"authors\":\"Ryo Yoshino, Toshiaki Takano, Hiroki Tanaka, T. Taniguchi\",\"doi\":\"10.1109/IEEECONF49454.2021.9382781\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper describes an effective active exploration method for multimodal object categorization using a multimodal hierarchical Dirichlet process (MHDP). MHDP is a type of multimodal latent variable models, e.g., multimodal latent Dirichlet allocation and multimodal variational autoencoder, that enables a robot to perform unsupervised multimodal object categorization on the basis of different types of sensor information. The goal of the active exploration is to reduce the number of actions executed to collect multimodal sensor information from a variety of objects to acquire knowledge on object categories. The active exploration method employing the information gain (IG) criterion for MHDP is described by extending the IG-based active perception method. Exploiting the submodular property of IG in MHDP, greedy and lazy greedy algorithms with a certain theoretical guarantee of performance are proposed. The effectiveness of the proposed method is evaluated in a robot experiment. Results show that the proposed active exploration method with the greedy algorithm works well, and it significantly reduces the step for exploration. Further, the performance of the lazy greedy algorithm is found to deteriorate at times, due to the estimation error in the IG, differently from that of active perception.\",\"PeriodicalId\":395378,\"journal\":{\"name\":\"2021 IEEE/SICE International Symposium on System Integration (SII)\",\"volume\":\"293 4 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-01-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE/SICE International Symposium on System Integration (SII)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IEEECONF49454.2021.9382781\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE/SICE International Symposium on System Integration (SII)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IEEECONF49454.2021.9382781","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Active Exploration for Unsupervised Object Categorization Based on Multimodal Hierarchical Dirichlet Process
This paper describes an effective active exploration method for multimodal object categorization using a multimodal hierarchical Dirichlet process (MHDP). MHDP is a type of multimodal latent variable models, e.g., multimodal latent Dirichlet allocation and multimodal variational autoencoder, that enables a robot to perform unsupervised multimodal object categorization on the basis of different types of sensor information. The goal of the active exploration is to reduce the number of actions executed to collect multimodal sensor information from a variety of objects to acquire knowledge on object categories. The active exploration method employing the information gain (IG) criterion for MHDP is described by extending the IG-based active perception method. Exploiting the submodular property of IG in MHDP, greedy and lazy greedy algorithms with a certain theoretical guarantee of performance are proposed. The effectiveness of the proposed method is evaluated in a robot experiment. Results show that the proposed active exploration method with the greedy algorithm works well, and it significantly reduces the step for exploration. Further, the performance of the lazy greedy algorithm is found to deteriorate at times, due to the estimation error in the IG, differently from that of active perception.