基于物体向量的机器人室内场景分类算法

Ximing Fan, Bo Zhu, Xiang Gao, Bin Wang, Cong Wang, Guozheng Xu
{"title":"基于物体向量的机器人室内场景分类算法","authors":"Ximing Fan, Bo Zhu, Xiang Gao, Bin Wang, Cong Wang, Guozheng Xu","doi":"10.1145/3512826.3512846","DOIUrl":null,"url":null,"abstract":"At present, many indoor scene classification algorithms are based on abstract feature descriptions and lack semantic interpretation for classification results. For robot applications, these classification methods fail to reuse prior knowledge of environmental objects, resulting in a waste of computing resources, and are not conducive to interaction. In addition, a large number of existing methods have not been verified for robot applications because training and test are performed on the same dataset. Here, intermediate semantic structure, called object vector, is proposed to describe the scene. The recognition confidence, category, and quantity of typical objects, which are related to scene concepts, can be captured by this structure. Based on the object vector, the semantic prototype models of scene categories are obtained using the prototype learning method, and the corresponding prototype confidence vector is generated for scene prediction. The object vector and prototype confidence vector are concatenated to form a complete descriptor. Applying sparse random forest learning, scene conceptual models are formed and used in real scene classification. In the experiment, the training and test sets were selected from sets with different styles. The average accuracy on the MIT Indoor 67, VPC, and Tsotsos_home datasets is higher than that of fine-tuned VGG. In particular, the accuracy on the VPC dataset improved by more than 5%. Moreover, the algorithm performance is evaluated on an actual robot platform.","PeriodicalId":270295,"journal":{"name":"Proceedings of the 2022 3rd International Conference on Artificial Intelligence in Electronics Engineering","volume":"13 11","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-01-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Indoor Scene Classification Algorithm Based on an Object Vector for Robot Applications\",\"authors\":\"Ximing Fan, Bo Zhu, Xiang Gao, Bin Wang, Cong Wang, Guozheng Xu\",\"doi\":\"10.1145/3512826.3512846\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"At present, many indoor scene classification algorithms are based on abstract feature descriptions and lack semantic interpretation for classification results. For robot applications, these classification methods fail to reuse prior knowledge of environmental objects, resulting in a waste of computing resources, and are not conducive to interaction. In addition, a large number of existing methods have not been verified for robot applications because training and test are performed on the same dataset. Here, intermediate semantic structure, called object vector, is proposed to describe the scene. The recognition confidence, category, and quantity of typical objects, which are related to scene concepts, can be captured by this structure. Based on the object vector, the semantic prototype models of scene categories are obtained using the prototype learning method, and the corresponding prototype confidence vector is generated for scene prediction. The object vector and prototype confidence vector are concatenated to form a complete descriptor. Applying sparse random forest learning, scene conceptual models are formed and used in real scene classification. In the experiment, the training and test sets were selected from sets with different styles. The average accuracy on the MIT Indoor 67, VPC, and Tsotsos_home datasets is higher than that of fine-tuned VGG. In particular, the accuracy on the VPC dataset improved by more than 5%. Moreover, the algorithm performance is evaluated on an actual robot platform.\",\"PeriodicalId\":270295,\"journal\":{\"name\":\"Proceedings of the 2022 3rd International Conference on Artificial Intelligence in Electronics Engineering\",\"volume\":\"13 11\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-01-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2022 3rd International Conference on Artificial Intelligence in Electronics Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3512826.3512846\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2022 3rd International Conference on Artificial Intelligence in Electronics Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3512826.3512846","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

目前,许多室内场景分类算法都是基于抽象的特征描述,缺乏对分类结果的语义解释。对于机器人应用,这些分类方法不能重用环境对象的先验知识,造成计算资源的浪费,不利于交互。此外,由于训练和测试是在相同的数据集上进行的,因此大量现有的方法尚未对机器人应用进行验证。本文提出了一种中间语义结构——对象向量来描述场景。该结构可以捕获与场景概念相关的典型对象的识别置信度、类别和数量。在对象向量的基础上,利用原型学习方法获得场景类别的语义原型模型,并生成相应的原型置信向量进行场景预测。将对象向量和原型置信向量连接起来,形成一个完整的描述符。应用稀疏随机森林学习,形成场景概念模型并应用于真实场景分类。在实验中,训练集和测试集从不同风格的集合中选择。在MIT Indoor 67、VPC和Tsotsos_home数据集上的平均准确率高于经过微调的VGG。特别是在VPC数据集上,准确率提高了5%以上。并在实际机器人平台上对算法的性能进行了评价。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Indoor Scene Classification Algorithm Based on an Object Vector for Robot Applications
At present, many indoor scene classification algorithms are based on abstract feature descriptions and lack semantic interpretation for classification results. For robot applications, these classification methods fail to reuse prior knowledge of environmental objects, resulting in a waste of computing resources, and are not conducive to interaction. In addition, a large number of existing methods have not been verified for robot applications because training and test are performed on the same dataset. Here, intermediate semantic structure, called object vector, is proposed to describe the scene. The recognition confidence, category, and quantity of typical objects, which are related to scene concepts, can be captured by this structure. Based on the object vector, the semantic prototype models of scene categories are obtained using the prototype learning method, and the corresponding prototype confidence vector is generated for scene prediction. The object vector and prototype confidence vector are concatenated to form a complete descriptor. Applying sparse random forest learning, scene conceptual models are formed and used in real scene classification. In the experiment, the training and test sets were selected from sets with different styles. The average accuracy on the MIT Indoor 67, VPC, and Tsotsos_home datasets is higher than that of fine-tuned VGG. In particular, the accuracy on the VPC dataset improved by more than 5%. Moreover, the algorithm performance is evaluated on an actual robot platform.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信