结合神经网络的实例分割室内家庭场景识别

Amlan Basu, Keerati Kaewrak, L. Petropoulakis, G. Di Caterina, J. Soraghan
{"title":"结合神经网络的实例分割室内家庭场景识别","authors":"Amlan Basu, Keerati Kaewrak, L. Petropoulakis, G. Di Caterina, J. Soraghan","doi":"10.1109/AIC55036.2022.9848982","DOIUrl":null,"url":null,"abstract":"This work presents a technique for recognizing indoor home scenes by using object detection. The object detection task is achieved through pre-trained Mask-RCNN (Regional Convolutional Neural Network), whilst the scene recognition is performed through a Convolutional Neural Network (CNN). The output of the Mask-RCNN is fed in input to the CNN, as this provides the CNN with the information of objects detected in one scene. So, the CNN recognizes the scene by looking at the combination of objects detected. The CNN is trained using the various object detection outputs of Mask-RCNN. This helps the CNN learn about the various combinations of objects that a scene can have. The CNN is trained using 500 combinations of 5 different scenes (bathroom, bedroom, kitchen, living room, and dining room) of the indoor home generated by Mask-RCNN. The trained network was tested on 24,000 indoor home scene images. The final accuracy produced by the CNN is 97.14%.","PeriodicalId":433590,"journal":{"name":"2022 IEEE World Conference on Applied Intelligence and Computing (AIC)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Indoor Home Scene Recognition through Instance Segmentation Using a Combination of Neural Networks\",\"authors\":\"Amlan Basu, Keerati Kaewrak, L. Petropoulakis, G. Di Caterina, J. Soraghan\",\"doi\":\"10.1109/AIC55036.2022.9848982\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This work presents a technique for recognizing indoor home scenes by using object detection. The object detection task is achieved through pre-trained Mask-RCNN (Regional Convolutional Neural Network), whilst the scene recognition is performed through a Convolutional Neural Network (CNN). The output of the Mask-RCNN is fed in input to the CNN, as this provides the CNN with the information of objects detected in one scene. So, the CNN recognizes the scene by looking at the combination of objects detected. The CNN is trained using the various object detection outputs of Mask-RCNN. This helps the CNN learn about the various combinations of objects that a scene can have. The CNN is trained using 500 combinations of 5 different scenes (bathroom, bedroom, kitchen, living room, and dining room) of the indoor home generated by Mask-RCNN. The trained network was tested on 24,000 indoor home scene images. The final accuracy produced by the CNN is 97.14%.\",\"PeriodicalId\":433590,\"journal\":{\"name\":\"2022 IEEE World Conference on Applied Intelligence and Computing (AIC)\",\"volume\":\"35 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-06-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE World Conference on Applied Intelligence and Computing (AIC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/AIC55036.2022.9848982\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE World Conference on Applied Intelligence and Computing (AIC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AIC55036.2022.9848982","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

本文提出了一种利用物体检测识别室内家庭场景的技术。目标检测任务通过预训练的Mask-RCNN(区域卷积神经网络)完成,而场景识别则通过卷积神经网络(CNN)完成。Mask-RCNN的输出作为CNN的输入,因为这为CNN提供了在一个场景中检测到的物体的信息。因此,CNN通过观察检测到的物体组合来识别场景。CNN使用Mask-RCNN的各种目标检测输出进行训练。这有助于CNN学习场景中物体的各种组合。CNN使用Mask-RCNN生成的室内家庭5个不同场景(浴室、卧室、厨房、客厅和餐厅)的500组组合进行训练。训练后的网络在24000张室内家庭场景图像上进行了测试。CNN的最终准确率为97.14%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Indoor Home Scene Recognition through Instance Segmentation Using a Combination of Neural Networks
This work presents a technique for recognizing indoor home scenes by using object detection. The object detection task is achieved through pre-trained Mask-RCNN (Regional Convolutional Neural Network), whilst the scene recognition is performed through a Convolutional Neural Network (CNN). The output of the Mask-RCNN is fed in input to the CNN, as this provides the CNN with the information of objects detected in one scene. So, the CNN recognizes the scene by looking at the combination of objects detected. The CNN is trained using the various object detection outputs of Mask-RCNN. This helps the CNN learn about the various combinations of objects that a scene can have. The CNN is trained using 500 combinations of 5 different scenes (bathroom, bedroom, kitchen, living room, and dining room) of the indoor home generated by Mask-RCNN. The trained network was tested on 24,000 indoor home scene images. The final accuracy produced by the CNN is 97.14%.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信