结合神经网络的实例分割室内家庭场景识别

2022 IEEE World Conference on Applied Intelligence and Computing (AIC) Pub Date : 2022-06-17 DOI:10.1109/AIC55036.2022.9848982

Amlan Basu, Keerati Kaewrak, L. Petropoulakis, G. Di Caterina, J. Soraghan

{"title":"结合神经网络的实例分割室内家庭场景识别","authors":"Amlan Basu, Keerati Kaewrak, L. Petropoulakis, G. Di Caterina, J. Soraghan","doi":"10.1109/AIC55036.2022.9848982","DOIUrl":null,"url":null,"abstract":"This work presents a technique for recognizing indoor home scenes by using object detection. The object detection task is achieved through pre-trained Mask-RCNN (Regional Convolutional Neural Network), whilst the scene recognition is performed through a Convolutional Neural Network (CNN). The output of the Mask-RCNN is fed in input to the CNN, as this provides the CNN with the information of objects detected in one scene. So, the CNN recognizes the scene by looking at the combination of objects detected. The CNN is trained using the various object detection outputs of Mask-RCNN. This helps the CNN learn about the various combinations of objects that a scene can have. The CNN is trained using 500 combinations of 5 different scenes (bathroom, bedroom, kitchen, living room, and dining room) of the indoor home generated by Mask-RCNN. The trained network was tested on 24,000 indoor home scene images. The final accuracy produced by the CNN is 97.14%.","PeriodicalId":433590,"journal":{"name":"2022 IEEE World Conference on Applied Intelligence and Computing (AIC)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Indoor Home Scene Recognition through Instance Segmentation Using a Combination of Neural Networks\",\"authors\":\"Amlan Basu, Keerati Kaewrak, L. Petropoulakis, G. Di Caterina, J. Soraghan\",\"doi\":\"10.1109/AIC55036.2022.9848982\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This work presents a technique for recognizing indoor home scenes by using object detection. The object detection task is achieved through pre-trained Mask-RCNN (Regional Convolutional Neural Network), whilst the scene recognition is performed through a Convolutional Neural Network (CNN). The output of the Mask-RCNN is fed in input to the CNN, as this provides the CNN with the information of objects detected in one scene. So, the CNN recognizes the scene by looking at the combination of objects detected. The CNN is trained using the various object detection outputs of Mask-RCNN. This helps the CNN learn about the various combinations of objects that a scene can have. The CNN is trained using 500 combinations of 5 different scenes (bathroom, bedroom, kitchen, living room, and dining room) of the indoor home generated by Mask-RCNN. The trained network was tested on 24,000 indoor home scene images. The final accuracy produced by the CNN is 97.14%.\",\"PeriodicalId\":433590,\"journal\":{\"name\":\"2022 IEEE World Conference on Applied Intelligence and Computing (AIC)\",\"volume\":\"35 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-06-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE World Conference on Applied Intelligence and Computing (AIC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/AIC55036.2022.9848982\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE World Conference on Applied Intelligence and Computing (AIC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AIC55036.2022.9848982","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

本文提出了一种利用物体检测识别室内家庭场景的技术。目标检测任务通过预训练的Mask-RCNN(区域卷积神经网络)完成，而场景识别则通过卷积神经网络(CNN)完成。Mask-RCNN的输出作为CNN的输入，因为这为CNN提供了在一个场景中检测到的物体的信息。因此，CNN通过观察检测到的物体组合来识别场景。CNN使用Mask-RCNN的各种目标检测输出进行训练。这有助于CNN学习场景中物体的各种组合。CNN使用Mask-RCNN生成的室内家庭5个不同场景(浴室、卧室、厨房、客厅和餐厅)的500组组合进行训练。训练后的网络在24000张室内家庭场景图像上进行了测试。CNN的最终准确率为97.14%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Indoor Home Scene Recognition through Instance Segmentation Using a Combination of Neural Networks

This work presents a technique for recognizing indoor home scenes by using object detection. The object detection task is achieved through pre-trained Mask-RCNN (Regional Convolutional Neural Network), whilst the scene recognition is performed through a Convolutional Neural Network (CNN). The output of the Mask-RCNN is fed in input to the CNN, as this provides the CNN with the information of objects detected in one scene. So, the CNN recognizes the scene by looking at the combination of objects detected. The CNN is trained using the various object detection outputs of Mask-RCNN. This helps the CNN learn about the various combinations of objects that a scene can have. The CNN is trained using 500 combinations of 5 different scenes (bathroom, bedroom, kitchen, living room, and dining room) of the indoor home generated by Mask-RCNN. The trained network was tested on 24,000 indoor home scene images. The final accuracy produced by the CNN is 97.14%.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 IEEE World Conference on Applied Intelligence and Computing (AIC)

自引率

0.00%

发文量