Environment Recognition from Spherical Camera Images Based on Multi-Attention DeepLab

Yuta Nishida, Li Guangxu, Huimin Lu, Tohru Kamiya
{"title":"Environment Recognition from Spherical Camera Images Based on Multi-Attention DeepLab","authors":"Yuta Nishida, Li Guangxu, Huimin Lu, Tohru Kamiya","doi":"10.23919/ICCAS55662.2022.10003689","DOIUrl":null,"url":null,"abstract":"Electric wheelchair is an easy-to-operate means of transportation that does not require physical strength. With the number of electric wheelchair users increasing in recent years, the increase in traffic accidents becomes a problem. Therefore, by developing an autonomous electric wheelchair, it is expected that the risk of accidents will be reduced and the convenience of the electric wheelchair will be improved. Environment recognition is indispensable for the development of autonomous electric wheelchairs. We propose a semantic segmentation method for recognizing 16 objects in traffic environment. This paper examines the improvement of problems such as the high price of autonomous electric wheelchairs due to the increase in the number of sensors used, which has been a concern in related research. Therefore, we use panoramic images acquired by a spherical camera as input data, and extern the Multi-Attention Deep Lab algorithms fitting for the recognition of distorted images. A new CNN model is constructed using Deep Lab v3+, scSE Block, Pairwise Self-Attention, and Joint Pyramid Up-sampling. We conducted a recognition experiment using images taken on campus and verified its effectiveness. (Comparing to DeepLab v3+, IoU and Dice showed a 3.5% and 3.6% improvement in accuracy, respectively.)","PeriodicalId":129856,"journal":{"name":"2022 22nd International Conference on Control, Automation and Systems (ICCAS)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 22nd International Conference on Control, Automation and Systems (ICCAS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/ICCAS55662.2022.10003689","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Electric wheelchair is an easy-to-operate means of transportation that does not require physical strength. With the number of electric wheelchair users increasing in recent years, the increase in traffic accidents becomes a problem. Therefore, by developing an autonomous electric wheelchair, it is expected that the risk of accidents will be reduced and the convenience of the electric wheelchair will be improved. Environment recognition is indispensable for the development of autonomous electric wheelchairs. We propose a semantic segmentation method for recognizing 16 objects in traffic environment. This paper examines the improvement of problems such as the high price of autonomous electric wheelchairs due to the increase in the number of sensors used, which has been a concern in related research. Therefore, we use panoramic images acquired by a spherical camera as input data, and extern the Multi-Attention Deep Lab algorithms fitting for the recognition of distorted images. A new CNN model is constructed using Deep Lab v3+, scSE Block, Pairwise Self-Attention, and Joint Pyramid Up-sampling. We conducted a recognition experiment using images taken on campus and verified its effectiveness. (Comparing to DeepLab v3+, IoU and Dice showed a 3.5% and 3.6% improvement in accuracy, respectively.)
基于多注意DeepLab的球面相机图像环境识别
电动轮椅是一种不需要体力,易于操作的交通工具。近年来,随着电动轮椅使用者数量的增加,交通事故的增加成为一个问题。因此,通过开发自主电动轮椅,有望降低事故风险,提高电动轮椅的便利性。环境识别对于自主电动轮椅的发展是必不可少的。针对交通环境中的16个目标,提出了一种语义分割方法。本文探讨了由于传感器数量的增加而导致的自主电动轮椅价格过高等问题的改善,这是相关研究一直关注的问题。因此,我们使用球形相机获取的全景图像作为输入数据,并将Multi-Attention Deep Lab算法用于畸变图像的识别。使用Deep Lab v3+、scSE块、成对自注意和联合金字塔上采样构建了一个新的CNN模型。我们利用校园拍摄的图像进行了识别实验,验证了其有效性。(与DeepLab v3+相比,IoU和Dice的准确率分别提高了3.5%和3.6%。)
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信