{"title":"基于多注意DeepLab的球面相机图像环境识别","authors":"Yuta Nishida, Li Guangxu, Huimin Lu, Tohru Kamiya","doi":"10.23919/ICCAS55662.2022.10003689","DOIUrl":null,"url":null,"abstract":"Electric wheelchair is an easy-to-operate means of transportation that does not require physical strength. With the number of electric wheelchair users increasing in recent years, the increase in traffic accidents becomes a problem. Therefore, by developing an autonomous electric wheelchair, it is expected that the risk of accidents will be reduced and the convenience of the electric wheelchair will be improved. Environment recognition is indispensable for the development of autonomous electric wheelchairs. We propose a semantic segmentation method for recognizing 16 objects in traffic environment. This paper examines the improvement of problems such as the high price of autonomous electric wheelchairs due to the increase in the number of sensors used, which has been a concern in related research. Therefore, we use panoramic images acquired by a spherical camera as input data, and extern the Multi-Attention Deep Lab algorithms fitting for the recognition of distorted images. A new CNN model is constructed using Deep Lab v3+, scSE Block, Pairwise Self-Attention, and Joint Pyramid Up-sampling. We conducted a recognition experiment using images taken on campus and verified its effectiveness. (Comparing to DeepLab v3+, IoU and Dice showed a 3.5% and 3.6% improvement in accuracy, respectively.)","PeriodicalId":129856,"journal":{"name":"2022 22nd International Conference on Control, Automation and Systems (ICCAS)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Environment Recognition from Spherical Camera Images Based on Multi-Attention DeepLab\",\"authors\":\"Yuta Nishida, Li Guangxu, Huimin Lu, Tohru Kamiya\",\"doi\":\"10.23919/ICCAS55662.2022.10003689\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Electric wheelchair is an easy-to-operate means of transportation that does not require physical strength. With the number of electric wheelchair users increasing in recent years, the increase in traffic accidents becomes a problem. Therefore, by developing an autonomous electric wheelchair, it is expected that the risk of accidents will be reduced and the convenience of the electric wheelchair will be improved. Environment recognition is indispensable for the development of autonomous electric wheelchairs. We propose a semantic segmentation method for recognizing 16 objects in traffic environment. This paper examines the improvement of problems such as the high price of autonomous electric wheelchairs due to the increase in the number of sensors used, which has been a concern in related research. Therefore, we use panoramic images acquired by a spherical camera as input data, and extern the Multi-Attention Deep Lab algorithms fitting for the recognition of distorted images. A new CNN model is constructed using Deep Lab v3+, scSE Block, Pairwise Self-Attention, and Joint Pyramid Up-sampling. We conducted a recognition experiment using images taken on campus and verified its effectiveness. (Comparing to DeepLab v3+, IoU and Dice showed a 3.5% and 3.6% improvement in accuracy, respectively.)\",\"PeriodicalId\":129856,\"journal\":{\"name\":\"2022 22nd International Conference on Control, Automation and Systems (ICCAS)\",\"volume\":\"23 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 22nd International Conference on Control, Automation and Systems (ICCAS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.23919/ICCAS55662.2022.10003689\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 22nd International Conference on Control, Automation and Systems (ICCAS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/ICCAS55662.2022.10003689","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
电动轮椅是一种不需要体力,易于操作的交通工具。近年来,随着电动轮椅使用者数量的增加,交通事故的增加成为一个问题。因此,通过开发自主电动轮椅,有望降低事故风险,提高电动轮椅的便利性。环境识别对于自主电动轮椅的发展是必不可少的。针对交通环境中的16个目标,提出了一种语义分割方法。本文探讨了由于传感器数量的增加而导致的自主电动轮椅价格过高等问题的改善,这是相关研究一直关注的问题。因此,我们使用球形相机获取的全景图像作为输入数据,并将Multi-Attention Deep Lab算法用于畸变图像的识别。使用Deep Lab v3+、scSE块、成对自注意和联合金字塔上采样构建了一个新的CNN模型。我们利用校园拍摄的图像进行了识别实验,验证了其有效性。(与DeepLab v3+相比,IoU和Dice的准确率分别提高了3.5%和3.6%。)
Environment Recognition from Spherical Camera Images Based on Multi-Attention DeepLab
Electric wheelchair is an easy-to-operate means of transportation that does not require physical strength. With the number of electric wheelchair users increasing in recent years, the increase in traffic accidents becomes a problem. Therefore, by developing an autonomous electric wheelchair, it is expected that the risk of accidents will be reduced and the convenience of the electric wheelchair will be improved. Environment recognition is indispensable for the development of autonomous electric wheelchairs. We propose a semantic segmentation method for recognizing 16 objects in traffic environment. This paper examines the improvement of problems such as the high price of autonomous electric wheelchairs due to the increase in the number of sensors used, which has been a concern in related research. Therefore, we use panoramic images acquired by a spherical camera as input data, and extern the Multi-Attention Deep Lab algorithms fitting for the recognition of distorted images. A new CNN model is constructed using Deep Lab v3+, scSE Block, Pairwise Self-Attention, and Joint Pyramid Up-sampling. We conducted a recognition experiment using images taken on campus and verified its effectiveness. (Comparing to DeepLab v3+, IoU and Dice showed a 3.5% and 3.6% improvement in accuracy, respectively.)