Yuan Liang, Zixiang Xu, S. Rasti, Soumyabrata Dev, A. Campbell
{"title":"On the Use of a Semantic Segmentation Micro-Service in AR Devices for UI Placement","authors":"Yuan Liang, Zixiang Xu, S. Rasti, Soumyabrata Dev, A. Campbell","doi":"10.1109/GEM56474.2022.10017522","DOIUrl":null,"url":null,"abstract":"Augmented Reality (AR) technology is now increasingly applied in various fields, which can bring an unprecedented immersive experience and rich interaction to the application field. However, complex interactions and informative interfaces impose a long learning curve and burden on users. Making the AR experience more intelligent to reduce redundant operations is one solution to enhance the user experience. One potential research direction is seamlessly combining the two fields of machine learning and AR. This paper proposes using semantic segmentation to assist automatic information placement in AR using a case study within precision agriculture as an example. The precise location of the crop area in the user view is determined by semantic segmentation, which helps to place information in the AR environment automatically. The dataset used for machine learning model construction consists of 242 farmland images. Four semantic segmentation techniques are proposed and bench-marked against each other. The results show that the Attention U-Net deep neural network has the highest recognition accuracy, reaching 91.9%. An AR automatic information placement prototype using Attention U-Net has been developed to run on tablets utilising a micro-service approach. This work demonstrates how AR user interfaces could be placed correctly within the real world, which traditionally has been an understudied area of research within AR and is essential for future AR games and Enterprise applications. As such, this solution has potential usage in all areas of AR application.","PeriodicalId":200252,"journal":{"name":"2022 IEEE Games, Entertainment, Media Conference (GEM)","volume":"50 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE Games, Entertainment, Media Conference (GEM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/GEM56474.2022.10017522","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Augmented Reality (AR) technology is now increasingly applied in various fields, which can bring an unprecedented immersive experience and rich interaction to the application field. However, complex interactions and informative interfaces impose a long learning curve and burden on users. Making the AR experience more intelligent to reduce redundant operations is one solution to enhance the user experience. One potential research direction is seamlessly combining the two fields of machine learning and AR. This paper proposes using semantic segmentation to assist automatic information placement in AR using a case study within precision agriculture as an example. The precise location of the crop area in the user view is determined by semantic segmentation, which helps to place information in the AR environment automatically. The dataset used for machine learning model construction consists of 242 farmland images. Four semantic segmentation techniques are proposed and bench-marked against each other. The results show that the Attention U-Net deep neural network has the highest recognition accuracy, reaching 91.9%. An AR automatic information placement prototype using Attention U-Net has been developed to run on tablets utilising a micro-service approach. This work demonstrates how AR user interfaces could be placed correctly within the real world, which traditionally has been an understudied area of research within AR and is essential for future AR games and Enterprise applications. As such, this solution has potential usage in all areas of AR application.