Sven Richter, Johannes Beck, Sascha Wirges, C. Stiller
{"title":"Semantic Evidential Grid Mapping based on Stereo Vision","authors":"Sven Richter, Johannes Beck, Sascha Wirges, C. Stiller","doi":"10.1109/MFI49285.2020.9235217","DOIUrl":null,"url":null,"abstract":"Accurately estimating the current state of local traffic scenes is a crucial component of automated vehicles. The desired representation may include static and dynamic traffic participants, details on free space and drivability, but also information on the semantics. Multi-layer grid maps allow to include all these information in a common representation. In this work, we present an improved method to estimate a semantic evidential multi-layer grid map using depth from stereo vision paired with pixel-wise semantically annotated images. The error characteristics of the depth from stereo is explicitly modeled when transferring pixel labels from the image to the grid map space. We achieve accurate and dense mapping results by incorporating a disparity-based ground surface estimation in the inverse perspective mapping. The proposed method is validated on our experimental vehicle in challenging urban traffic scenarios.","PeriodicalId":446154,"journal":{"name":"2020 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems (MFI)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems (MFI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MFI49285.2020.9235217","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
Accurately estimating the current state of local traffic scenes is a crucial component of automated vehicles. The desired representation may include static and dynamic traffic participants, details on free space and drivability, but also information on the semantics. Multi-layer grid maps allow to include all these information in a common representation. In this work, we present an improved method to estimate a semantic evidential multi-layer grid map using depth from stereo vision paired with pixel-wise semantically annotated images. The error characteristics of the depth from stereo is explicitly modeled when transferring pixel labels from the image to the grid map space. We achieve accurate and dense mapping results by incorporating a disparity-based ground surface estimation in the inverse perspective mapping. The proposed method is validated on our experimental vehicle in challenging urban traffic scenarios.