Sven Richter, Johannes Beck, Sascha Wirges, C. Stiller
{"title":"基于立体视觉的语义证据网格映射","authors":"Sven Richter, Johannes Beck, Sascha Wirges, C. Stiller","doi":"10.1109/MFI49285.2020.9235217","DOIUrl":null,"url":null,"abstract":"Accurately estimating the current state of local traffic scenes is a crucial component of automated vehicles. The desired representation may include static and dynamic traffic participants, details on free space and drivability, but also information on the semantics. Multi-layer grid maps allow to include all these information in a common representation. In this work, we present an improved method to estimate a semantic evidential multi-layer grid map using depth from stereo vision paired with pixel-wise semantically annotated images. The error characteristics of the depth from stereo is explicitly modeled when transferring pixel labels from the image to the grid map space. We achieve accurate and dense mapping results by incorporating a disparity-based ground surface estimation in the inverse perspective mapping. The proposed method is validated on our experimental vehicle in challenging urban traffic scenarios.","PeriodicalId":446154,"journal":{"name":"2020 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems (MFI)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Semantic Evidential Grid Mapping based on Stereo Vision\",\"authors\":\"Sven Richter, Johannes Beck, Sascha Wirges, C. Stiller\",\"doi\":\"10.1109/MFI49285.2020.9235217\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Accurately estimating the current state of local traffic scenes is a crucial component of automated vehicles. The desired representation may include static and dynamic traffic participants, details on free space and drivability, but also information on the semantics. Multi-layer grid maps allow to include all these information in a common representation. In this work, we present an improved method to estimate a semantic evidential multi-layer grid map using depth from stereo vision paired with pixel-wise semantically annotated images. The error characteristics of the depth from stereo is explicitly modeled when transferring pixel labels from the image to the grid map space. We achieve accurate and dense mapping results by incorporating a disparity-based ground surface estimation in the inverse perspective mapping. The proposed method is validated on our experimental vehicle in challenging urban traffic scenarios.\",\"PeriodicalId\":446154,\"journal\":{\"name\":\"2020 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems (MFI)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-09-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems (MFI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/MFI49285.2020.9235217\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems (MFI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MFI49285.2020.9235217","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Semantic Evidential Grid Mapping based on Stereo Vision
Accurately estimating the current state of local traffic scenes is a crucial component of automated vehicles. The desired representation may include static and dynamic traffic participants, details on free space and drivability, but also information on the semantics. Multi-layer grid maps allow to include all these information in a common representation. In this work, we present an improved method to estimate a semantic evidential multi-layer grid map using depth from stereo vision paired with pixel-wise semantically annotated images. The error characteristics of the depth from stereo is explicitly modeled when transferring pixel labels from the image to the grid map space. We achieve accurate and dense mapping results by incorporating a disparity-based ground surface estimation in the inverse perspective mapping. The proposed method is validated on our experimental vehicle in challenging urban traffic scenarios.