{"title":"TransUNet for Cross-Domain Semantic Segmentation of Urban Scenery","authors":"Wei Yuen Teh, I. K. T. Tan","doi":"10.1109/ISPACS57703.2022.10082854","DOIUrl":null,"url":null,"abstract":"TransUNet is a hybrid architecture that combines a transformer-based encoder with a CNN-based UNet. Originally introduced for semantic segmentation of medical images, we show in our work that TransUNet can be successfully applied to urban scenery datasets commonly used for developing autonomous driving systems. We also explore the performance characteristics of training on multi-domain data from the real world and a simulator, and show that using simulated images to augment a live dataset can improve segmentation performance. Code will be made available at https://github.com/weiyuen.","PeriodicalId":410603,"journal":{"name":"2022 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS)","volume":"53 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISPACS57703.2022.10082854","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
TransUNet is a hybrid architecture that combines a transformer-based encoder with a CNN-based UNet. Originally introduced for semantic segmentation of medical images, we show in our work that TransUNet can be successfully applied to urban scenery datasets commonly used for developing autonomous driving systems. We also explore the performance characteristics of training on multi-domain data from the real world and a simulator, and show that using simulated images to augment a live dataset can improve segmentation performance. Code will be made available at https://github.com/weiyuen.