Video Semantic Segmentation for Intersection by Domain Adaptation

2023 International Technical Conference on Circuits/Systems, Computers, and Communications (ITC-CSCC) Pub Date : 2023-06-25 DOI:10.1109/ITC-CSCC58803.2023.10212842

Shota Suzuki, Takafumi Katayama, Tian Song, T. Shimamoto

{"title":"Video Semantic Segmentation for Intersection by Domain Adaptation","authors":"Shota Suzuki, Takafumi Katayama, Tian Song, T. Shimamoto","doi":"10.1109/ITC-CSCC58803.2023.10212842","DOIUrl":null,"url":null,"abstract":"In recent years, semantic segmentation, which is one of the image recognition technologies for automatic driving, has attracted attention. Semantic segmentation can perform high accuracy object detection by discriminating classes with pixel level precision. However, training data for segmentation usually requires extensive manual labor because a corresponding label has to be assigned to each pixel. Currently, using a computer graphics (CG) dataset makes it easier to create supervised data. However, when inference images are real-world, intersection over union (IoU) is greatly reduced by the variation of the domain. In this work, an unsupervised domain adaptation (UDA) training method is proposed to achieve efficient intersection video segmentation models for each target domain. The inference is performed for intersection images with relatively complex situations. The simulation results show that the proposed algorithm for semantic segmentation of intersection videos achieve IoU scores comparable to supervised learning for classes that are prominently displayed on the screen.","PeriodicalId":220939,"journal":{"name":"2023 International Technical Conference on Circuits/Systems, Computers, and Communications (ITC-CSCC)","volume":"361 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Technical Conference on Circuits/Systems, Computers, and Communications (ITC-CSCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITC-CSCC58803.2023.10212842","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

In recent years, semantic segmentation, which is one of the image recognition technologies for automatic driving, has attracted attention. Semantic segmentation can perform high accuracy object detection by discriminating classes with pixel level precision. However, training data for segmentation usually requires extensive manual labor because a corresponding label has to be assigned to each pixel. Currently, using a computer graphics (CG) dataset makes it easier to create supervised data. However, when inference images are real-world, intersection over union (IoU) is greatly reduced by the variation of the domain. In this work, an unsupervised domain adaptation (UDA) training method is proposed to achieve efficient intersection video segmentation models for each target domain. The inference is performed for intersection images with relatively complex situations. The simulation results show that the proposed algorithm for semantic segmentation of intersection videos achieve IoU scores comparable to supervised learning for classes that are prominently displayed on the screen.

查看原文本刊更多论文

基于域自适应的交叉视频语义分割

语义分割作为自动驾驶图像识别技术之一，近年来备受关注。语义分割能够以像素级的精度进行分类，实现高精度的目标检测。然而，用于分割的训练数据通常需要大量的手工劳动，因为必须为每个像素分配相应的标签。目前，使用计算机图形(CG)数据集可以更容易地创建监督数据。然而，当推理图像是真实世界时，域的变化大大减少了交集比并(IoU)。本文提出了一种无监督域自适应(UDA)训练方法，以实现每个目标域的高效交叉视频分割模型。对情况相对复杂的交叉图像进行推理。仿真结果表明，本文提出的交叉视频语义分割算法对于屏幕上突出显示的课程，IoU分数与监督学习相当。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2023 International Technical Conference on Circuits/Systems, Computers, and Communications (ITC-CSCC)

自引率

0.00%

发文量