用于实时语义分割的增强训练感知双组

2022 IEEE International Conference on Multimedia and Expo Workshops (ICMEW) Pub Date : 2022-07-18 DOI:10.1109/ICMEW56448.2022.9859497

Chih-Chung Hsu, Cheih Lee, Shen-Chieh Tai, Yun Jiang

{"title":"用于实时语义分割的增强训练感知双组","authors":"Chih-Chung Hsu, Cheih Lee, Shen-Chieh Tai, Yun Jiang","doi":"10.1109/ICMEW56448.2022.9859497","DOIUrl":null,"url":null,"abstract":"Semantic segmentation techniques have become an attractive research field for autonomous driving. However, it is well-known that the computational complexity of the conventional semantic segmentation is relatively high compared to other computer vision applications. Fast inference of the semantic segmentation for autonomous driving is highly desired. A lightweight convolutional neural network, the Bilateral segmentation network (BiSeNet), is adopted in this paper. However, the performance of the conventional BiSeNet is not so reliable that the model quantization could lead to an even worse result. Therefore, we proposed an augmented training strategy to significantly improve the semantic segmentation task’s performance. First, heavy data augmentation, including CutOut, deformable distortion, and step-wise hard example mining, is used in the training phase to boost the performance of the feature representation learning. Second, the L1 and L2 norm regularization are also used in the model training to prevent the possible overfitting issue. Then, the post-quantization is performed on the TensorFlow-Lite model to significantly reduce the model size and computational complexity. The comprehensive experiments verified that the proposed method is effective and efficient for autonomous driving applications over other state-of-the-art methods.","PeriodicalId":106759,"journal":{"name":"2022 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)","volume":"38 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Augmented-Training-Aware Bisenet for Real-Time Semantic Segmentation\",\"authors\":\"Chih-Chung Hsu, Cheih Lee, Shen-Chieh Tai, Yun Jiang\",\"doi\":\"10.1109/ICMEW56448.2022.9859497\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Semantic segmentation techniques have become an attractive research field for autonomous driving. However, it is well-known that the computational complexity of the conventional semantic segmentation is relatively high compared to other computer vision applications. Fast inference of the semantic segmentation for autonomous driving is highly desired. A lightweight convolutional neural network, the Bilateral segmentation network (BiSeNet), is adopted in this paper. However, the performance of the conventional BiSeNet is not so reliable that the model quantization could lead to an even worse result. Therefore, we proposed an augmented training strategy to significantly improve the semantic segmentation task’s performance. First, heavy data augmentation, including CutOut, deformable distortion, and step-wise hard example mining, is used in the training phase to boost the performance of the feature representation learning. Second, the L1 and L2 norm regularization are also used in the model training to prevent the possible overfitting issue. Then, the post-quantization is performed on the TensorFlow-Lite model to significantly reduce the model size and computational complexity. The comprehensive experiments verified that the proposed method is effective and efficient for autonomous driving applications over other state-of-the-art methods.\",\"PeriodicalId\":106759,\"journal\":{\"name\":\"2022 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)\",\"volume\":\"38 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-07-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICMEW56448.2022.9859497\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMEW56448.2022.9859497","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

语义分割技术已成为自动驾驶领域的研究热点。然而，众所周知，与其他计算机视觉应用相比，传统语义分割的计算复杂度相对较高。自动驾驶中语义分割的快速推理是迫切需要的。本文采用了一种轻量级的卷积神经网络——双侧分割网络(BiSeNet)。然而，传统的BiSeNet的性能不太可靠，模型量化可能导致更差的结果。因此，我们提出了一种增强训练策略，以显著提高语义分割任务的性能。首先，在训练阶段使用大量数据增强，包括CutOut、可变形变形和逐步硬示例挖掘，以提高特征表示学习的性能。其次，在模型训练中也使用L1和L2范数正则化，以防止可能的过拟合问题。然后，对TensorFlow-Lite模型进行后量化，显著减小模型尺寸和计算复杂度。综合实验验证了该方法在自动驾驶应用中优于其他先进方法的有效性和效率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Augmented-Training-Aware Bisenet for Real-Time Semantic Segmentation

Semantic segmentation techniques have become an attractive research field for autonomous driving. However, it is well-known that the computational complexity of the conventional semantic segmentation is relatively high compared to other computer vision applications. Fast inference of the semantic segmentation for autonomous driving is highly desired. A lightweight convolutional neural network, the Bilateral segmentation network (BiSeNet), is adopted in this paper. However, the performance of the conventional BiSeNet is not so reliable that the model quantization could lead to an even worse result. Therefore, we proposed an augmented training strategy to significantly improve the semantic segmentation task’s performance. First, heavy data augmentation, including CutOut, deformable distortion, and step-wise hard example mining, is used in the training phase to boost the performance of the feature representation learning. Second, the L1 and L2 norm regularization are also used in the model training to prevent the possible overfitting issue. Then, the post-quantization is performed on the TensorFlow-Lite model to significantly reduce the model size and computational complexity. The comprehensive experiments verified that the proposed method is effective and efficient for autonomous driving applications over other state-of-the-art methods.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)

自引率

0.00%

发文量