歧义引导双分支语义分割

International Conference on Electronic Information Technology Pub Date : 2023-08-15 DOI:10.1117/12.2685757

Jin Cheng

{"title":"歧义引导双分支语义分割","authors":"Jin Cheng","doi":"10.1117/12.2685757","DOIUrl":null,"url":null,"abstract":"In order to improve the feature extraction process in the semantic segmentation network and clarify the operating logic of the model, this paper splits the discriminative features according to the pixel's own information and the inter-pixel correlation information. Using two branch networks, relational branch and information branch to focus on the extraction of their own features, and using the fully convolutional segmentation head and the fully-connected segmentation head respectively to make the features more in line with the requirements of the branch itself. In order to make full use of the information contained in the discrepancy between the two branches, a Discrepancy-guided Fusion module is proposed at the feature level, in where the difference information between the features is used as a guide to promote the fusion of the features of the two branches. At the prediction level, with the help of Point-wise Discrepancy Cross Entropy loss, the prediction difference is used to determine the network's attention to each pixel. The effectiveness of the method is verified by extensive experiments on the Cityscapes dataset.","PeriodicalId":305812,"journal":{"name":"International Conference on Electronic Information Technology","volume":"58 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-08-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Discrepancy-guided dual-branch semantic segmentation\",\"authors\":\"Jin Cheng\",\"doi\":\"10.1117/12.2685757\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In order to improve the feature extraction process in the semantic segmentation network and clarify the operating logic of the model, this paper splits the discriminative features according to the pixel's own information and the inter-pixel correlation information. Using two branch networks, relational branch and information branch to focus on the extraction of their own features, and using the fully convolutional segmentation head and the fully-connected segmentation head respectively to make the features more in line with the requirements of the branch itself. In order to make full use of the information contained in the discrepancy between the two branches, a Discrepancy-guided Fusion module is proposed at the feature level, in where the difference information between the features is used as a guide to promote the fusion of the features of the two branches. At the prediction level, with the help of Point-wise Discrepancy Cross Entropy loss, the prediction difference is used to determine the network's attention to each pixel. The effectiveness of the method is verified by extensive experiments on the Cityscapes dataset.\",\"PeriodicalId\":305812,\"journal\":{\"name\":\"International Conference on Electronic Information Technology\",\"volume\":\"58 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-08-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Conference on Electronic Information Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1117/12.2685757\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Electronic Information Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.2685757","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

为了改进语义分割网络中的特征提取过程，明确模型的运行逻辑，本文根据像素本身的信息和像素间的相关信息对判别特征进行分割。利用关系分支网络和信息分支网络两种分支网络，重点提取各自的特征，分别使用全卷积分割头和全连通分割头，使特征更符合分支本身的要求。为了充分利用两支路之间的差异所包含的信息，在特征层提出了一种差异引导融合模块，以特征之间的差异信息为导向，促进两支路特征的融合。在预测层面，借助逐点差异交叉熵损失(Point-wise difference Cross Entropy loss)，利用预测差值确定网络对每个像素的关注程度。在城市景观数据集上的大量实验验证了该方法的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Discrepancy-guided dual-branch semantic segmentation

In order to improve the feature extraction process in the semantic segmentation network and clarify the operating logic of the model, this paper splits the discriminative features according to the pixel's own information and the inter-pixel correlation information. Using two branch networks, relational branch and information branch to focus on the extraction of their own features, and using the fully convolutional segmentation head and the fully-connected segmentation head respectively to make the features more in line with the requirements of the branch itself. In order to make full use of the information contained in the discrepancy between the two branches, a Discrepancy-guided Fusion module is proposed at the feature level, in where the difference information between the features is used as a guide to promote the fusion of the features of the two branches. At the prediction level, with the help of Point-wise Discrepancy Cross Entropy loss, the prediction difference is used to determine the network's attention to each pixel. The effectiveness of the method is verified by extensive experiments on the Cityscapes dataset.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

International Conference on Electronic Information Technology

自引率

0.00%

发文量