{"title":"歧义引导双分支语义分割","authors":"Jin Cheng","doi":"10.1117/12.2685757","DOIUrl":null,"url":null,"abstract":"In order to improve the feature extraction process in the semantic segmentation network and clarify the operating logic of the model, this paper splits the discriminative features according to the pixel's own information and the inter-pixel correlation information. Using two branch networks, relational branch and information branch to focus on the extraction of their own features, and using the fully convolutional segmentation head and the fully-connected segmentation head respectively to make the features more in line with the requirements of the branch itself. In order to make full use of the information contained in the discrepancy between the two branches, a Discrepancy-guided Fusion module is proposed at the feature level, in where the difference information between the features is used as a guide to promote the fusion of the features of the two branches. At the prediction level, with the help of Point-wise Discrepancy Cross Entropy loss, the prediction difference is used to determine the network's attention to each pixel. The effectiveness of the method is verified by extensive experiments on the Cityscapes dataset.","PeriodicalId":305812,"journal":{"name":"International Conference on Electronic Information Technology","volume":"58 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-08-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Discrepancy-guided dual-branch semantic segmentation\",\"authors\":\"Jin Cheng\",\"doi\":\"10.1117/12.2685757\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In order to improve the feature extraction process in the semantic segmentation network and clarify the operating logic of the model, this paper splits the discriminative features according to the pixel's own information and the inter-pixel correlation information. Using two branch networks, relational branch and information branch to focus on the extraction of their own features, and using the fully convolutional segmentation head and the fully-connected segmentation head respectively to make the features more in line with the requirements of the branch itself. In order to make full use of the information contained in the discrepancy between the two branches, a Discrepancy-guided Fusion module is proposed at the feature level, in where the difference information between the features is used as a guide to promote the fusion of the features of the two branches. At the prediction level, with the help of Point-wise Discrepancy Cross Entropy loss, the prediction difference is used to determine the network's attention to each pixel. The effectiveness of the method is verified by extensive experiments on the Cityscapes dataset.\",\"PeriodicalId\":305812,\"journal\":{\"name\":\"International Conference on Electronic Information Technology\",\"volume\":\"58 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-08-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Conference on Electronic Information Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1117/12.2685757\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Electronic Information Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.2685757","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
In order to improve the feature extraction process in the semantic segmentation network and clarify the operating logic of the model, this paper splits the discriminative features according to the pixel's own information and the inter-pixel correlation information. Using two branch networks, relational branch and information branch to focus on the extraction of their own features, and using the fully convolutional segmentation head and the fully-connected segmentation head respectively to make the features more in line with the requirements of the branch itself. In order to make full use of the information contained in the discrepancy between the two branches, a Discrepancy-guided Fusion module is proposed at the feature level, in where the difference information between the features is used as a guide to promote the fusion of the features of the two branches. At the prediction level, with the help of Point-wise Discrepancy Cross Entropy loss, the prediction difference is used to determine the network's attention to each pixel. The effectiveness of the method is verified by extensive experiments on the Cityscapes dataset.