{"title":"基于改进DAFormer的无监督域自适应语义分割","authors":"Hao Liu, Jingchun Piao","doi":"10.1145/3573942.3574046","DOIUrl":null,"url":null,"abstract":"To overcome the intensive of manual labeling tasks at the pixel level required for semantic segmentation under traditional supervised learning, an Unsupervised Domain Adaptive for Semantic Segmentation (UDASS) method based on DAFormer improved model is proposed. This model adapted the Max Mean Discrepancy (MMD) method in the regenerated Hilbert space to help the alignment of the feature distribution, the soft paste strategy to retain the partially covered image blocks to help the model to accelerate convergence, the non-convex consistency regularization at the output level to enhance the robustness of the network, and the spatial pyramid pooling framework and the decoder with large window attention collaboration to improve its consistency. The proposed method was evaluated on the public dataset, and obtained the of 2.4% mIoU improvement in GTA5-to-Cityscapes and 1.1% mIoU in SYSTHIA-to-Cityscapes, respectively, which proved that this method was effective for DAFormer improvement.","PeriodicalId":103293,"journal":{"name":"Proceedings of the 2022 5th International Conference on Artificial Intelligence and Pattern Recognition","volume":"28 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Unsupervised Domain Adaptive Semantic Segmentation Based on Improved DAFormer\",\"authors\":\"Hao Liu, Jingchun Piao\",\"doi\":\"10.1145/3573942.3574046\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"To overcome the intensive of manual labeling tasks at the pixel level required for semantic segmentation under traditional supervised learning, an Unsupervised Domain Adaptive for Semantic Segmentation (UDASS) method based on DAFormer improved model is proposed. This model adapted the Max Mean Discrepancy (MMD) method in the regenerated Hilbert space to help the alignment of the feature distribution, the soft paste strategy to retain the partially covered image blocks to help the model to accelerate convergence, the non-convex consistency regularization at the output level to enhance the robustness of the network, and the spatial pyramid pooling framework and the decoder with large window attention collaboration to improve its consistency. The proposed method was evaluated on the public dataset, and obtained the of 2.4% mIoU improvement in GTA5-to-Cityscapes and 1.1% mIoU in SYSTHIA-to-Cityscapes, respectively, which proved that this method was effective for DAFormer improvement.\",\"PeriodicalId\":103293,\"journal\":{\"name\":\"Proceedings of the 2022 5th International Conference on Artificial Intelligence and Pattern Recognition\",\"volume\":\"28 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-09-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2022 5th International Conference on Artificial Intelligence and Pattern Recognition\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3573942.3574046\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2022 5th International Conference on Artificial Intelligence and Pattern Recognition","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3573942.3574046","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Unsupervised Domain Adaptive Semantic Segmentation Based on Improved DAFormer
To overcome the intensive of manual labeling tasks at the pixel level required for semantic segmentation under traditional supervised learning, an Unsupervised Domain Adaptive for Semantic Segmentation (UDASS) method based on DAFormer improved model is proposed. This model adapted the Max Mean Discrepancy (MMD) method in the regenerated Hilbert space to help the alignment of the feature distribution, the soft paste strategy to retain the partially covered image blocks to help the model to accelerate convergence, the non-convex consistency regularization at the output level to enhance the robustness of the network, and the spatial pyramid pooling framework and the decoder with large window attention collaboration to improve its consistency. The proposed method was evaluated on the public dataset, and obtained the of 2.4% mIoU improvement in GTA5-to-Cityscapes and 1.1% mIoU in SYSTHIA-to-Cityscapes, respectively, which proved that this method was effective for DAFormer improvement.