{"title":"SpaGAN:用于建立图像地图泛化的空间感知生成对抗网络","authors":"Zhiyong Zhou, Cheng Fu, Robert Weibel","doi":"10.1016/j.jag.2024.104236","DOIUrl":null,"url":null,"abstract":"Building generalization is an essential task in generating multi-scale topographic maps. The progress of deep learning offers a new paradigm to overcome the coordination challenges faced by conventional building generalization algorithms. Some studies have confirmed the feasibility of several original semantic segmentation networks, such as U-Net and its variants and the conditional generative adversarial network (cGAN), for building generalization in image maps. However, they suffer from critical deformation effects, especially for large and geometrically complex buildings. Since learning building generalization essentially means modeling the subtle transformation of building footprints across scales, we argue that the spatial awareness of a neural network, for instance, regarding building size and shape, is crucial to effective learning. Thus, we propose a spatially-aware generative adversarial network, SpaGAN. It takes a representative cGAN, pix2pix, as the backbone, and modifies two modules: In the U-Net-based generator, an atrous spatial pyramid pooling (ASPP) module replaces the conventional convolutional module to extract multi-scale features of buildings of varying sizes and shapes; in the PatchGAN-based discriminator, a signed distance map (SDM) module is used to capture the fine-grained shape difference for discrimination. The proposed network was comprehensively evaluated with a synthetic and a real-world dataset. The results demonstrate that SpaGAN outperforms existing baseline models (U-Net, ResU-Net, pix2pix) for building generalization, particularly in the real-world dataset. The new model can achieve more reasonable aggregation, simplification, and squaring generalization operators.","PeriodicalId":50341,"journal":{"name":"International Journal of Applied Earth Observation and Geoinformation","volume":"11 1","pages":""},"PeriodicalIF":7.5000,"publicationDate":"2024-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"SpaGAN: A spatially-aware generative adversarial network for building generalization in image maps\",\"authors\":\"Zhiyong Zhou, Cheng Fu, Robert Weibel\",\"doi\":\"10.1016/j.jag.2024.104236\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Building generalization is an essential task in generating multi-scale topographic maps. The progress of deep learning offers a new paradigm to overcome the coordination challenges faced by conventional building generalization algorithms. Some studies have confirmed the feasibility of several original semantic segmentation networks, such as U-Net and its variants and the conditional generative adversarial network (cGAN), for building generalization in image maps. However, they suffer from critical deformation effects, especially for large and geometrically complex buildings. Since learning building generalization essentially means modeling the subtle transformation of building footprints across scales, we argue that the spatial awareness of a neural network, for instance, regarding building size and shape, is crucial to effective learning. Thus, we propose a spatially-aware generative adversarial network, SpaGAN. It takes a representative cGAN, pix2pix, as the backbone, and modifies two modules: In the U-Net-based generator, an atrous spatial pyramid pooling (ASPP) module replaces the conventional convolutional module to extract multi-scale features of buildings of varying sizes and shapes; in the PatchGAN-based discriminator, a signed distance map (SDM) module is used to capture the fine-grained shape difference for discrimination. The proposed network was comprehensively evaluated with a synthetic and a real-world dataset. The results demonstrate that SpaGAN outperforms existing baseline models (U-Net, ResU-Net, pix2pix) for building generalization, particularly in the real-world dataset. The new model can achieve more reasonable aggregation, simplification, and squaring generalization operators.\",\"PeriodicalId\":50341,\"journal\":{\"name\":\"International Journal of Applied Earth Observation and Geoinformation\",\"volume\":\"11 1\",\"pages\":\"\"},\"PeriodicalIF\":7.5000,\"publicationDate\":\"2024-12-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Applied Earth Observation and Geoinformation\",\"FirstCategoryId\":\"89\",\"ListUrlMain\":\"https://doi.org/10.1016/j.jag.2024.104236\",\"RegionNum\":1,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"Earth and Planetary Sciences\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Applied Earth Observation and Geoinformation","FirstCategoryId":"89","ListUrlMain":"https://doi.org/10.1016/j.jag.2024.104236","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Earth and Planetary Sciences","Score":null,"Total":0}
SpaGAN: A spatially-aware generative adversarial network for building generalization in image maps
Building generalization is an essential task in generating multi-scale topographic maps. The progress of deep learning offers a new paradigm to overcome the coordination challenges faced by conventional building generalization algorithms. Some studies have confirmed the feasibility of several original semantic segmentation networks, such as U-Net and its variants and the conditional generative adversarial network (cGAN), for building generalization in image maps. However, they suffer from critical deformation effects, especially for large and geometrically complex buildings. Since learning building generalization essentially means modeling the subtle transformation of building footprints across scales, we argue that the spatial awareness of a neural network, for instance, regarding building size and shape, is crucial to effective learning. Thus, we propose a spatially-aware generative adversarial network, SpaGAN. It takes a representative cGAN, pix2pix, as the backbone, and modifies two modules: In the U-Net-based generator, an atrous spatial pyramid pooling (ASPP) module replaces the conventional convolutional module to extract multi-scale features of buildings of varying sizes and shapes; in the PatchGAN-based discriminator, a signed distance map (SDM) module is used to capture the fine-grained shape difference for discrimination. The proposed network was comprehensively evaluated with a synthetic and a real-world dataset. The results demonstrate that SpaGAN outperforms existing baseline models (U-Net, ResU-Net, pix2pix) for building generalization, particularly in the real-world dataset. The new model can achieve more reasonable aggregation, simplification, and squaring generalization operators.
期刊介绍:
The International Journal of Applied Earth Observation and Geoinformation publishes original papers that utilize earth observation data for natural resource and environmental inventory and management. These data primarily originate from remote sensing platforms, including satellites and aircraft, supplemented by surface and subsurface measurements. Addressing natural resources such as forests, agricultural land, soils, and water, as well as environmental concerns like biodiversity, land degradation, and hazards, the journal explores conceptual and data-driven approaches. It covers geoinformation themes like capturing, databasing, visualization, interpretation, data quality, and spatial uncertainty.