A dual of Transformer features-related map-intelligent generation method

Q3 Computer Science

中国图象图形学报 Pub Date : 2023-01-01 DOI:10.11834/jig.220887

Fang Zheng, Fu Ying, Liu Lixiong

{"title":"A dual of Transformer features-related map-intelligent generation method","authors":"Fang Zheng, Fu Ying, Liu Lixiong","doi":"10.11834/jig.220887","DOIUrl":null,"url":null,"abstract":"目的现有的地图智能生成技术没有考虑到地图生成任务存在的地理要素类内差异性和地理要素域间差异性，这使得生成的地图质量难以满足实际需要。针对地理要素类内差异性和地理要素域间差异性，提出了一种Transformer特征引导的双阶段地图智能生成方法。方法首先基于最新的Transformer网络，设计了一个基于该网络的特征提取模块，该模块提取遥感图像中的地理要素特征用于引导地图生成，解决了地理要素类内差异性导致的地图生成困难的问题。然后设计双阶段生成框架，该框架具备两个生成对抗网络，第1个生成对抗网络为初步生成对抗网络，利用遥感图像和Transformer特征得到初步的地图图像；第2个生成对抗网络为精修生成对抗网络利用初步地图图像生成高质量的精修地图图像，缓解了地理要素域间差异性导致的地图地理要素生成不准确问题。结果在AIDOMG（aerial image dataset for online map generation）数据集上的9个区域进行了实验，与10种经典的和最新方法进行了比较，提出方法取得了最优的结果。其中，在海口区域，相比于Creative GAN方法，FID （Frechet inception distance）值降低了16.0%，WD （Wasserstein distance）降低了4.2%，1-NN （1-nearest neighbor）降低了5.9%；在巴黎区域，相比于Creative GAN方法，FID值降低了2.9%，WD降低了1.0%，1-NN降低了2.1%。结论提出的Transformer特征引导的双阶段地图智能生成方法通过高质量的Transformer特征引导和双阶段生成框架解决了地理要素类内差异性和地理要素域间差异性所带来的地图生成质量较差的问题。;Objective Map intelligent generation technique is focused on generating map images quickly and cost efficiently. For existing intelligent map generation technique,to get quick-responsed and low-cost map generation,remote sensing image is taken as the input,and its generative adversarial network(GAN) is used to generate the corresponding map image. Inevitably,it is challenged that the intra-class differences within geographical elements in remote sensing images and the differences of geographical elements between domains in the map generation task are still not involved in. The intra-class difference of geographical elements refers that similar geographical elements in remote sensing images have several of appearances,which are difficult to be interpreted. Geographical elements segmentation is required for map generation in relevance to melting obvious intra-class differences into corresponding categories. The difference of geographical elements between different domains means that the corresponding geographical elements in remote sensing images and map images are not exactly matched well. For example,the edges of vegetation elements in remote sensing images are irregular, while the edges of vegetation elements in map images are flat. Another challenge for map generation is to generate and keep consistency to the features of map elements. Aiming at the intra-class difference of geographical elements and the superposition of geographical elements,we develop a dual of map-intelligent generation method based on Transformer features. Method The model consists of three sorts of modules relevant to feature extraction,preliminary and refined generative adversarial contexts. First,feature extraction module is developed based on the latest Transformer network. It consists of a backbone and segmentation branch in terms of Swin-Transformer structure. Self-attention mechanism based Transformer can be used to construct the global relationship of the image,and it has a larger receptive field and it can extract feature information effectively. The segmentation branch is composed of a pyramid pooling module(PPM) and a feature pyramid network(FPN). To get more effective geographic element features,feature pyramid is employed to extract multi-level feature information,and the high-level geographic element semantic information can be integrated into the middle-level and low-level geographic element semantic information,and the PPM is used to introduce the global semantic information as well. Next,feature information is sent to the segmentation branch,which uses the actual segmentation results as a guidance to generate effective geographical element features. To guide map generation and resolve the problem of map generation caused by the differences in geographical elements,this module can be used to extract the features of geographical elements in remote sensing images. Third,the preliminary generative adversarial module has a preliminary generator and a discriminator. The preliminary generator is a multi-scale generator,consisting of a local generator and a global generator, and it is used to generate the high-resolution images. Both of local and global generators are linked to encoder/decoder structures. The input of the preliminary generator is derived of remote sensing image and geographical element features, and the output is originated from preliminary map image. The discriminator is also recognized as a multi-scale discriminator,which consists of three sorts of sub discriminators for the high-resolution images. The input of the discriminator is the generated map and the real map,and the output is the single channel confidence map. Finally,a refined generator is used for refined generative adversarial module,and a discriminator with the preliminary generative adversarial module is shared in as well. The structure of the refined generator is same as the preliminary generator,which is also as a multi-scale generator in terms of local and global generators. The input of the refinement generator is originated from a preliminary map image and the output is derived of a fine map image. A dual of generation framework is constructed in terms of refined and preliminary generative adversarial-related modules. In general,to obtain preliminary map images,the preliminary generative adversarial module is as inputs based on remote sensing images and geographical element features. The preliminary map image is rough,and there are incomplete geographical elements,such as uneven road edges and fractures. For the refined generative adversarial module,to learn the geometric characteristics of geographical elements in the real map,obtain highquality fine map images,and alleviate the problem of inaccurate local map generation caused by the differences of geographical elements between domains,the generated primary map image is taken as the input,and the real map is taken as the guide as well. Result Experiments are carried out on 9 regions on the aerial image dataset for online map generation (AIDOMG) dataset in comparison with 10 sort of popular methods. For the Haikou area,Frechet inception distance(FID) is reduced by 16. 0%,Wasserstein distance(WD) is reduced by 4. 2%,and the 1-nearest neighbor(1-NN) is reduced by 5. 9% as well. For the Paris area,FID is decreased by 2. 9%,WD is decreased by 1. 0%,and 1-NN decreased by 2. 1% simultaneously. Comparative analyses demonstrate that our method proposed can improve the results of map generation effectively. At the same time,ablation studies of the model can show the effectiveness of each module,and each module can be added and the model results is improved gradually as well. Conclusion To solve the problem of poor map generation quality caused by the intra-class inconsistency of geographical elements effectively,a dual of Transformer features-related map-intelligent generation method is proposed,and the differences of geographical elements between domains can be illustrated via high-quality Transformer-guided feature and a dual of generation framework further.","PeriodicalId":36336,"journal":{"name":"中国图象图形学报","volume":"2013 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"中国图象图形学报","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.11834/jig.220887","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Computer Science","Score":null,"Total":0}

引用次数: 0

Abstract

目的现有的地图智能生成技术没有考虑到地图生成任务存在的地理要素类内差异性和地理要素域间差异性，这使得生成的地图质量难以满足实际需要。针对地理要素类内差异性和地理要素域间差异性，提出了一种Transformer特征引导的双阶段地图智能生成方法。方法首先基于最新的Transformer网络，设计了一个基于该网络的特征提取模块，该模块提取遥感图像中的地理要素特征用于引导地图生成，解决了地理要素类内差异性导致的地图生成困难的问题。然后设计双阶段生成框架，该框架具备两个生成对抗网络，第1个生成对抗网络为初步生成对抗网络，利用遥感图像和Transformer特征得到初步的地图图像；第2个生成对抗网络为精修生成对抗网络利用初步地图图像生成高质量的精修地图图像，缓解了地理要素域间差异性导致的地图地理要素生成不准确问题。结果在AIDOMG（aerial image dataset for online map generation）数据集上的9个区域进行了实验，与10种经典的和最新方法进行了比较，提出方法取得了最优的结果。其中，在海口区域，相比于Creative GAN方法，FID （Frechet inception distance）值降低了16.0%，WD （Wasserstein distance）降低了4.2%，1-NN （1-nearest neighbor）降低了5.9%；在巴黎区域，相比于Creative GAN方法，FID值降低了2.9%，WD降低了1.0%，1-NN降低了2.1%。结论提出的Transformer特征引导的双阶段地图智能生成方法通过高质量的Transformer特征引导和双阶段生成框架解决了地理要素类内差异性和地理要素域间差异性所带来的地图生成质量较差的问题。;Objective Map intelligent generation technique is focused on generating map images quickly and cost efficiently. For existing intelligent map generation technique,to get quick-responsed and low-cost map generation,remote sensing image is taken as the input,and its generative adversarial network(GAN) is used to generate the corresponding map image. Inevitably,it is challenged that the intra-class differences within geographical elements in remote sensing images and the differences of geographical elements between domains in the map generation task are still not involved in. The intra-class difference of geographical elements refers that similar geographical elements in remote sensing images have several of appearances,which are difficult to be interpreted. Geographical elements segmentation is required for map generation in relevance to melting obvious intra-class differences into corresponding categories. The difference of geographical elements between different domains means that the corresponding geographical elements in remote sensing images and map images are not exactly matched well. For example,the edges of vegetation elements in remote sensing images are irregular, while the edges of vegetation elements in map images are flat. Another challenge for map generation is to generate and keep consistency to the features of map elements. Aiming at the intra-class difference of geographical elements and the superposition of geographical elements,we develop a dual of map-intelligent generation method based on Transformer features. Method The model consists of three sorts of modules relevant to feature extraction,preliminary and refined generative adversarial contexts. First,feature extraction module is developed based on the latest Transformer network. It consists of a backbone and segmentation branch in terms of Swin-Transformer structure. Self-attention mechanism based Transformer can be used to construct the global relationship of the image,and it has a larger receptive field and it can extract feature information effectively. The segmentation branch is composed of a pyramid pooling module(PPM) and a feature pyramid network(FPN). To get more effective geographic element features,feature pyramid is employed to extract multi-level feature information,and the high-level geographic element semantic information can be integrated into the middle-level and low-level geographic element semantic information,and the PPM is used to introduce the global semantic information as well. Next,feature information is sent to the segmentation branch,which uses the actual segmentation results as a guidance to generate effective geographical element features. To guide map generation and resolve the problem of map generation caused by the differences in geographical elements,this module can be used to extract the features of geographical elements in remote sensing images. Third,the preliminary generative adversarial module has a preliminary generator and a discriminator. The preliminary generator is a multi-scale generator,consisting of a local generator and a global generator, and it is used to generate the high-resolution images. Both of local and global generators are linked to encoder/decoder structures. The input of the preliminary generator is derived of remote sensing image and geographical element features, and the output is originated from preliminary map image. The discriminator is also recognized as a multi-scale discriminator,which consists of three sorts of sub discriminators for the high-resolution images. The input of the discriminator is the generated map and the real map,and the output is the single channel confidence map. Finally,a refined generator is used for refined generative adversarial module,and a discriminator with the preliminary generative adversarial module is shared in as well. The structure of the refined generator is same as the preliminary generator,which is also as a multi-scale generator in terms of local and global generators. The input of the refinement generator is originated from a preliminary map image and the output is derived of a fine map image. A dual of generation framework is constructed in terms of refined and preliminary generative adversarial-related modules. In general,to obtain preliminary map images,the preliminary generative adversarial module is as inputs based on remote sensing images and geographical element features. The preliminary map image is rough,and there are incomplete geographical elements,such as uneven road edges and fractures. For the refined generative adversarial module,to learn the geometric characteristics of geographical elements in the real map,obtain highquality fine map images,and alleviate the problem of inaccurate local map generation caused by the differences of geographical elements between domains,the generated primary map image is taken as the input,and the real map is taken as the guide as well. Result Experiments are carried out on 9 regions on the aerial image dataset for online map generation (AIDOMG) dataset in comparison with 10 sort of popular methods. For the Haikou area,Frechet inception distance(FID) is reduced by 16. 0%,Wasserstein distance(WD) is reduced by 4. 2%,and the 1-nearest neighbor(1-NN) is reduced by 5. 9% as well. For the Paris area,FID is decreased by 2. 9%,WD is decreased by 1. 0%,and 1-NN decreased by 2. 1% simultaneously. Comparative analyses demonstrate that our method proposed can improve the results of map generation effectively. At the same time,ablation studies of the model can show the effectiveness of each module,and each module can be added and the model results is improved gradually as well. Conclusion To solve the problem of poor map generation quality caused by the intra-class inconsistency of geographical elements effectively,a dual of Transformer features-related map-intelligent generation method is proposed,and the differences of geographical elements between domains can be illustrated via high-quality Transformer-guided feature and a dual of generation framework further.

查看原文本刊更多论文

一种双变压器特征相关地图智能生成方法

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

中国图象图形学报 Computer Science-Computer Graphics and Computer-Aided Design

CiteScore

1.20

自引率

0.00%

发文量

6776

期刊介绍： Journal of Image and Graphics (ISSN 1006-8961, CN 11-3758/TB, CODEN ZTTXFZ) is an authoritative academic journal supervised by the Chinese Academy of Sciences and co-sponsored by the Institute of Space and Astronautical Information Innovation of the Chinese Academy of Sciences (ISIAS), the Chinese Society of Image and Graphics (CSIG), and the Beijing Institute of Applied Physics and Computational Mathematics (BIAPM). The journal integrates high-tech theories, technical methods and industrialisation of applied research results in computer image graphics, and mainly publishes innovative and high-level scientific research papers on basic and applied research in image graphics science and its closely related fields. The form of papers includes reviews, technical reports, project progress, academic news, new technology reviews, new product introduction and industrialisation research. The content covers a wide range of fields such as image analysis and recognition, image understanding and computer vision, computer graphics, virtual reality and augmented reality, system simulation, animation, etc., and theme columns are opened according to the research hotspots and cutting-edge topics. Journal of Image and Graphics reaches a wide range of readers, including scientific and technical personnel, enterprise supervisors, and postgraduates and college students of colleges and universities engaged in the fields of national defence, military, aviation, aerospace, communications, electronics, automotive, agriculture, meteorology, environmental protection, remote sensing, mapping, oil field, construction, transportation, finance, telecommunications, education, medical care, film and television, and art. Journal of Image and Graphics is included in many important domestic and international scientific literature database systems, including EBSCO database in the United States, JST database in Japan, Scopus database in the Netherlands, China Science and Technology Thesis Statistics and Analysis (Annual Research Report), China Science Citation Database (CSCD), China Academic Journal Network Publishing Database (CAJD), and China Academic Journal Network Publishing Database (CAJD). China Science Citation Database (CSCD), China Academic Journals Network Publishing Database (CAJD), China Academic Journal Abstracts, Chinese Science Abstracts (Series A), China Electronic Science Abstracts, Chinese Core Journals Abstracts, Chinese Academic Journals on CD-ROM, and China Academic Journals Comprehensive Evaluation Database.