{"title":"Multi-E2E:集成高分辨率遥感图像和多源地理数据的端到端城市土地利用制图框架","authors":"Ruiyi Yang , Yanfei Zhong , Yu Su","doi":"10.1016/j.rse.2025.114966","DOIUrl":null,"url":null,"abstract":"<div><div>The urban land-use map reflects the distribution of the different functional lands in the city, serving as a valuable reference for urban planning and management. High-resolution remote sensing (HRS) images provide detailed spatial information about parcels but lack socio-economic information, which is crucial for identifying land-use types. To enhance the mapping performance and obtain more comprehensive land-use information, the integration of HRS images and points of interest (POIs) with socio-economic information is crucial. Nonetheless, the existing land-use mapping methods based on HRS images and POIs are generally confronted with the following challenges: 1) due to the reliance on prior knowledge, the existing methods cannot automatically capture the complex relationships between multi-source data and land-use categories; 2) there are substantial semantic disparities between HRS images and POIs, so that the simple fusion approaches cannot fully utilize the complementary information; and 3) the existing methods are generally based on the assumption of complete modalities, resulting in them failing to work on POI-deficient parcels. In this paper, to address these issues, an end-to-end urban land-use mapping framework integrating HRS images and multi-source geographic data (Multi-E2E) is proposed. The Multi-E2E framework automatically establishes the mapping from multi-source data to land-use categories through a data-driven approach, and generates informative fused representations with an adaptive fusion module (AFM). In Multi-E2E, the labeled HRS image-POI pairs are constructed using the areas of interest (AOIs), and the interactions between modalities are facilitated by the end-to-end architecture. To identify POI-deficient parcels and ensure that the modality-specific encoders are adequately supervised, a unimodal supervision module (USM) is introduced in the Multi-E2E framework. Experiments conducted with multi-source samples from 34 Chinese cities and the urban regions of Beijing, Wuhan, Hong Kong, Macao, and Helsinki validate the effectiveness and generalizability of the proposed framework for urban land-use mapping applications. The code will be publicly available at <span><span>https://github.com/Rayoll/Multi_E2E</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":417,"journal":{"name":"Remote Sensing of Environment","volume":"330 ","pages":"Article 114966"},"PeriodicalIF":11.4000,"publicationDate":"2025-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multi-E2E: An end-to-end urban land-use mapping framework integrating high-resolution remote sensing images and multi-source geographical data\",\"authors\":\"Ruiyi Yang , Yanfei Zhong , Yu Su\",\"doi\":\"10.1016/j.rse.2025.114966\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The urban land-use map reflects the distribution of the different functional lands in the city, serving as a valuable reference for urban planning and management. High-resolution remote sensing (HRS) images provide detailed spatial information about parcels but lack socio-economic information, which is crucial for identifying land-use types. To enhance the mapping performance and obtain more comprehensive land-use information, the integration of HRS images and points of interest (POIs) with socio-economic information is crucial. Nonetheless, the existing land-use mapping methods based on HRS images and POIs are generally confronted with the following challenges: 1) due to the reliance on prior knowledge, the existing methods cannot automatically capture the complex relationships between multi-source data and land-use categories; 2) there are substantial semantic disparities between HRS images and POIs, so that the simple fusion approaches cannot fully utilize the complementary information; and 3) the existing methods are generally based on the assumption of complete modalities, resulting in them failing to work on POI-deficient parcels. In this paper, to address these issues, an end-to-end urban land-use mapping framework integrating HRS images and multi-source geographic data (Multi-E2E) is proposed. The Multi-E2E framework automatically establishes the mapping from multi-source data to land-use categories through a data-driven approach, and generates informative fused representations with an adaptive fusion module (AFM). In Multi-E2E, the labeled HRS image-POI pairs are constructed using the areas of interest (AOIs), and the interactions between modalities are facilitated by the end-to-end architecture. To identify POI-deficient parcels and ensure that the modality-specific encoders are adequately supervised, a unimodal supervision module (USM) is introduced in the Multi-E2E framework. Experiments conducted with multi-source samples from 34 Chinese cities and the urban regions of Beijing, Wuhan, Hong Kong, Macao, and Helsinki validate the effectiveness and generalizability of the proposed framework for urban land-use mapping applications. The code will be publicly available at <span><span>https://github.com/Rayoll/Multi_E2E</span><svg><path></path></svg></span>.</div></div>\",\"PeriodicalId\":417,\"journal\":{\"name\":\"Remote Sensing of Environment\",\"volume\":\"330 \",\"pages\":\"Article 114966\"},\"PeriodicalIF\":11.4000,\"publicationDate\":\"2025-08-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Remote Sensing of Environment\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0034425725003700\",\"RegionNum\":1,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENVIRONMENTAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Remote Sensing of Environment","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0034425725003700","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
Multi-E2E: An end-to-end urban land-use mapping framework integrating high-resolution remote sensing images and multi-source geographical data
The urban land-use map reflects the distribution of the different functional lands in the city, serving as a valuable reference for urban planning and management. High-resolution remote sensing (HRS) images provide detailed spatial information about parcels but lack socio-economic information, which is crucial for identifying land-use types. To enhance the mapping performance and obtain more comprehensive land-use information, the integration of HRS images and points of interest (POIs) with socio-economic information is crucial. Nonetheless, the existing land-use mapping methods based on HRS images and POIs are generally confronted with the following challenges: 1) due to the reliance on prior knowledge, the existing methods cannot automatically capture the complex relationships between multi-source data and land-use categories; 2) there are substantial semantic disparities between HRS images and POIs, so that the simple fusion approaches cannot fully utilize the complementary information; and 3) the existing methods are generally based on the assumption of complete modalities, resulting in them failing to work on POI-deficient parcels. In this paper, to address these issues, an end-to-end urban land-use mapping framework integrating HRS images and multi-source geographic data (Multi-E2E) is proposed. The Multi-E2E framework automatically establishes the mapping from multi-source data to land-use categories through a data-driven approach, and generates informative fused representations with an adaptive fusion module (AFM). In Multi-E2E, the labeled HRS image-POI pairs are constructed using the areas of interest (AOIs), and the interactions between modalities are facilitated by the end-to-end architecture. To identify POI-deficient parcels and ensure that the modality-specific encoders are adequately supervised, a unimodal supervision module (USM) is introduced in the Multi-E2E framework. Experiments conducted with multi-source samples from 34 Chinese cities and the urban regions of Beijing, Wuhan, Hong Kong, Macao, and Helsinki validate the effectiveness and generalizability of the proposed framework for urban land-use mapping applications. The code will be publicly available at https://github.com/Rayoll/Multi_E2E.
期刊介绍:
Remote Sensing of Environment (RSE) serves the Earth observation community by disseminating results on the theory, science, applications, and technology that contribute to advancing the field of remote sensing. With a thoroughly interdisciplinary approach, RSE encompasses terrestrial, oceanic, and atmospheric sensing.
The journal emphasizes biophysical and quantitative approaches to remote sensing at local to global scales, covering a diverse range of applications and techniques.
RSE serves as a vital platform for the exchange of knowledge and advancements in the dynamic field of remote sensing.