Chuanji Shi , Yingying Zhang , Jiaotuan Wang , Xin Guo , Qiqi Zhu
{"title":"通过遥感图像和地理先验生成感兴趣的多模式城市地区","authors":"Chuanji Shi , Yingying Zhang , Jiaotuan Wang , Xin Guo , Qiqi Zhu","doi":"10.1016/j.jag.2024.104326","DOIUrl":null,"url":null,"abstract":"<div><div>Urban area-of-interest (AOI) refers to an integrated urban functional zone with defined polygonal boundaries. The rapid development of urban commerce has led to increasing demands for highly accurate and timely AOI data. However, existing research primarily focuses on coarse-grained functional zones for urban planning or regional economic analysis, and often neglects AOI’s expiration in the real world. They fail to fulfill the precision requirements of Mobile Internet Online-to-Offline (O2O) businesses. These businesses require AOI boundary accuracy down to a specific community, school, or hospital. In this paper, we propose a fully end-to-end multimodal AOI TRansformer (AOITR) model designed for simultaneously detecting accurate AOI boundaries and validating AOI’s reliability by leveraging remote sensing imagery coupled with geographical prior. Unlike conventional AOI generation methods, such as the Road-cut method that segments road networks at various levels, our approach diverges from semantic segmentation algorithms that depend on pixel-level classification. Instead, our AOITR begins by selecting a point-of-interest (POI) of specific category, which can be easily obtained via web crawler, and uses it to retrieve corresponding remote sensing imagery and geographical prior such as entrance POIs and road nodes. This information helps to build a multimodal detection model based on transformer encoder-decoder architecture to regress the accurate AOI polygon. Additionally, we utilize the dynamic features from human mobility, nearby POIs, and logistics addresses for AOI reliability evaluation via a cascaded network module. The experimental results reveal that our algorithm achieves a significant improvement on Intersection over Union (IoU) metric, surpassing previous methods by a large margin. Furthermore, the AOIs produced by AOITR have substantially enriched our AOI library and have been successfully applied on over 10 different O2O scenarios including Alipay’s face scan payment service.</div></div>","PeriodicalId":73423,"journal":{"name":"International journal of applied earth observation and geoinformation : ITC journal","volume":"136 ","pages":"Article 104326"},"PeriodicalIF":7.6000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multimodal urban areas of interest generation via remote sensing imagery and geographical prior\",\"authors\":\"Chuanji Shi , Yingying Zhang , Jiaotuan Wang , Xin Guo , Qiqi Zhu\",\"doi\":\"10.1016/j.jag.2024.104326\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Urban area-of-interest (AOI) refers to an integrated urban functional zone with defined polygonal boundaries. The rapid development of urban commerce has led to increasing demands for highly accurate and timely AOI data. However, existing research primarily focuses on coarse-grained functional zones for urban planning or regional economic analysis, and often neglects AOI’s expiration in the real world. They fail to fulfill the precision requirements of Mobile Internet Online-to-Offline (O2O) businesses. These businesses require AOI boundary accuracy down to a specific community, school, or hospital. In this paper, we propose a fully end-to-end multimodal AOI TRansformer (AOITR) model designed for simultaneously detecting accurate AOI boundaries and validating AOI’s reliability by leveraging remote sensing imagery coupled with geographical prior. Unlike conventional AOI generation methods, such as the Road-cut method that segments road networks at various levels, our approach diverges from semantic segmentation algorithms that depend on pixel-level classification. Instead, our AOITR begins by selecting a point-of-interest (POI) of specific category, which can be easily obtained via web crawler, and uses it to retrieve corresponding remote sensing imagery and geographical prior such as entrance POIs and road nodes. This information helps to build a multimodal detection model based on transformer encoder-decoder architecture to regress the accurate AOI polygon. Additionally, we utilize the dynamic features from human mobility, nearby POIs, and logistics addresses for AOI reliability evaluation via a cascaded network module. The experimental results reveal that our algorithm achieves a significant improvement on Intersection over Union (IoU) metric, surpassing previous methods by a large margin. Furthermore, the AOIs produced by AOITR have substantially enriched our AOI library and have been successfully applied on over 10 different O2O scenarios including Alipay’s face scan payment service.</div></div>\",\"PeriodicalId\":73423,\"journal\":{\"name\":\"International journal of applied earth observation and geoinformation : ITC journal\",\"volume\":\"136 \",\"pages\":\"Article 104326\"},\"PeriodicalIF\":7.6000,\"publicationDate\":\"2025-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International journal of applied earth observation and geoinformation : ITC journal\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1569843224006848\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"REMOTE SENSING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International journal of applied earth observation and geoinformation : ITC journal","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1569843224006848","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"REMOTE SENSING","Score":null,"Total":0}
Multimodal urban areas of interest generation via remote sensing imagery and geographical prior
Urban area-of-interest (AOI) refers to an integrated urban functional zone with defined polygonal boundaries. The rapid development of urban commerce has led to increasing demands for highly accurate and timely AOI data. However, existing research primarily focuses on coarse-grained functional zones for urban planning or regional economic analysis, and often neglects AOI’s expiration in the real world. They fail to fulfill the precision requirements of Mobile Internet Online-to-Offline (O2O) businesses. These businesses require AOI boundary accuracy down to a specific community, school, or hospital. In this paper, we propose a fully end-to-end multimodal AOI TRansformer (AOITR) model designed for simultaneously detecting accurate AOI boundaries and validating AOI’s reliability by leveraging remote sensing imagery coupled with geographical prior. Unlike conventional AOI generation methods, such as the Road-cut method that segments road networks at various levels, our approach diverges from semantic segmentation algorithms that depend on pixel-level classification. Instead, our AOITR begins by selecting a point-of-interest (POI) of specific category, which can be easily obtained via web crawler, and uses it to retrieve corresponding remote sensing imagery and geographical prior such as entrance POIs and road nodes. This information helps to build a multimodal detection model based on transformer encoder-decoder architecture to regress the accurate AOI polygon. Additionally, we utilize the dynamic features from human mobility, nearby POIs, and logistics addresses for AOI reliability evaluation via a cascaded network module. The experimental results reveal that our algorithm achieves a significant improvement on Intersection over Union (IoU) metric, surpassing previous methods by a large margin. Furthermore, the AOIs produced by AOITR have substantially enriched our AOI library and have been successfully applied on over 10 different O2O scenarios including Alipay’s face scan payment service.
期刊介绍:
The International Journal of Applied Earth Observation and Geoinformation publishes original papers that utilize earth observation data for natural resource and environmental inventory and management. These data primarily originate from remote sensing platforms, including satellites and aircraft, supplemented by surface and subsurface measurements. Addressing natural resources such as forests, agricultural land, soils, and water, as well as environmental concerns like biodiversity, land degradation, and hazards, the journal explores conceptual and data-driven approaches. It covers geoinformation themes like capturing, databasing, visualization, interpretation, data quality, and spatial uncertainty.