地图文本标签差异检测的计算机视觉方法

Proceedings of the 35th International Conference on Scientific and Statistical Database Management Pub Date : 2023-07-10 DOI:10.1145/3603719.3603722

Abdulrahman Salama, Mahmoud Elkamhawy, Mohamed Ali, Ehab Al-Masri, Adel Sabour, Abdeltawab M. Hendawi, Ming Tan, Vashutosh Agrawal, Ravi Prakash

{"title":"地图文本标签差异检测的计算机视觉方法","authors":"Abdulrahman Salama, Mahmoud Elkamhawy, Mohamed Ali, Ehab Al-Masri, Adel Sabour, Abdeltawab M. Hendawi, Ming Tan, Vashutosh Agrawal, Ravi Prakash","doi":"10.1145/3603719.3603722","DOIUrl":null,"url":null,"abstract":"Maps provide various sources of information. An important example of such information is textual labels such as cities, neighborhoods, and street names. Although we treat this information as facts, and despite the massive effort done by providers to continuously improve their accuracy, this data is far from perfect. Discrepancies in textual labels rendered on the map are one of the major sources of inconsistencies across map providers. These discrepancies can have significant impacts on the reliability of the derived information and decision-making processes. Thus, it is important to validate the accuracy and consistency in such data. Most providers treat this data as their propriety data and it is not available to the public, thus we cannot compare the data directly. To address these challenges, we introduce a novel computer vision-based approach for automatically extracting and classifying labels based on the visual characteristics of the label, which indicates its category based on the format convention used by the specific map provider. Based on the extracted data, we detect the degree of discrepancies across map providers. We consider three map providers: Bing Maps, Google Maps, and OpenStreetMaps. The neural network we develop classifies the text labels with an accuracy up to 93% in all providers. We leverage our system to analyze randomly selected regions in different markets. The studied markets are USA, Germany, France, and Brazil. Experimental results and statistical analysis reveal the amount of discrepancies across map providers per region. We calculate the Jaccard distance between the extracted text sets for each pair of map providers, which represents the discrepancy percentage. Discrepancies percentages as high as 90% were found in some markets.","PeriodicalId":314512,"journal":{"name":"Proceedings of the 35th International Conference on Scientific and Statistical Database Management","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Computer Vision Approach for Detecting Discrepancies in Map Textual Labels\",\"authors\":\"Abdulrahman Salama, Mahmoud Elkamhawy, Mohamed Ali, Ehab Al-Masri, Adel Sabour, Abdeltawab M. Hendawi, Ming Tan, Vashutosh Agrawal, Ravi Prakash\",\"doi\":\"10.1145/3603719.3603722\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Maps provide various sources of information. An important example of such information is textual labels such as cities, neighborhoods, and street names. Although we treat this information as facts, and despite the massive effort done by providers to continuously improve their accuracy, this data is far from perfect. Discrepancies in textual labels rendered on the map are one of the major sources of inconsistencies across map providers. These discrepancies can have significant impacts on the reliability of the derived information and decision-making processes. Thus, it is important to validate the accuracy and consistency in such data. Most providers treat this data as their propriety data and it is not available to the public, thus we cannot compare the data directly. To address these challenges, we introduce a novel computer vision-based approach for automatically extracting and classifying labels based on the visual characteristics of the label, which indicates its category based on the format convention used by the specific map provider. Based on the extracted data, we detect the degree of discrepancies across map providers. We consider three map providers: Bing Maps, Google Maps, and OpenStreetMaps. The neural network we develop classifies the text labels with an accuracy up to 93% in all providers. We leverage our system to analyze randomly selected regions in different markets. The studied markets are USA, Germany, France, and Brazil. Experimental results and statistical analysis reveal the amount of discrepancies across map providers per region. We calculate the Jaccard distance between the extracted text sets for each pair of map providers, which represents the discrepancy percentage. Discrepancies percentages as high as 90% were found in some markets.\",\"PeriodicalId\":314512,\"journal\":{\"name\":\"Proceedings of the 35th International Conference on Scientific and Statistical Database Management\",\"volume\":\"11 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-07-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 35th International Conference on Scientific and Statistical Database Management\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3603719.3603722\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 35th International Conference on Scientific and Statistical Database Management","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3603719.3603722","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

地图提供各种信息来源。此类信息的一个重要示例是文本标签，如城市、社区和街道名称。尽管我们将这些信息视为事实，尽管供应商付出了巨大努力不断提高其准确性，但这些数据远非完美。地图上呈现的文本标签的差异是地图提供程序之间不一致的主要来源之一。这些差异会对所得信息和决策过程的可靠性产生重大影响。因此，验证这些数据的准确性和一致性是很重要的。大多数供应商将这些数据视为他们的专有数据，不向公众提供，因此我们无法直接比较数据。为了解决这些挑战，我们引入了一种新的基于计算机视觉的方法，基于标签的视觉特征自动提取和分类标签，该方法根据特定地图提供者使用的格式约定指示其类别。基于提取的数据，我们检测不同地图提供商之间的差异程度。我们考虑三个地图提供商:必应地图、谷歌地图和OpenStreetMaps。我们开发的神经网络在所有提供者中对文本标签的分类准确率高达93%。我们利用我们的系统来分析不同市场中随机选择的区域。研究的市场是美国、德国、法国和巴西。实验结果和统计分析揭示了不同地区地图提供商之间的差异。我们计算每对地图提供者提取的文本集之间的Jaccard距离，这表示差异百分比。在一些市场，差异率高达90%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A Computer Vision Approach for Detecting Discrepancies in Map Textual Labels

Maps provide various sources of information. An important example of such information is textual labels such as cities, neighborhoods, and street names. Although we treat this information as facts, and despite the massive effort done by providers to continuously improve their accuracy, this data is far from perfect. Discrepancies in textual labels rendered on the map are one of the major sources of inconsistencies across map providers. These discrepancies can have significant impacts on the reliability of the derived information and decision-making processes. Thus, it is important to validate the accuracy and consistency in such data. Most providers treat this data as their propriety data and it is not available to the public, thus we cannot compare the data directly. To address these challenges, we introduce a novel computer vision-based approach for automatically extracting and classifying labels based on the visual characteristics of the label, which indicates its category based on the format convention used by the specific map provider. Based on the extracted data, we detect the degree of discrepancies across map providers. We consider three map providers: Bing Maps, Google Maps, and OpenStreetMaps. The neural network we develop classifies the text labels with an accuracy up to 93% in all providers. We leverage our system to analyze randomly selected regions in different markets. The studied markets are USA, Germany, France, and Brazil. Experimental results and statistical analysis reveal the amount of discrepancies across map providers per region. We calculate the Jaccard distance between the extracted text sets for each pair of map providers, which represents the discrepancy percentage. Discrepancies percentages as high as 90% were found in some markets.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the 35th International Conference on Scientific and Statistical Database Management

自引率

0.00%

发文量