A Computer Vision Approach for Detecting Discrepancies in Map Textual Labels

Abdulrahman Salama, Mahmoud Elkamhawy, Mohamed Ali, Ehab Al-Masri, Adel Sabour, Abdeltawab M. Hendawi, Ming Tan, Vashutosh Agrawal, Ravi Prakash
{"title":"A Computer Vision Approach for Detecting Discrepancies in Map Textual Labels","authors":"Abdulrahman Salama, Mahmoud Elkamhawy, Mohamed Ali, Ehab Al-Masri, Adel Sabour, Abdeltawab M. Hendawi, Ming Tan, Vashutosh Agrawal, Ravi Prakash","doi":"10.1145/3603719.3603722","DOIUrl":null,"url":null,"abstract":"Maps provide various sources of information. An important example of such information is textual labels such as cities, neighborhoods, and street names. Although we treat this information as facts, and despite the massive effort done by providers to continuously improve their accuracy, this data is far from perfect. Discrepancies in textual labels rendered on the map are one of the major sources of inconsistencies across map providers. These discrepancies can have significant impacts on the reliability of the derived information and decision-making processes. Thus, it is important to validate the accuracy and consistency in such data. Most providers treat this data as their propriety data and it is not available to the public, thus we cannot compare the data directly. To address these challenges, we introduce a novel computer vision-based approach for automatically extracting and classifying labels based on the visual characteristics of the label, which indicates its category based on the format convention used by the specific map provider. Based on the extracted data, we detect the degree of discrepancies across map providers. We consider three map providers: Bing Maps, Google Maps, and OpenStreetMaps. The neural network we develop classifies the text labels with an accuracy up to 93% in all providers. We leverage our system to analyze randomly selected regions in different markets. The studied markets are USA, Germany, France, and Brazil. Experimental results and statistical analysis reveal the amount of discrepancies across map providers per region. We calculate the Jaccard distance between the extracted text sets for each pair of map providers, which represents the discrepancy percentage. Discrepancies percentages as high as 90% were found in some markets.","PeriodicalId":314512,"journal":{"name":"Proceedings of the 35th International Conference on Scientific and Statistical Database Management","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 35th International Conference on Scientific and Statistical Database Management","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3603719.3603722","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Maps provide various sources of information. An important example of such information is textual labels such as cities, neighborhoods, and street names. Although we treat this information as facts, and despite the massive effort done by providers to continuously improve their accuracy, this data is far from perfect. Discrepancies in textual labels rendered on the map are one of the major sources of inconsistencies across map providers. These discrepancies can have significant impacts on the reliability of the derived information and decision-making processes. Thus, it is important to validate the accuracy and consistency in such data. Most providers treat this data as their propriety data and it is not available to the public, thus we cannot compare the data directly. To address these challenges, we introduce a novel computer vision-based approach for automatically extracting and classifying labels based on the visual characteristics of the label, which indicates its category based on the format convention used by the specific map provider. Based on the extracted data, we detect the degree of discrepancies across map providers. We consider three map providers: Bing Maps, Google Maps, and OpenStreetMaps. The neural network we develop classifies the text labels with an accuracy up to 93% in all providers. We leverage our system to analyze randomly selected regions in different markets. The studied markets are USA, Germany, France, and Brazil. Experimental results and statistical analysis reveal the amount of discrepancies across map providers per region. We calculate the Jaccard distance between the extracted text sets for each pair of map providers, which represents the discrepancy percentage. Discrepancies percentages as high as 90% were found in some markets.
地图文本标签差异检测的计算机视觉方法
地图提供各种信息来源。此类信息的一个重要示例是文本标签,如城市、社区和街道名称。尽管我们将这些信息视为事实,尽管供应商付出了巨大努力不断提高其准确性,但这些数据远非完美。地图上呈现的文本标签的差异是地图提供程序之间不一致的主要来源之一。这些差异会对所得信息和决策过程的可靠性产生重大影响。因此,验证这些数据的准确性和一致性是很重要的。大多数供应商将这些数据视为他们的专有数据,不向公众提供,因此我们无法直接比较数据。为了解决这些挑战,我们引入了一种新的基于计算机视觉的方法,基于标签的视觉特征自动提取和分类标签,该方法根据特定地图提供者使用的格式约定指示其类别。基于提取的数据,我们检测不同地图提供商之间的差异程度。我们考虑三个地图提供商:必应地图、谷歌地图和OpenStreetMaps。我们开发的神经网络在所有提供者中对文本标签的分类准确率高达93%。我们利用我们的系统来分析不同市场中随机选择的区域。研究的市场是美国、德国、法国和巴西。实验结果和统计分析揭示了不同地区地图提供商之间的差异。我们计算每对地图提供者提取的文本集之间的Jaccard距离,这表示差异百分比。在一些市场,差异率高达90%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信