An IP Geolocation Database Evaluation and Fusion Model Based on Data Correlation and Delay Similarity

Xie Bo, Li Han, Wang Yong
{"title":"An IP Geolocation Database Evaluation and Fusion Model Based on Data Correlation and Delay Similarity","authors":"Xie Bo, Li Han, Wang Yong","doi":"10.1145/3291842.3291876","DOIUrl":null,"url":null,"abstract":"IP geolocation database is widely used in many Internet services. At present, there are many inaccurate or missing geolocations in IP geolocation databases. However, the industry lacks an effective method to evaluate them. Based on the assumption that the majority of entries in well-known databases are correct and delay measurement, this paper proposed an IP geolocation database evaluation and fusion model based on data correlation and delay similarity. Firstly, we improved the previous evaluation model based on data-consistency-rate by introducing geolocation coverage rate at different granularities. Secondly, by measuring the delays of IP addresses at large scale, the standard delay of a geographical city is determined, then, we calculated the delay similarity rate of different databases between IP's own delay and its geolocation city's standard delay. Thirdly, we used the weighted voting method to fuse inconsistent geolocations among databases, where the vote share is determined by the improved data-consistency-rate and delay similarity rate, and presented a sole fusion database. Finally, we took 340 million IP addresses allocated to mainland China as an example, compared with the existing model, the accuracy of the model we proposed is increased by 8.79%.","PeriodicalId":283197,"journal":{"name":"Proceedings of the 2nd International Conference on Telecommunications and Communication Engineering","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2nd International Conference on Telecommunications and Communication Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3291842.3291876","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

IP geolocation database is widely used in many Internet services. At present, there are many inaccurate or missing geolocations in IP geolocation databases. However, the industry lacks an effective method to evaluate them. Based on the assumption that the majority of entries in well-known databases are correct and delay measurement, this paper proposed an IP geolocation database evaluation and fusion model based on data correlation and delay similarity. Firstly, we improved the previous evaluation model based on data-consistency-rate by introducing geolocation coverage rate at different granularities. Secondly, by measuring the delays of IP addresses at large scale, the standard delay of a geographical city is determined, then, we calculated the delay similarity rate of different databases between IP's own delay and its geolocation city's standard delay. Thirdly, we used the weighted voting method to fuse inconsistent geolocations among databases, where the vote share is determined by the improved data-consistency-rate and delay similarity rate, and presented a sole fusion database. Finally, we took 340 million IP addresses allocated to mainland China as an example, compared with the existing model, the accuracy of the model we proposed is increased by 8.79%.
一种基于数据相关性和延迟相似度的IP地理定位数据库评价与融合模型
IP地理定位数据库被广泛应用于许多互联网服务中。目前,IP地理定位数据库中存在许多不准确或缺失的地理信息。然而,业内缺乏一种有效的评估方法。在假设知名数据库中大部分条目都是正确的并考虑延迟度量的前提下,提出了一种基于数据相关性和延迟相似性的IP地理位置数据库评价与融合模型。首先,引入不同粒度的地理位置覆盖率,改进了基于数据一致性率的评价模型;其次,通过大规模测量IP地址的时延,确定地理城市的标准时延,然后计算不同数据库中IP地址自身时延与其所在地理城市标准时延的时延相似率。第三,采用加权投票法对数据库间不一致的地理位置进行融合,通过提高数据一致性率和延迟相似率来确定投票份额,形成单一的融合数据库;最后,以分配给中国大陆的3.4亿个IP地址为例,与现有模型相比,我们提出的模型的准确率提高了8.79%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信