Detecting geographic locations from web resources

Chuang Wang, Xing Xie, Lee Wang, Yansheng Lu, Wei-Ying Ma
Workshop on Geographic Information Retrieval, 2005-11-04
DOI: 10.1145/1096985.1096991 (https://doi.org/10.1145/1096985.1096991)
Citations: 89

Abstract

As the web pervades users' daily lives, capturing location-specific information on the web has become increasingly important, because most human activities occur close to where a user is located. This is especially true in the increasingly popular mobile and local search environments. Thus, how to correctly and effectively detect geographic locations from web resources has become a key challenge for location-based web applications. In our previous work, we proposed to explicitly distinguish three types of locations for web resources: provider location, content location, and serving location. Provider location is the physical location of the provider who owns the web resource; content location is the geographic location described in the web content; and serving location is the geographic scope that a web resource can reach. In this paper, we present a system that comprehensively employs a set of algorithms and different geographic sources, extracting geographic information from the web content and mining hyperlink structures as well as user logs. As a result, only the relevant geographic sources, rather than all possible ones, are used in computing each category of web location. Finally, experimental results on large samples of web data show that our solution outperforms previous approaches.
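
The three-way distinction in the abstract can be sketched as a small data structure. The sketch below is illustrative only: the abstract names the three location types and the three kinds of evidence (web content, hyperlink structures, user logs), but it does not say which evidence feeds which location type. The `RELEVANT_SOURCES` mapping, the `select_evidence` helper, and the sample place names are therefore assumptions made for this example, not the authors' method.

```python
from enum import Enum


class LocationType(Enum):
    PROVIDER = "provider"  # physical location of the provider who owns the resource
    CONTENT = "content"    # geographic location described in the web content
    SERVING = "serving"    # geographic scope that the resource can reach


# ASSUMPTION: this mapping is hypothetical. The abstract only states that
# each location type is computed from its relevant sources, not which ones.
RELEVANT_SOURCES = {
    LocationType.PROVIDER: ("content",),
    LocationType.CONTENT: ("content",),
    LocationType.SERVING: ("hyperlinks", "user_logs"),
}


def select_evidence(location_type, evidence):
    """Keep only the evidence whose source is relevant to the location type."""
    allowed = RELEVANT_SOURCES[location_type]
    return {source: names for source, names in evidence.items() if source in allowed}


# Made-up evidence for a single web resource, keyed by geographic source.
evidence = {
    "content": ["Seattle"],
    "hyperlinks": ["Seattle", "Portland"],
    "user_logs": ["Washington"],
}

serving_evidence = select_evidence(LocationType.SERVING, evidence)
content_evidence = select_evidence(LocationType.CONTENT, evidence)
```

Filtering the evidence before any location is computed mirrors the abstract's point that only relevant geographic sources, rather than all possible ones, contribute to each category.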