People searching for people: analysis of a people search engine log

W. Weerkamp, R. Berendsen, B. Kovachev, E. Meij, K. Balog, M. de Rijke
{"title":"People searching for people: analysis of a people search engine log","authors":"W. Weerkamp, R. Berendsen, B. Kovachev, E. Meij, K. Balog, M. de Rijke","doi":"10.1145/2009916.2009927","DOIUrl":null,"url":null,"abstract":"Recent years show an increasing interest in vertical search: searching within a particular type of information. Understanding what people search for in these \"verticals\" gives direction to research and provides pointers for the search engines themselves. In this paper we analyze the search logs of one particular vertical: people search engines. Based on an extensive analysis of the logs of a search engine geared towards finding people, we propose a classification scheme for people search at three levels: (a) queries, (b) sessions, and (c) users. For queries, we identify three types, (i) event-based high-profile queries (people that become \"popular\" because of an event happening), (ii) regular high-profile queries (celebrities), and (iii) low-profile queries (other, less-known people). We present experiments on automatic classification of queries. On the session level, we observe five types: (i) family sessions (users looking for relatives), (ii) event sessions (querying the main players of an event), (iii) spotting sessions (trying to \"spot\" different celebrities online), (iv) polymerous sessions (sessions without a clear relation between queries), and (v) repetitive sessions (query refinement and copying). Finally, for users we identify four types: (i) monitors, (ii) spotters, (iii) followers, and (iv) polymers. Our findings not only offer insight into search behavior in people search engines, but they are also useful to identify future research directions and to provide pointers for search engine improvements.","PeriodicalId":356580,"journal":{"name":"Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval","volume":"179 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"46","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2009916.2009927","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 46

Abstract

Recent years show an increasing interest in vertical search: searching within a particular type of information. Understanding what people search for in these "verticals" gives direction to research and provides pointers for the search engines themselves. In this paper we analyze the search logs of one particular vertical: people search engines. Based on an extensive analysis of the logs of a search engine geared towards finding people, we propose a classification scheme for people search at three levels: (a) queries, (b) sessions, and (c) users. For queries, we identify three types, (i) event-based high-profile queries (people that become "popular" because of an event happening), (ii) regular high-profile queries (celebrities), and (iii) low-profile queries (other, less-known people). We present experiments on automatic classification of queries. On the session level, we observe five types: (i) family sessions (users looking for relatives), (ii) event sessions (querying the main players of an event), (iii) spotting sessions (trying to "spot" different celebrities online), (iv) polymerous sessions (sessions without a clear relation between queries), and (v) repetitive sessions (query refinement and copying). Finally, for users we identify four types: (i) monitors, (ii) spotters, (iii) followers, and (iv) polymers. Our findings not only offer insight into search behavior in people search engines, but they are also useful to identify future research directions and to provide pointers for search engine improvements.
人搜索人:分析人搜索引擎日志
近年来,人们对垂直搜索越来越感兴趣:在特定类型的信息中进行搜索。了解人们在这些“垂直领域”中搜索什么,可以为研究提供方向,并为搜索引擎本身提供指针。本文分析了一个特定垂直领域的搜索日志:人员搜索引擎。基于对用于查找人员的搜索引擎日志的广泛分析,我们提出了三个层次的人员搜索分类方案:(a)查询、(b)会话和(c)用户。对于查询,我们确定了三种类型,(i)基于事件的高调查询(因为事件发生而变得“流行”的人),(ii)常规高调查询(名人),以及(iii)低调查询(其他,不太知名的人)。我们提出了对查询进行自动分类的实验。在会话级别上,我们观察到五种类型:(i)家庭会话(用户寻找亲戚),(ii)事件会话(查询事件的主要参与者),(iii)发现会话(试图在网上“发现”不同的名人),(iv)聚合会话(查询之间没有明确关系的会话),以及(v)重复会话(查询优化和复制)。最后,对于用户,我们确定了四种类型:(i)监视器,(ii)观测者,(iii)追随者和(iv)聚合物。我们的发现不仅提供了对人们在搜索引擎中的搜索行为的洞察,而且对确定未来的研究方向和提供搜索引擎改进的指针也很有用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信