Locality Sensitive Hashing for Set-Queries, Motivated by Group Recommendations

Haim Kaplan, J. Tenenbaum
{"title":"Locality Sensitive Hashing for Set-Queries, Motivated by Group Recommendations","authors":"Haim Kaplan, J. Tenenbaum","doi":"10.4230/LIPIcs.SWAT.2020.28","DOIUrl":null,"url":null,"abstract":"Locality Sensitive Hashing (LSH) is an effective method to index a set of points such that we can efficiently find the nearest neighbors of a query point. We extend this method to our novel Set-query LSH (SLSH), such that it can find the nearest neighbors of a set of points, given as a query. \nLet $ s(x,y) $ be the similarity between two points $ x $ and $ y $. We define a similarity between a set $ Q$ and a point $ x $ by aggregating the similarities $ s(p,x) $ for all $ p\\in Q $. For example, we can take $ s(p,x) $ to be the angular similarity between $ p $ and $ x $ (i.e., $1-{\\angle (x,p)}/{\\pi}$), and aggregate by arithmetic or geometric averaging, or taking the lowest similarity. \nWe develop locality sensitive hash families and data structures for a large set of such arithmetic and geometric averaging similarities, and analyze their collision probabilities. We also establish an analogous framework and hash families for distance functions. Specifically, we give a structure for the euclidean distance aggregated by either averaging or taking the maximum. \nWe leverage SLSH to solve a geometric extension of the approximate near neighbors problem. In this version, we consider a metric for which the unit ball is an ellipsoid and its orientation is specified with the query. \nAn important application that motivates our work is group recommendation systems. Such a system embeds movies and users in the same feature space, and the task of recommending a movie for a group to watch together, translates to a set-query $ Q $ using an appropriate similarity.","PeriodicalId":447445,"journal":{"name":"Scandinavian Workshop on Algorithm Theory","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Scandinavian Workshop on Algorithm Theory","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4230/LIPIcs.SWAT.2020.28","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Locality Sensitive Hashing (LSH) is an effective method to index a set of points such that we can efficiently find the nearest neighbors of a query point. We extend this method to our novel Set-query LSH (SLSH), such that it can find the nearest neighbors of a set of points, given as a query. Let $ s(x,y) $ be the similarity between two points $ x $ and $ y $. We define a similarity between a set $ Q$ and a point $ x $ by aggregating the similarities $ s(p,x) $ for all $ p\in Q $. For example, we can take $ s(p,x) $ to be the angular similarity between $ p $ and $ x $ (i.e., $1-{\angle (x,p)}/{\pi}$), and aggregate by arithmetic or geometric averaging, or taking the lowest similarity. We develop locality sensitive hash families and data structures for a large set of such arithmetic and geometric averaging similarities, and analyze their collision probabilities. We also establish an analogous framework and hash families for distance functions. Specifically, we give a structure for the euclidean distance aggregated by either averaging or taking the maximum. We leverage SLSH to solve a geometric extension of the approximate near neighbors problem. In this version, we consider a metric for which the unit ball is an ellipsoid and its orientation is specified with the query. An important application that motivates our work is group recommendation systems. Such a system embeds movies and users in the same feature space, and the task of recommending a movie for a group to watch together, translates to a set-query $ Q $ using an appropriate similarity.
基于组推荐的集合查询的局部性敏感哈希
局部敏感哈希(Locality Sensitive hash, LSH)是一种有效的索引点的方法,可以有效地找到查询点的最近邻居。我们将此方法扩展到新的集-查询LSH (SLSH),这样它就可以找到作为查询给定的一组点的最近邻居。设$ s(x,y) $为两个点$ x $和$ y $之间的相似度。我们通过集合$ Q$和点$ x $之间的相似性$ s(p,x) $对Q$中所有$ p\的相似性$ s(p,x) $的聚合来定义两者之间的相似性。例如,我们可以取$ s(p,x) $作为$ p $和$ x $之间的角相似度(即$1-{\angle (x,p)}/{\pi}$),并通过算术或几何平均或取最低相似度进行聚合。我们为大量这样的算术和几何平均相似度开发了局部敏感哈希族和数据结构,并分析了它们的碰撞概率。我们还建立了距离函数的类似框架和哈希族。具体地说,我们给出了一个欧几里得距离通过取平均值或取最大值聚合的结构。我们利用SLSH来解决近似近邻问题的几何扩展。在这个版本中,我们考虑一个度量,其中单位球是一个椭球,它的方向是通过查询指定的。激励我们工作的一个重要应用是小组推荐系统。这样的系统将电影和用户嵌入到相同的特征空间中,为一组人推荐一起观看的电影的任务,转化为使用适当相似性的集查询$ Q $。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信