Towards efficient and accurate privacy preserving web search

Proceedings of the 9th Workshop on Middleware for Next Generation Internet Computing Pub Date : 2014-12-08 DOI:10.1145/2676733.2676734

Albin Petit, Sonia Ben Mokhtar, L. Brunie, H. Kosch

{"title":"Towards efficient and accurate privacy preserving web search","authors":"Albin Petit, Sonia Ben Mokhtar, L. Brunie, H. Kosch","doi":"10.1145/2676733.2676734","DOIUrl":null,"url":null,"abstract":"Querying Web search engines is by far the most frequent activity performed by online users and consequently the one in which they are likely to reveal a significant amount of personal information. Protecting the privacy of Web requesters is thus becoming increasingly important. This is often done by using systems that guarantee unlinkability between the requester and her query. The most effective solution to reach this objective is the use of anonymous communication protocols (e.g., onion routing [10]). However, according to [14], anonymity might not resist to machine learning attacks. Thus, an adversary could link a query to her requester's public profile. Other approaches (e.g., [8,17]) guarantee unidentifiability of the user interests by generating noise (e.g., creating covert queries or adding extra keywords). However, these solutions overload the network and decrease the accuracy of the results. We present in this paper the first contribution that combines both approaches. It allows a user to perform a private Web search resistant to machine learning attacks while slightly decreasing the relevance of the results. Our three stage architecture contains: (1) a Privacy Proxy that relies on two non-colluding servers to hide the requester identity from the search engine; (2) a Linkability Assessment that analyses the risk that a request is reassociated with the identity of the requester; (3) an Obfuscator that protects the queries which have been flagged linkable by the linkability assessment.","PeriodicalId":276727,"journal":{"name":"Proceedings of the 9th Workshop on Middleware for Next Generation Internet Computing","volume":"158 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"15","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 9th Workshop on Middleware for Next Generation Internet Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2676733.2676734","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 15

Abstract

Querying Web search engines is by far the most frequent activity performed by online users and consequently the one in which they are likely to reveal a significant amount of personal information. Protecting the privacy of Web requesters is thus becoming increasingly important. This is often done by using systems that guarantee unlinkability between the requester and her query. The most effective solution to reach this objective is the use of anonymous communication protocols (e.g., onion routing [10]). However, according to [14], anonymity might not resist to machine learning attacks. Thus, an adversary could link a query to her requester's public profile. Other approaches (e.g., [8,17]) guarantee unidentifiability of the user interests by generating noise (e.g., creating covert queries or adding extra keywords). However, these solutions overload the network and decrease the accuracy of the results. We present in this paper the first contribution that combines both approaches. It allows a user to perform a private Web search resistant to machine learning attacks while slightly decreasing the relevance of the results. Our three stage architecture contains: (1) a Privacy Proxy that relies on two non-colluding servers to hide the requester identity from the search engine; (2) a Linkability Assessment that analyses the risk that a request is reassociated with the identity of the requester; (3) an Obfuscator that protects the queries which have been flagged linkable by the linkability assessment.

查看原文本刊更多论文

迈向高效、准确的隐私保护网络搜索

查询Web搜索引擎是迄今为止在线用户执行的最频繁的活动，因此他们可能会在其中泄露大量个人信息。因此，保护Web请求者的隐私变得越来越重要。这通常是通过使用保证请求者与其查询之间不可链接的系统来实现的。实现这一目标的最有效解决方案是使用匿名通信协议(例如，洋葱路由[10])。然而，根据[14]的说法，匿名可能无法抵抗机器学习攻击。因此，攻击者可以将查询链接到其请求者的公共配置文件。其他方法(例如[8,17])通过产生噪声(例如，创建隐蔽查询或添加额外的关键字)来保证用户兴趣的不可识别性。然而，这些解决方案使网络过载，降低了结果的准确性。我们在本文中提出了结合这两种方法的第一个贡献。它允许用户执行私有Web搜索，以抵抗机器学习攻击，同时略微降低结果的相关性。我们的三阶段架构包含:(1)一个隐私代理，它依赖于两个不串通的服务器来对搜索引擎隐藏请求者的身份;(2)可链接性评估，分析请求与请求者身份重新关联的风险;(3)一个混淆器，保护查询已被标记可链接的可链接性评估。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 9th Workshop on Middleware for Next Generation Internet Computing

自引率

0.00%

发文量