ATLAS: Automatically Detecting Discrepancies Between Privacy Policies and Privacy Labels

Akshatha Jain, David Rodríguez Torrado, J. D. Álamo, N. Sadeh
{"title":"ATLAS: Automatically Detecting Discrepancies Between Privacy Policies and Privacy Labels","authors":"Akshatha Jain, David Rodríguez Torrado, J. D. Álamo, N. Sadeh","doi":"10.1109/EuroSPW59978.2023.00016","DOIUrl":null,"url":null,"abstract":"Privacy policies are long, complex documents that end-users seldom read. Privacy labels aim to ameliorate these issues by providing succinct summaries of salient data practices. In December 2020, Apple began requiring that app developers submit privacy labels describing their apps’ data practices. Yet, research suggests that app developers often struggle to do so. In this paper, we automatically identify possible discrepancies between mobile app privacy policies and their privacy labels. Such discrepancies could be indicators of potential privacy compliance issues. We introduce the Automated Privacy Label Analysis System (ATLAS). ATLAS includes three components: a pipeline to systematically retrieve iOS App Store listings and privacy policies; an ensemble-based classifier capable of predicting privacy labels from the text of privacy policies with 91.3% accuracy using state-of-the-art NLP techniques; and a discrepancy analysis mechanism that enables a large-scale privacy analysis of the iOS App Store. Our system has enabled us to analyze 354,725 iOS apps. We find several interesting trends. For example, only 40.3% of apps in the App Store provide easily accessible privacy policies, and only 29.6% of apps provide both accessible privacy policies and privacy labels. Among apps that provide both, 88.0% have at least one possible discrepancy between the text of their privacy policy and their privacy label, which could be indicative of a potential compliance issue. We find that, on average, apps have 5.32 such potential compliance issues. We hope that ATLAS will help app developers, researchers, regulators, and mobile app stores alike. For example, app developers could use our classifier to check for discrepancies between their privacy policies and privacy labels, and regulators could use our system to help review apps at scale for potential compliance issues.","PeriodicalId":220415,"journal":{"name":"2023 IEEE European Symposium on Security and Privacy Workshops (EuroS&PW)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE European Symposium on Security and Privacy Workshops (EuroS&PW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/EuroSPW59978.2023.00016","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Privacy policies are long, complex documents that end-users seldom read. Privacy labels aim to ameliorate these issues by providing succinct summaries of salient data practices. In December 2020, Apple began requiring that app developers submit privacy labels describing their apps’ data practices. Yet, research suggests that app developers often struggle to do so. In this paper, we automatically identify possible discrepancies between mobile app privacy policies and their privacy labels. Such discrepancies could be indicators of potential privacy compliance issues. We introduce the Automated Privacy Label Analysis System (ATLAS). ATLAS includes three components: a pipeline to systematically retrieve iOS App Store listings and privacy policies; an ensemble-based classifier capable of predicting privacy labels from the text of privacy policies with 91.3% accuracy using state-of-the-art NLP techniques; and a discrepancy analysis mechanism that enables a large-scale privacy analysis of the iOS App Store. Our system has enabled us to analyze 354,725 iOS apps. We find several interesting trends. For example, only 40.3% of apps in the App Store provide easily accessible privacy policies, and only 29.6% of apps provide both accessible privacy policies and privacy labels. Among apps that provide both, 88.0% have at least one possible discrepancy between the text of their privacy policy and their privacy label, which could be indicative of a potential compliance issue. We find that, on average, apps have 5.32 such potential compliance issues. We hope that ATLAS will help app developers, researchers, regulators, and mobile app stores alike. For example, app developers could use our classifier to check for discrepancies between their privacy policies and privacy labels, and regulators could use our system to help review apps at scale for potential compliance issues.
ATLAS:自动检测隐私政策和隐私标签之间的差异
隐私政策是长而复杂的文档,最终用户很少阅读。隐私标签旨在通过提供重要数据实践的简洁摘要来改善这些问题。2020年12月,苹果开始要求应用程序开发人员提交描述其应用程序数据实践的隐私标签。然而,研究表明,应用程序开发者往往很难做到这一点。在本文中,我们自动识别移动应用程序隐私政策与其隐私标签之间可能存在的差异。这种差异可能是潜在隐私合规问题的指标。我们介绍了自动隐私标签分析系统(ATLAS)。ATLAS包括三个组成部分:系统地检索iOS App Store列表和隐私政策的管道;一个基于集成的分类器,能够使用最先进的NLP技术从隐私策略的文本中预测隐私标签,准确率为91.3%;以及一个差异分析机制,可以对iOS应用商店进行大规模的隐私分析。我们的系统使我们能够分析354725个iOS应用程序。我们发现了几个有趣的趋势。例如,App Store中只有40.3%的应用程序提供易于访问的隐私政策,只有29.6%的应用程序同时提供易于访问的隐私政策和隐私标签。在两者都提供的应用程序中,88.0%的应用程序的隐私政策和隐私标签之间至少有一处可能存在差异,这可能表明存在潜在的合规问题。我们发现,平均而言,应用程序有5.32个这样的潜在合规问题。我们希望ATLAS能够帮助应用开发者、研究人员、监管机构和移动应用商店。例如,应用程序开发人员可以使用我们的分类器来检查他们的隐私政策和隐私标签之间的差异,监管机构可以使用我们的系统来帮助大规模审查应用程序,以发现潜在的合规问题。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信