Identifying unfamiliar callers’ professions from privacy-preserving mobile phone data

Jiaquan Zhang, Xiaoming Yao, Xiaoming Fu
{"title":"Identifying unfamiliar callers’ professions from privacy-preserving mobile phone data","authors":"Jiaquan Zhang, Xiaoming Yao, Xiaoming Fu","doi":"10.1109/MSN50589.2020.00088","DOIUrl":null,"url":null,"abstract":"Identifying an unfamiliar caller’s profession is important to protect citizens’ personal safety and property. Due to limited data protection of many popular online services in some countries such as taxi hailing or takeouts ordering, many users encounter an increasing number of phone calls from strangers. This may aggravate the situation that criminals pretend to be delivery staff or taxi drivers, bringing threats to the society. Additionally, many people nowadays suffer from excessive digital marketing and fraud phone calls because of personal information leakage. However, previous works on malicious call detection only focused on binary classification, and do not work for identification of multiple professions. We observed that web service requests issued from users’ mobile phones which may show their Apps preferences, spatial and temporal patterns, and other profession related information. This offers us a hint to identify unfamiliar callers. In fact, some previous works already leveraged raw data from mobile phones (which includes sensitive information) for personality studies. However, accessing users’ mobile phone raw data may violate the more and more strict private data protection policies or regulations (e.g. GDPR 71). Using appropriate statistical methods to eliminate private information and preserve personal characteristics, provides a way to identify mobile phone callers without privacy concern. In this paper, we exploit privacy-preserving mobile data to develop a model which can automatically identify the callers who are divided into four categories of users: normal users (other professions), taxi drivers, delivery and takeouts staffs, telemarketers and fraudsters. The validation results over an anonymized dataset of 1,282 users with a period of 3 months in Shanghai City prove that the proposed model could achieve an accuracy of 75+%.","PeriodicalId":447605,"journal":{"name":"2020 16th International Conference on Mobility, Sensing and Networking (MSN)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 16th International Conference on Mobility, Sensing and Networking (MSN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MSN50589.2020.00088","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Identifying an unfamiliar caller’s profession is important to protect citizens’ personal safety and property. Due to limited data protection of many popular online services in some countries such as taxi hailing or takeouts ordering, many users encounter an increasing number of phone calls from strangers. This may aggravate the situation that criminals pretend to be delivery staff or taxi drivers, bringing threats to the society. Additionally, many people nowadays suffer from excessive digital marketing and fraud phone calls because of personal information leakage. However, previous works on malicious call detection only focused on binary classification, and do not work for identification of multiple professions. We observed that web service requests issued from users’ mobile phones which may show their Apps preferences, spatial and temporal patterns, and other profession related information. This offers us a hint to identify unfamiliar callers. In fact, some previous works already leveraged raw data from mobile phones (which includes sensitive information) for personality studies. However, accessing users’ mobile phone raw data may violate the more and more strict private data protection policies or regulations (e.g. GDPR 71). Using appropriate statistical methods to eliminate private information and preserve personal characteristics, provides a way to identify mobile phone callers without privacy concern. In this paper, we exploit privacy-preserving mobile data to develop a model which can automatically identify the callers who are divided into four categories of users: normal users (other professions), taxi drivers, delivery and takeouts staffs, telemarketers and fraudsters. The validation results over an anonymized dataset of 1,282 users with a period of 3 months in Shanghai City prove that the proposed model could achieve an accuracy of 75+%.
从保护隐私的移动电话数据中识别不熟悉的来电者的职业
识别陌生来电者的职业对保护公民的人身安全和财产安全至关重要。由于一些国家对许多流行的在线服务(如打车或叫外卖)的数据保护有限,许多用户遇到越来越多的陌生人电话。这可能会加剧犯罪分子冒充快递员或出租车司机的情况,给社会带来威胁。此外,由于个人信息泄露,现在许多人遭受过多的数字营销和诈骗电话。然而,以往的恶意呼叫检测工作主要集中在二值分类上,无法进行多职业的识别。我们观察到,用户手机发出的web服务请求可能会显示他们的应用偏好、时空模式以及其他与职业相关的信息。这为我们提供了一个识别不熟悉的来电者的提示。事实上,之前的一些研究已经利用了来自手机的原始数据(其中包括敏感信息)进行个性研究。然而,访问用户的手机原始数据可能会违反越来越严格的私人数据保护政策或法规(例如GDPR 71)。使用适当的统计方法来消除私人信息和保留个人特征,提供了一种无需担心隐私的方式来识别移动电话呼叫者。在本文中,我们利用保护隐私的移动数据开发了一个模型,可以自动识别呼叫者分为四类用户:普通用户(其他职业),出租车司机,外卖和外卖员工,电话推销员和骗子。对上海市1282个用户3个月的匿名数据集的验证结果表明,该模型可以达到75%以上的准确率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信