Detecting Advertisement Module Network Behavior with Graph Modeling

2014 Ninth Asia Joint Conference on Information Security Pub Date : 2014-09-01 DOI:10.1109/AsiaJCIS.2014.26

Hiroki Kuzuno, Kenichi Magata

{"title":"Detecting Advertisement Module Network Behavior with Graph Modeling","authors":"Hiroki Kuzuno, Kenichi Magata","doi":"10.1109/AsiaJCIS.2014.26","DOIUrl":null,"url":null,"abstract":"Android applications are widely used and many are 'free' applications which include advertisement (ad) modules that provide ad services and track user behavior statistics. However, these ad modules often collect users' personal information and device identification numbers along with usage statistics, which is a violation of privacy. In our analysis of 1,188 Android applications' network traffic, we identified 797 applications that included 45 previously known ad modules. We analyzed these ad modules' network behavior, and found that they have characteristic network traffic patterns for acquiring ad content, specifically images. In order to accurately differentiate between ad modules' network traffic and valid application network traffic, we propose a novel method based on the distance between network traffic graphs mapping the relationships between HTTP session data (such as HTML or Java Script). This distance describes the similarity between the sessions. Using this method, we can detect ad modules' traffic by comparing session graphs with the graphs of already known ad modules. In our evaluation, we generated 20,903 graphs of applications. We separated the application graphs into those generated by known ad modules (4,698 graphs), those we manually identified as ad modules (2,000 graphs), and standard application traffic. We then applied 1,000 graphs of known ad graphs to the other graph sets (the remaining 3,698 known ad graphs and the 2,000 manually classified ad graphs) to see how accurately they could be used to identify ad graphs. Our approach showed a 76% detection rate for known ad graphs, and a 96% detection rate for manually classified ad graphs.","PeriodicalId":354543,"journal":{"name":"2014 Ninth Asia Joint Conference on Information Security","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 Ninth Asia Joint Conference on Information Security","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AsiaJCIS.2014.26","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

Abstract

Android applications are widely used and many are 'free' applications which include advertisement (ad) modules that provide ad services and track user behavior statistics. However, these ad modules often collect users' personal information and device identification numbers along with usage statistics, which is a violation of privacy. In our analysis of 1,188 Android applications' network traffic, we identified 797 applications that included 45 previously known ad modules. We analyzed these ad modules' network behavior, and found that they have characteristic network traffic patterns for acquiring ad content, specifically images. In order to accurately differentiate between ad modules' network traffic and valid application network traffic, we propose a novel method based on the distance between network traffic graphs mapping the relationships between HTTP session data (such as HTML or Java Script). This distance describes the similarity between the sessions. Using this method, we can detect ad modules' traffic by comparing session graphs with the graphs of already known ad modules. In our evaluation, we generated 20,903 graphs of applications. We separated the application graphs into those generated by known ad modules (4,698 graphs), those we manually identified as ad modules (2,000 graphs), and standard application traffic. We then applied 1,000 graphs of known ad graphs to the other graph sets (the remaining 3,698 known ad graphs and the 2,000 manually classified ad graphs) to see how accurately they could be used to identify ad graphs. Our approach showed a 76% detection rate for known ad graphs, and a 96% detection rate for manually classified ad graphs.

查看原文本刊更多论文

基于图建模的广告模块网络行为检测

Android应用程序被广泛使用，其中许多是“免费”应用程序，其中包含提供广告服务和跟踪用户行为统计的广告(ad)模块。然而，这些广告模块通常会收集用户的个人信息和设备识别码以及使用统计数据，这是对隐私的侵犯。在我们对1188个Android应用的网络流量分析中，我们发现797个应用包含45个已知的广告模块。我们分析了这些广告模块的网络行为，发现它们具有获取广告内容(特别是图像)的特征网络流量模式。为了准确区分广告模块的网络流量和有效的应用程序网络流量，我们提出了一种基于映射HTTP会话数据(如HTML或Java Script)之间关系的网络流量图之间距离的新方法。这个距离描述了会话之间的相似性。使用该方法，我们可以通过将会话图与已知广告模块的图进行比较来检测广告模块的流量。在我们的评估中，我们生成了20,903个应用程序图。我们将应用程序图分为由已知广告模块生成的图(4,698个图)、我们手动识别为广告模块的图(2,000个图)和标准应用程序流量。然后，我们将1000个已知广告图应用到其他图集(剩下的3698个已知广告图和2000个手动分类的广告图)，看看它们用于识别广告图的准确性如何。我们的方法对已知广告图的检测率为76%，对手动分类的广告图的检测率为96%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2014 Ninth Asia Joint Conference on Information Security

自引率

0.00%

发文量