KADetector: Automatic Identification of Key Actors in Online Hack Forums Based on Structured Heterogeneous Information Network

Yiming Zhang, Yujie Fan, Yanfang Ye, Liang Zhao, Jiabin Wang, Qi Xiong, Fudong Shao
{"title":"KADetector: Automatic Identification of Key Actors in Online Hack Forums Based on Structured Heterogeneous Information Network","authors":"Yiming Zhang, Yujie Fan, Yanfang Ye, Liang Zhao, Jiabin Wang, Qi Xiong, Fudong Shao","doi":"10.1109/ICBK.2018.00028","DOIUrl":null,"url":null,"abstract":"Underground forums have been widely used by cybercriminals to exchange knowledge and trade in illicit products or services, which have played a central role in the cybercriminal ecosystem. In order to facilitate the deployment of effective countermeasures, in this paper, we propose and develop an intelligent system named KADetector to automate the analysis of Hack Forums for the identification of its key actors who play the vital role in the value chain. In KADetector, to identify whether the given users are key actors, we not only analyze their posted threads, but also utilize various kinds of relations among users, threads, replies, comments, sections and topics. To model the rich semantic relationships, we first introduce a structured heterogeneous information network (HIN) for representation and then use a meta-path based approach to incorporate higher-level semantics to build up relatedness over users in Hack Forums. To reduce the high computation and space cost, given different meta-paths built from the HIN, we propose a new HIN embedding model named ActorHin2Vec to learn the low-dimensional representations for the nodes in HIN. After that, a classifier is built for key actor identification. To the best of our knowledge, this is the first work to use structured HIN for underground participant analysis. Comprehensive experiments on the data collections from Hack Forums are conducted to validate the effectiveness of our developed system KADetector in key actor identification by comparisons with alternative methods.","PeriodicalId":144958,"journal":{"name":"2018 IEEE International Conference on Big Knowledge (ICBK)","volume":"54 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE International Conference on Big Knowledge (ICBK)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICBK.2018.00028","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8

Abstract

Underground forums have been widely used by cybercriminals to exchange knowledge and trade in illicit products or services, which have played a central role in the cybercriminal ecosystem. In order to facilitate the deployment of effective countermeasures, in this paper, we propose and develop an intelligent system named KADetector to automate the analysis of Hack Forums for the identification of its key actors who play the vital role in the value chain. In KADetector, to identify whether the given users are key actors, we not only analyze their posted threads, but also utilize various kinds of relations among users, threads, replies, comments, sections and topics. To model the rich semantic relationships, we first introduce a structured heterogeneous information network (HIN) for representation and then use a meta-path based approach to incorporate higher-level semantics to build up relatedness over users in Hack Forums. To reduce the high computation and space cost, given different meta-paths built from the HIN, we propose a new HIN embedding model named ActorHin2Vec to learn the low-dimensional representations for the nodes in HIN. After that, a classifier is built for key actor identification. To the best of our knowledge, this is the first work to use structured HIN for underground participant analysis. Comprehensive experiments on the data collections from Hack Forums are conducted to validate the effectiveness of our developed system KADetector in key actor identification by comparisons with alternative methods.
KADetector:基于结构化异构信息网络的在线黑客论坛关键行为者自动识别
网络犯罪分子广泛使用地下论坛来交换知识和交易非法产品或服务,这在网络犯罪生态系统中发挥了核心作用。为了促进有效对策的部署,在本文中,我们提出并开发了一个名为KADetector的智能系统,用于自动分析黑客论坛,以识别在价值链中发挥重要作用的关键参与者。在KADetector中,为了识别给定的用户是否是关键角色,我们不仅分析了他们发布的线程,还利用了用户、线程、回复、评论、章节和主题之间的各种关系。为了对丰富的语义关系进行建模,我们首先引入结构化异构信息网络(HIN)进行表示,然后使用基于元路径的方法合并更高级的语义来构建Hack论坛中用户之间的相关性。为了降低HIN的高计算量和空间成本,在给定HIN构建的不同元路径的情况下,我们提出了一种新的HIN嵌入模型ActorHin2Vec来学习HIN中节点的低维表示。然后,构建用于关键参与者识别的分类器。据我们所知,这是第一个使用结构化HIN进行地下参与者分析的工作。对黑客论坛收集的数据进行了全面的实验,通过与其他方法的比较,验证了我们开发的系统KADetector在关键行为者识别方面的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信