Random forest for relational classification with application to terrorist profiling

Jian Xu, Jianhua Chen, Bin Li
{"title":"Random forest for relational classification with application to terrorist profiling","authors":"Jian Xu, Jianhua Chen, Bin Li","doi":"10.1109/GRC.2009.5255041","DOIUrl":null,"url":null,"abstract":"We study the problem of detecting and profiling terrorists using a combination of an ensemble classifier, namely random forest and relational information. Given a database for a set of individuals characterized by both “local” attributes such as age and criminal background, and “relational” information such as communications among a subset of the individuals, with a subset of the individuals labeled as terrorist or normal people, our task is to design a classifier that captures the patterns of terrorists and achieves good accuracy in predicting the labels of the remaining part of the database. In previous work, a hybrid approach was presented that iteratively applies a flat classifier (such as decision trees, fuzzy clustering) augmented with flattened relational attributes for learning and classification. In the current work, our approach is to use random forest as the “flat” classifier in the terrorist detection setting. Random forest is known to have advantage in handling tasks with high dimensionality in input data. This merit of random forest method is very useful for relational learning if the number of “flattened” relational attributes is quite large, which is indeed the case for the terrorist detection task. We report our experiments on a synthetic terrorist database that compare the prediction accuracy of random forest with two other “flat” classifiers, namely, ordinary decision tree and fuzzy clustering. The experimental results show that random forest outperforms both ordinary decision tree and fuzzy clustering. random forest","PeriodicalId":388774,"journal":{"name":"2009 IEEE International Conference on Granular Computing","volume":"36 1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 IEEE International Conference on Granular Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/GRC.2009.5255041","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9

Abstract

We study the problem of detecting and profiling terrorists using a combination of an ensemble classifier, namely random forest and relational information. Given a database for a set of individuals characterized by both “local” attributes such as age and criminal background, and “relational” information such as communications among a subset of the individuals, with a subset of the individuals labeled as terrorist or normal people, our task is to design a classifier that captures the patterns of terrorists and achieves good accuracy in predicting the labels of the remaining part of the database. In previous work, a hybrid approach was presented that iteratively applies a flat classifier (such as decision trees, fuzzy clustering) augmented with flattened relational attributes for learning and classification. In the current work, our approach is to use random forest as the “flat” classifier in the terrorist detection setting. Random forest is known to have advantage in handling tasks with high dimensionality in input data. This merit of random forest method is very useful for relational learning if the number of “flattened” relational attributes is quite large, which is indeed the case for the terrorist detection task. We report our experiments on a synthetic terrorist database that compare the prediction accuracy of random forest with two other “flat” classifiers, namely, ordinary decision tree and fuzzy clustering. The experimental results show that random forest outperforms both ordinary decision tree and fuzzy clustering. random forest
随机森林的关系分类及其在恐怖分子特征分析中的应用
我们使用集成分类器,即随机森林和关系信息的组合来研究检测和分析恐怖分子的问题。给定一个数据库,其中包含一组具有“本地”属性(如年龄和犯罪背景)和“关系”信息(如一组个体之间的通信)特征的个体,其中一组个体被标记为恐怖分子或正常人,我们的任务是设计一个分类器,该分类器捕获恐怖分子的模式,并在预测数据库其余部分的标签方面达到良好的准确性。在之前的工作中,提出了一种混合方法,迭代地应用平面分类器(如决策树,模糊聚类),增强平面关系属性进行学习和分类。在目前的工作中,我们的方法是在恐怖分子检测设置中使用随机森林作为“平面”分类器。随机森林在处理输入数据的高维任务方面具有优势。随机森林方法的这一优点对于关系学习非常有用,如果“扁平”的关系属性的数量相当大,这确实是恐怖分子检测任务的情况。我们报告了我们在一个合成恐怖分子数据库上的实验,将随机森林的预测精度与另外两种“平面”分类器(即普通决策树和模糊聚类)进行了比较。实验结果表明,随机森林优于普通决策树和模糊聚类。随机森林
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信