关于机器学习作为自动安全分类工具的可行性:立场文件

2016 International Conference on Computing, Networking and Communications (ICNC) Pub Date : 2016-03-24 DOI:10.1109/ICCNC.2016.7440588

A. Yazidi, H. Hammer, A. Bai, P. Engelstad

{"title":"关于机器学习作为自动安全分类工具的可行性:立场文件","authors":"A. Yazidi, H. Hammer, A. Bai, P. Engelstad","doi":"10.1109/ICCNC.2016.7440588","DOIUrl":null,"url":null,"abstract":"With the proliferation of threats of leakage of sensitive information such as military classified documents, information guards have recently gained increased interest. An information guard is merely a filter than controls the content of the exchanged information between two domains where one of them has a higher confidentiality level than the other one. The main role of an information guard is to block leakage of the sensitive information from the higher confidentiality domain to the lower confidentiality domain. An example of a higher confidentiality domain is a military network while a subcontractor network is an example of a lower confidentiality domain. The common practice is to use an automatic information guard based on predefined list of words that is called \"dirty word list\" in order to decide the security level of a document and consequently release it to the lower confidentially domain or block it. Traditional information guards are configured manually based on the notion of \"Dirty Lists\". The classification logic of traditional information guards uses the occurrence of words from the \"Dirty Lists\". In this paper, we advocate the use of machine learning as a corner stone for building advanced information guards. Machine learning can also be used as a supplement to the decision obtained based on \"Dirty Lists\" classification. Machine learning has hardly been analysed for this problem, and the analysis on topical classification presented here provides new knowledge and a basis for further work within this area. Ten different machine learning algorithms were applied on real life data from a military context. Presented results are promising and demonstrates that machine learning can become a useful tool to assist humans in determining the appropriate security label of an information object.","PeriodicalId":308458,"journal":{"name":"2016 International Conference on Computing, Networking and Communications (ICNC)","volume":"60 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"On the feasibility of machine learning as a tool for automatic security classification: A position paper\",\"authors\":\"A. Yazidi, H. Hammer, A. Bai, P. Engelstad\",\"doi\":\"10.1109/ICCNC.2016.7440588\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the proliferation of threats of leakage of sensitive information such as military classified documents, information guards have recently gained increased interest. An information guard is merely a filter than controls the content of the exchanged information between two domains where one of them has a higher confidentiality level than the other one. The main role of an information guard is to block leakage of the sensitive information from the higher confidentiality domain to the lower confidentiality domain. An example of a higher confidentiality domain is a military network while a subcontractor network is an example of a lower confidentiality domain. The common practice is to use an automatic information guard based on predefined list of words that is called \\\"dirty word list\\\" in order to decide the security level of a document and consequently release it to the lower confidentially domain or block it. Traditional information guards are configured manually based on the notion of \\\"Dirty Lists\\\". The classification logic of traditional information guards uses the occurrence of words from the \\\"Dirty Lists\\\". In this paper, we advocate the use of machine learning as a corner stone for building advanced information guards. Machine learning can also be used as a supplement to the decision obtained based on \\\"Dirty Lists\\\" classification. Machine learning has hardly been analysed for this problem, and the analysis on topical classification presented here provides new knowledge and a basis for further work within this area. Ten different machine learning algorithms were applied on real life data from a military context. Presented results are promising and demonstrates that machine learning can become a useful tool to assist humans in determining the appropriate security label of an information object.\",\"PeriodicalId\":308458,\"journal\":{\"name\":\"2016 International Conference on Computing, Networking and Communications (ICNC)\",\"volume\":\"60 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-03-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 International Conference on Computing, Networking and Communications (ICNC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCNC.2016.7440588\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 International Conference on Computing, Networking and Communications (ICNC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCNC.2016.7440588","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

最近，随着军事机密文件等敏感信息泄露威胁的扩散，信息警卫受到了越来越多的关注。信息保护仅仅是一个过滤器，它控制两个域之间交换信息的内容，其中一个域比另一个域具有更高的机密级别。信息防护的主要作用是阻止敏感信息从高机密域向低机密域泄露。较高机密域的一个例子是军事网络，而分包商网络是较低机密域的一个例子。通常的做法是使用基于预定义的单词列表(称为“脏单词列表”)的自动信息保护来确定文档的安全级别，从而将其释放到较低的机密域或阻止它。传统的信息保护是基于“脏列表”的概念手动配置的。传统信息防护的分类逻辑是利用“脏表”中单词的出现情况。在本文中，我们提倡使用机器学习作为构建高级信息防护的基石。机器学习也可以作为基于“Dirty Lists”分类得到的决策的补充。机器学习几乎没有针对这个问题进行分析，本文对主题分类的分析为该领域的进一步工作提供了新的知识和基础。十种不同的机器学习算法应用于军事背景下的真实生活数据。所呈现的结果是有希望的，并且表明机器学习可以成为帮助人类确定信息对象的适当安全标签的有用工具。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

On the feasibility of machine learning as a tool for automatic security classification: A position paper

With the proliferation of threats of leakage of sensitive information such as military classified documents, information guards have recently gained increased interest. An information guard is merely a filter than controls the content of the exchanged information between two domains where one of them has a higher confidentiality level than the other one. The main role of an information guard is to block leakage of the sensitive information from the higher confidentiality domain to the lower confidentiality domain. An example of a higher confidentiality domain is a military network while a subcontractor network is an example of a lower confidentiality domain. The common practice is to use an automatic information guard based on predefined list of words that is called "dirty word list" in order to decide the security level of a document and consequently release it to the lower confidentially domain or block it. Traditional information guards are configured manually based on the notion of "Dirty Lists". The classification logic of traditional information guards uses the occurrence of words from the "Dirty Lists". In this paper, we advocate the use of machine learning as a corner stone for building advanced information guards. Machine learning can also be used as a supplement to the decision obtained based on "Dirty Lists" classification. Machine learning has hardly been analysed for this problem, and the analysis on topical classification presented here provides new knowledge and a basis for further work within this area. Ten different machine learning algorithms were applied on real life data from a military context. Presented results are promising and demonstrates that machine learning can become a useful tool to assist humans in determining the appropriate security label of an information object.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2016 International Conference on Computing, Networking and Communications (ICNC)

自引率

0.00%

发文量