Explainable machine learning-based cybersecurity detection using LIME and Secml

2023 IEEE Jordan International Joint Conference on Electrical Engineering and Information Technology (JEEIT) Pub Date : 2023-05-22 DOI:10.1109/JEEIT58638.2023.10185893

Sawsan Alodibat, Ashraf Ahmad, Mohammad Azzeh

{"title":"Explainable machine learning-based cybersecurity detection using LIME and Secml","authors":"Sawsan Alodibat, Ashraf Ahmad, Mohammad Azzeh","doi":"10.1109/JEEIT58638.2023.10185893","DOIUrl":null,"url":null,"abstract":"The field of Explainable Artificial Intelligence (XAI) has gained significant momentum in recent years. This discipline is focused on developing novel approaches to explain and interpret the functioning of machine learning algorithms. As machine learning techniques increasingly adopt “black box” methods, there is growing confusion about how these algorithms work and make decisions. This uncertainty has made it challenging to implement machine learning in sensitive and critical fields. To address this issue, research in machine learning interpretability has become crucial. One particular area that requires attention is the detection process and classification of malware. Handling and preparing data for malware detection poses significant difficulties for machine learning algorithms. Thus, explainability is a critical requirement in current research. Our research leverages XAI, a novel design of explainable artificial intelligence that uses cybersecurity data to gain knowledge about the composition of malware from the Microsoft large benchmark dataset-Microsoft Malware Classification Challenge (BIG 2015). We use the LIME explainability technique and the Secml python library to develop explainable prediction results about the class of malware. We achieved 94% accuracy using Decision Tree classifier.","PeriodicalId":177556,"journal":{"name":"2023 IEEE Jordan International Joint Conference on Electrical Engineering and Information Technology (JEEIT)","volume":"79 11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE Jordan International Joint Conference on Electrical Engineering and Information Technology (JEEIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/JEEIT58638.2023.10185893","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

The field of Explainable Artificial Intelligence (XAI) has gained significant momentum in recent years. This discipline is focused on developing novel approaches to explain and interpret the functioning of machine learning algorithms. As machine learning techniques increasingly adopt “black box” methods, there is growing confusion about how these algorithms work and make decisions. This uncertainty has made it challenging to implement machine learning in sensitive and critical fields. To address this issue, research in machine learning interpretability has become crucial. One particular area that requires attention is the detection process and classification of malware. Handling and preparing data for malware detection poses significant difficulties for machine learning algorithms. Thus, explainability is a critical requirement in current research. Our research leverages XAI, a novel design of explainable artificial intelligence that uses cybersecurity data to gain knowledge about the composition of malware from the Microsoft large benchmark dataset-Microsoft Malware Classification Challenge (BIG 2015). We use the LIME explainability technique and the Secml python library to develop explainable prediction results about the class of malware. We achieved 94% accuracy using Decision Tree classifier.

查看原文本刊更多论文

使用LIME和Secml进行可解释的基于机器学习的网络安全检测

近年来，可解释人工智能(XAI)领域取得了长足的发展。这门学科的重点是开发新的方法来解释和解释机器学习算法的功能。随着机器学习技术越来越多地采用“黑箱”方法，人们对这些算法如何工作和做出决策越来越困惑。这种不确定性使得在敏感和关键领域实施机器学习变得具有挑战性。为了解决这个问题，机器学习可解释性的研究变得至关重要。需要注意的一个特别领域是恶意软件的检测过程和分类。处理和准备恶意软件检测的数据给机器学习算法带来了巨大的困难。因此，可解释性是当前研究的关键要求。我们的研究利用了XAI，这是一种可解释的人工智能的新设计，它使用网络安全数据从微软大型基准数据集-微软恶意软件分类挑战(BIG 2015)中获取有关恶意软件组成的知识。我们使用LIME可解释性技术和Secml python库来开发关于恶意软件类别的可解释性预测结果。我们使用决策树分类器达到了94%的准确率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2023 IEEE Jordan International Joint Conference on Electrical Engineering and Information Technology (JEEIT)

自引率

0.00%

发文量