PEDM: Pre-Ensemble Decision Making for Malware Identification and Web Files

2020 6th International Conference on Web Research (ICWR) Pub Date : 2020-04-01 DOI:10.1109/ICWR49608.2020.9122322

Elham Velayati, Seyed Mehdi Hazrati Fard

{"title":"PEDM: Pre-Ensemble Decision Making for Malware Identification and Web Files","authors":"Elham Velayati, Seyed Mehdi Hazrati Fard","doi":"10.1109/ICWR49608.2020.9122322","DOIUrl":null,"url":null,"abstract":"Connecting your system or device to an insecure network can create the possibility of infecting by the unwanted files. Malware is every malicious code that has the potential to harm any computer or network. So, detecting harmful files is a crucial duty and an important role in any system. Machine learning approaches use a variety of features such as Opcodes, Bytecodes, and System-calls to achieve accurate malware identification. Each of these feature sets provides a unique semantic view, while, considering the effect of altogether is more reliable to detect attacks. Malware can disguise itself in some views, but hiding in all views will be much more difficult. Multi-View Learning (MVL) is an outstanding approach that considers multiple views of a problem to improve the overall performance. In this paper, inspiring MVL an approach is proposed to incorporate some various feature sets and exploit complementary information to identify a file. In this way, the consensus of multiple views is used to minimize the overall error of a classifier based on sparse representation. To show the generalization power of the proposed method, various datasets are employed. Experimental results indicate that in addition to high performance, the proposed method has the advantage of overcoming the imbalanced conditions.","PeriodicalId":231982,"journal":{"name":"2020 6th International Conference on Web Research (ICWR)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 6th International Conference on Web Research (ICWR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICWR49608.2020.9122322","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

Connecting your system or device to an insecure network can create the possibility of infecting by the unwanted files. Malware is every malicious code that has the potential to harm any computer or network. So, detecting harmful files is a crucial duty and an important role in any system. Machine learning approaches use a variety of features such as Opcodes, Bytecodes, and System-calls to achieve accurate malware identification. Each of these feature sets provides a unique semantic view, while, considering the effect of altogether is more reliable to detect attacks. Malware can disguise itself in some views, but hiding in all views will be much more difficult. Multi-View Learning (MVL) is an outstanding approach that considers multiple views of a problem to improve the overall performance. In this paper, inspiring MVL an approach is proposed to incorporate some various feature sets and exploit complementary information to identify a file. In this way, the consensus of multiple views is used to minimize the overall error of a classifier based on sparse representation. To show the generalization power of the proposed method, various datasets are employed. Experimental results indicate that in addition to high performance, the proposed method has the advantage of overcoming the imbalanced conditions.

查看原文本刊更多论文

PEDM:恶意软件识别和网络文件的预集成决策

将您的系统或设备连接到不安全的网络可能会产生被不需要的文件感染的可能性。恶意软件是每一个恶意代码，有可能损害任何计算机或网络。因此，检测有害文件在任何系统中都是一项至关重要的任务。机器学习方法使用各种功能，如操作码、字节码和系统调用，以实现准确的恶意软件识别。每一个特征集都提供了一个独特的语义视图，而综合考虑的效果对检测攻击更可靠。恶意软件可以在某些视图中伪装自己，但在所有视图中隐藏要困难得多。多视图学习(Multi-View Learning, MVL)是一种杰出的方法，它考虑了一个问题的多个视图，以提高整体性能。本文提出了一种鼓舞人心的MVL方法，该方法结合不同的特征集并利用互补信息来识别文件。通过这种方式，利用多视图的一致性来最小化基于稀疏表示的分类器的总体误差。为了证明该方法的泛化能力，采用了不同的数据集。实验结果表明，该方法除具有较高的性能外，还具有克服不平衡条件的优点。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2020 6th International Conference on Web Research (ICWR)

自引率

0.00%

发文量