{"title":"Optimized Zero False Positives Perceptron Training for Malware Detection","authors":"Dragos Gavrilut, Razvan Benchea, Cristina Vatamanu","doi":"10.1109/SYNASC.2012.34","DOIUrl":null,"url":null,"abstract":"The increasing number of malware in the past 4 years has determined researchers to test different machine learning techniques to automate the detection system. But because of the large size of the dataset and the need of having a high detection rate, the resulted models have often produced many false positives. This paper proposes a modified version of the perceptron algorithm able to detect malware samples while training at a low rate (even zero) of false positives. A very low number of false positives is crucial because in a real life situation detecting a clean file as malware can destroy the operating system or render other programs unusable. We also provide a method of optimizing the training speed for the algorithm while maintaining the same accuracy. The resulted algorithm can be used in an ensemble or voting system to increase detection and eliminate false positives.","PeriodicalId":173161,"journal":{"name":"2012 14th International Symposium on Symbolic and Numeric Algorithms for Scientific Computing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"30","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 14th International Symposium on Symbolic and Numeric Algorithms for Scientific Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SYNASC.2012.34","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 30
Abstract
The increasing number of malware in the past 4 years has determined researchers to test different machine learning techniques to automate the detection system. But because of the large size of the dataset and the need of having a high detection rate, the resulted models have often produced many false positives. This paper proposes a modified version of the perceptron algorithm able to detect malware samples while training at a low rate (even zero) of false positives. A very low number of false positives is crucial because in a real life situation detecting a clean file as malware can destroy the operating system or render other programs unusable. We also provide a method of optimizing the training speed for the algorithm while maintaining the same accuracy. The resulted algorithm can be used in an ensemble or voting system to increase detection and eliminate false positives.