Fang Li, Ziyuan Zhu, Chao Yan, Bowen Chen, Dan Meng
{"title":"基于GPRs特征词频分析的恶意软件检测","authors":"Fang Li, Ziyuan Zhu, Chao Yan, Bowen Chen, Dan Meng","doi":"10.1109/TrustCom50675.2020.00037","DOIUrl":null,"url":null,"abstract":"Recently, low-level hardware micro-architecture features are widely used for malware detection, but they always have redundant information, which will inevitably affect malware detection. To address the above problem, this paper proposed a novel dynamic analysis method to detect malware. The feature matrices are first extracted from the General-Purpose Registers (GPRs) that contain a large amount of valuable but redundant information. To reduce the feature dimension, Term Frequency-Inverse Document Frequency (TF-IDF) technique is then used to select the discriminative information from feature matrices. With the selected features, this paper also designs an ensemble learning model for malware detection. Experimental results show that the proposed method performs better than other state-of-art methods.","PeriodicalId":221956,"journal":{"name":"2020 IEEE 19th International Conference on Trust, Security and Privacy in Computing and Communications (TrustCom)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Malware Detection Based on Term Frequency Analysis of GPRs Features\",\"authors\":\"Fang Li, Ziyuan Zhu, Chao Yan, Bowen Chen, Dan Meng\",\"doi\":\"10.1109/TrustCom50675.2020.00037\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Recently, low-level hardware micro-architecture features are widely used for malware detection, but they always have redundant information, which will inevitably affect malware detection. To address the above problem, this paper proposed a novel dynamic analysis method to detect malware. The feature matrices are first extracted from the General-Purpose Registers (GPRs) that contain a large amount of valuable but redundant information. To reduce the feature dimension, Term Frequency-Inverse Document Frequency (TF-IDF) technique is then used to select the discriminative information from feature matrices. With the selected features, this paper also designs an ensemble learning model for malware detection. Experimental results show that the proposed method performs better than other state-of-art methods.\",\"PeriodicalId\":221956,\"journal\":{\"name\":\"2020 IEEE 19th International Conference on Trust, Security and Privacy in Computing and Communications (TrustCom)\",\"volume\":\"26 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE 19th International Conference on Trust, Security and Privacy in Computing and Communications (TrustCom)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/TrustCom50675.2020.00037\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE 19th International Conference on Trust, Security and Privacy in Computing and Communications (TrustCom)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TrustCom50675.2020.00037","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Malware Detection Based on Term Frequency Analysis of GPRs Features
Recently, low-level hardware micro-architecture features are widely used for malware detection, but they always have redundant information, which will inevitably affect malware detection. To address the above problem, this paper proposed a novel dynamic analysis method to detect malware. The feature matrices are first extracted from the General-Purpose Registers (GPRs) that contain a large amount of valuable but redundant information. To reduce the feature dimension, Term Frequency-Inverse Document Frequency (TF-IDF) technique is then used to select the discriminative information from feature matrices. With the selected features, this paper also designs an ensemble learning model for malware detection. Experimental results show that the proposed method performs better than other state-of-art methods.