{"title":"评价压缩设置对图像文件格式分类的影响","authors":"Z. Seyedghorban, M. Teimouri","doi":"10.1109/ICCKE50421.2020.9303655","DOIUrl":null,"url":null,"abstract":"The classification of file fragments of various file formats is an important task in many applications such as intrusion detection systems, web content filtering, and digital forensics. To date, many research works have presented various feature sets and methods for the task of file fragments classification. Despite this variety, no research work has mainly focused on image file formats in particular. In this paper, the classification of the image file formats is studied. Moreover, we examine the effect of different compression settings on the accuracy of a trained model. It is shown that when during the training phase only specific compression settings are considered, the trained machine performs poorly for unseen compression settings. Considering this fact, we propose our method, in which, fragments with different compression settings but the same file format are merged to form a more general class label. We compare our approach with three other methods proposed in the literature. Results indicate that the proposed feature set leads to a more accurate classifier.","PeriodicalId":402043,"journal":{"name":"2020 10th International Conference on Computer and Knowledge Engineering (ICCKE)","volume":"76 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Evaluating the Effect of Compression Settings in the Classification of Image File Formats\",\"authors\":\"Z. Seyedghorban, M. Teimouri\",\"doi\":\"10.1109/ICCKE50421.2020.9303655\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The classification of file fragments of various file formats is an important task in many applications such as intrusion detection systems, web content filtering, and digital forensics. To date, many research works have presented various feature sets and methods for the task of file fragments classification. Despite this variety, no research work has mainly focused on image file formats in particular. In this paper, the classification of the image file formats is studied. Moreover, we examine the effect of different compression settings on the accuracy of a trained model. It is shown that when during the training phase only specific compression settings are considered, the trained machine performs poorly for unseen compression settings. Considering this fact, we propose our method, in which, fragments with different compression settings but the same file format are merged to form a more general class label. We compare our approach with three other methods proposed in the literature. Results indicate that the proposed feature set leads to a more accurate classifier.\",\"PeriodicalId\":402043,\"journal\":{\"name\":\"2020 10th International Conference on Computer and Knowledge Engineering (ICCKE)\",\"volume\":\"76 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-10-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 10th International Conference on Computer and Knowledge Engineering (ICCKE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCKE50421.2020.9303655\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 10th International Conference on Computer and Knowledge Engineering (ICCKE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCKE50421.2020.9303655","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Evaluating the Effect of Compression Settings in the Classification of Image File Formats
The classification of file fragments of various file formats is an important task in many applications such as intrusion detection systems, web content filtering, and digital forensics. To date, many research works have presented various feature sets and methods for the task of file fragments classification. Despite this variety, no research work has mainly focused on image file formats in particular. In this paper, the classification of the image file formats is studied. Moreover, we examine the effect of different compression settings on the accuracy of a trained model. It is shown that when during the training phase only specific compression settings are considered, the trained machine performs poorly for unseen compression settings. Considering this fact, we propose our method, in which, fragments with different compression settings but the same file format are merged to form a more general class label. We compare our approach with three other methods proposed in the literature. Results indicate that the proposed feature set leads to a more accurate classifier.