Machine Learning Based Encrypted Content Type Identification

2023 International Conference on Communication Technologies (ComTech) Pub Date : 2023-03-15 DOI:10.1109/ComTech57708.2023.10164955

Zeeshan Mehmood, Aiman Sultan, Fawad Khan, Shahzaib Tahir

{"title":"Machine Learning Based Encrypted Content Type Identification","authors":"Zeeshan Mehmood, Aiman Sultan, Fawad Khan, Shahzaib Tahir","doi":"10.1109/ComTech57708.2023.10164955","DOIUrl":null,"url":null,"abstract":"In the advancing era, Machine Learning has become the backbone of IT and is being used almost in every system. Whereas Cryptography is another widely used technology, which is used for the communication of data via secure means. If the cryptosystems provide indistinguishability, it is considered secure, which means that the attacker cannot get anything from encrypted data, in case of chosen ciphertext attack. To check the feasibility of distinguishability on the ciphertext of secured block ciphers and the identification of the underlying content, this research has applied cryptanalysis on AES-128, CBC, and ECB mode over multiple ML classification models of SVM, KNN, and RF. Datasets were created by using the frequency distribution method, and they were divided into training and testing datasets. The results demonstrate that the EBC mode of AES 128 encryption for different data types is found susceptible to content identification while the accuracy for data encrypted by AES 128 in CBC mode remains low to yield any information regarding its content type.","PeriodicalId":203804,"journal":{"name":"2023 International Conference on Communication Technologies (ComTech)","volume":"137 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Communication Technologies (ComTech)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ComTech57708.2023.10164955","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

In the advancing era, Machine Learning has become the backbone of IT and is being used almost in every system. Whereas Cryptography is another widely used technology, which is used for the communication of data via secure means. If the cryptosystems provide indistinguishability, it is considered secure, which means that the attacker cannot get anything from encrypted data, in case of chosen ciphertext attack. To check the feasibility of distinguishability on the ciphertext of secured block ciphers and the identification of the underlying content, this research has applied cryptanalysis on AES-128, CBC, and ECB mode over multiple ML classification models of SVM, KNN, and RF. Datasets were created by using the frequency distribution method, and they were divided into training and testing datasets. The results demonstrate that the EBC mode of AES 128 encryption for different data types is found susceptible to content identification while the accuracy for data encrypted by AES 128 in CBC mode remains low to yield any information regarding its content type.

查看原文本刊更多论文

基于机器学习的加密内容类型识别

在这个不断发展的时代，机器学习已经成为IT的支柱，几乎在每个系统中都有使用。而密码学是另一种广泛使用的技术，它用于通过安全手段进行数据通信。如果密码系统提供不可区分性，则认为它是安全的，这意味着攻击者在选择密文攻击的情况下无法从加密数据中获取任何东西。为了检验安全分组密码的密文可识别性和底层内容识别的可行性，本研究在SVM、KNN和RF的多个ML分类模型上应用了AES-128、CBC和ECB模式的密码分析。采用频率分布法生成数据集，并将其分为训练数据集和测试数据集。结果表明，不同数据类型的AES 128加密的EBC模式容易受到内容识别的影响，而AES 128在CBC模式下加密的数据的准确性仍然很低，无法产生有关其内容类型的任何信息。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2023 International Conference on Communication Technologies (ComTech)

自引率

0.00%

发文量