Machine Learning Based Encrypted Content Type Identification

Zeeshan Mehmood, Aiman Sultan, Fawad Khan, Shahzaib Tahir
{"title":"Machine Learning Based Encrypted Content Type Identification","authors":"Zeeshan Mehmood, Aiman Sultan, Fawad Khan, Shahzaib Tahir","doi":"10.1109/ComTech57708.2023.10164955","DOIUrl":null,"url":null,"abstract":"In the advancing era, Machine Learning has become the backbone of IT and is being used almost in every system. Whereas Cryptography is another widely used technology, which is used for the communication of data via secure means. If the cryptosystems provide indistinguishability, it is considered secure, which means that the attacker cannot get anything from encrypted data, in case of chosen ciphertext attack. To check the feasibility of distinguishability on the ciphertext of secured block ciphers and the identification of the underlying content, this research has applied cryptanalysis on AES-128, CBC, and ECB mode over multiple ML classification models of SVM, KNN, and RF. Datasets were created by using the frequency distribution method, and they were divided into training and testing datasets. The results demonstrate that the EBC mode of AES 128 encryption for different data types is found susceptible to content identification while the accuracy for data encrypted by AES 128 in CBC mode remains low to yield any information regarding its content type.","PeriodicalId":203804,"journal":{"name":"2023 International Conference on Communication Technologies (ComTech)","volume":"137 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Communication Technologies (ComTech)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ComTech57708.2023.10164955","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

In the advancing era, Machine Learning has become the backbone of IT and is being used almost in every system. Whereas Cryptography is another widely used technology, which is used for the communication of data via secure means. If the cryptosystems provide indistinguishability, it is considered secure, which means that the attacker cannot get anything from encrypted data, in case of chosen ciphertext attack. To check the feasibility of distinguishability on the ciphertext of secured block ciphers and the identification of the underlying content, this research has applied cryptanalysis on AES-128, CBC, and ECB mode over multiple ML classification models of SVM, KNN, and RF. Datasets were created by using the frequency distribution method, and they were divided into training and testing datasets. The results demonstrate that the EBC mode of AES 128 encryption for different data types is found susceptible to content identification while the accuracy for data encrypted by AES 128 in CBC mode remains low to yield any information regarding its content type.
基于机器学习的加密内容类型识别
在这个不断发展的时代,机器学习已经成为IT的支柱,几乎在每个系统中都有使用。而密码学是另一种广泛使用的技术,它用于通过安全手段进行数据通信。如果密码系统提供不可区分性,则认为它是安全的,这意味着攻击者在选择密文攻击的情况下无法从加密数据中获取任何东西。为了检验安全分组密码的密文可识别性和底层内容识别的可行性,本研究在SVM、KNN和RF的多个ML分类模型上应用了AES-128、CBC和ECB模式的密码分析。采用频率分布法生成数据集,并将其分为训练数据集和测试数据集。结果表明,不同数据类型的AES 128加密的EBC模式容易受到内容识别的影响,而AES 128在CBC模式下加密的数据的准确性仍然很低,无法产生有关其内容类型的任何信息。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信