{"title":"网络环境中 DDoS 攻击分类的机器学习技术分析与比较","authors":"Gregorius Airlangga","doi":"10.37034/infeb.v6i1.795","DOIUrl":null,"url":null,"abstract":"This research presents a comparative analysis of machine learning techniques for classifying Distributed Denial of Service (DDoS) attacks within network traffic. We evaluated the performance of three algorithms: Logistic Regression, Decision Tree, and Random Forest, including their scaled-feature counterparts. The study utilized a robust methodology incorporating advanced data preprocessing, feature engineering, and Synthetic Minority Over-sampling Technique (SMOTE) to address class imbalance. The models were rigorously tested using a cross-validation framework, assessing their accuracy, precision, recall, and F1 score. Results indicated that the Random Forest algorithm outperformed the others, demonstrating superior predictive accuracy and consistency, albeit with higher computational costs. Logistic Regression, when feature-scaled, showed significant improvement in performance, highlighting the importance of data normalization in models sensitive to feature scaling. Decision Trees provided a quick and interpretable model, though slightly less accurate than the Random Forest. The research findings highlight the trade-offs between predictive performance and computational efficiency in selecting machine learning models for cybersecurity applications. The study contributes to the cybersecurity domain by elucidating the efficacy of ensemble techniques in DDoS attack classification and underscores the potential for model improvement through scaling and data balancing.","PeriodicalId":242689,"journal":{"name":"Jurnal Informatika Ekonomi Bisnis","volume":"33 7","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-03-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Analysis and Comparison of Machine Learning Techniques for DDoS Attack Classification in Network Environments\",\"authors\":\"Gregorius Airlangga\",\"doi\":\"10.37034/infeb.v6i1.795\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This research presents a comparative analysis of machine learning techniques for classifying Distributed Denial of Service (DDoS) attacks within network traffic. We evaluated the performance of three algorithms: Logistic Regression, Decision Tree, and Random Forest, including their scaled-feature counterparts. The study utilized a robust methodology incorporating advanced data preprocessing, feature engineering, and Synthetic Minority Over-sampling Technique (SMOTE) to address class imbalance. The models were rigorously tested using a cross-validation framework, assessing their accuracy, precision, recall, and F1 score. Results indicated that the Random Forest algorithm outperformed the others, demonstrating superior predictive accuracy and consistency, albeit with higher computational costs. Logistic Regression, when feature-scaled, showed significant improvement in performance, highlighting the importance of data normalization in models sensitive to feature scaling. Decision Trees provided a quick and interpretable model, though slightly less accurate than the Random Forest. The research findings highlight the trade-offs between predictive performance and computational efficiency in selecting machine learning models for cybersecurity applications. The study contributes to the cybersecurity domain by elucidating the efficacy of ensemble techniques in DDoS attack classification and underscores the potential for model improvement through scaling and data balancing.\",\"PeriodicalId\":242689,\"journal\":{\"name\":\"Jurnal Informatika Ekonomi Bisnis\",\"volume\":\"33 7\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-03-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Jurnal Informatika Ekonomi Bisnis\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.37034/infeb.v6i1.795\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Jurnal Informatika Ekonomi Bisnis","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.37034/infeb.v6i1.795","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
本研究对用于对网络流量中的分布式拒绝服务(DDoS)攻击进行分类的机器学习技术进行了比较分析。我们评估了三种算法的性能:逻辑回归、决策树和随机森林,包括它们的缩放特征对应算法。这项研究采用了一种稳健的方法,结合了先进的数据预处理、特征工程和合成少数群体过度采样技术(SMOTE)来解决类不平衡问题。利用交叉验证框架对模型进行了严格测试,评估了它们的准确度、精确度、召回率和 F1 分数。结果表明,随机森林算法的表现优于其他算法,显示出更高的预测准确性和一致性,尽管计算成本较高。逻辑回归在对特征进行缩放后,性能有了显著提高,这突出表明了数据归一化在对特征缩放敏感的模型中的重要性。决策树提供了一个快速、可解释的模型,但准确性略低于随机森林。研究结果强调了在为网络安全应用选择机器学习模型时,预测性能和计算效率之间的权衡。这项研究阐明了集合技术在 DDoS 攻击分类中的功效,并强调了通过扩展和数据平衡改进模型的潜力,从而为网络安全领域做出了贡献。
Analysis and Comparison of Machine Learning Techniques for DDoS Attack Classification in Network Environments
This research presents a comparative analysis of machine learning techniques for classifying Distributed Denial of Service (DDoS) attacks within network traffic. We evaluated the performance of three algorithms: Logistic Regression, Decision Tree, and Random Forest, including their scaled-feature counterparts. The study utilized a robust methodology incorporating advanced data preprocessing, feature engineering, and Synthetic Minority Over-sampling Technique (SMOTE) to address class imbalance. The models were rigorously tested using a cross-validation framework, assessing their accuracy, precision, recall, and F1 score. Results indicated that the Random Forest algorithm outperformed the others, demonstrating superior predictive accuracy and consistency, albeit with higher computational costs. Logistic Regression, when feature-scaled, showed significant improvement in performance, highlighting the importance of data normalization in models sensitive to feature scaling. Decision Trees provided a quick and interpretable model, though slightly less accurate than the Random Forest. The research findings highlight the trade-offs between predictive performance and computational efficiency in selecting machine learning models for cybersecurity applications. The study contributes to the cybersecurity domain by elucidating the efficacy of ensemble techniques in DDoS attack classification and underscores the potential for model improvement through scaling and data balancing.