Moises S. De Sousa, Carlos Eduardo Lacerda Veiga, R. D. O. Albuquerque, W. Giozza
Published in: 2022 17th Iberian Conference on Information Systems and Technologies (CISTI), 2022-06-22
DOI: 10.23919/cisti54924.2022.9820579 (https://doi.org/10.23919/cisti54924.2022.9820579)
Information Gain applied to reduce model-building time in decision-tree-based intrusion detection system
Given the large amount of sensitive data generated by websites, the growth of attacks on their databases is understandable. This work proposes an intrusion detection system based on data mining and machine learning techniques to detect these attacks and mitigate the damage they cause. It uses the Information Gain method for attribute selection in order to reduce model-building time without affecting classification performance. Using the CIC-IDS 2017 dataset, this work shows how different decision tree algorithms (Random Forest and J48) behave when they receive the same parameters and data. With Information Gain used to select attributes, the proposed system achieves a processing time reduction of up to 90%.
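To make the attribute-selection step concrete, the following is a minimal sketch of Information Gain ranking, not the authors' implementation. It assumes discrete attribute values (continuous attributes such as those in CIC-IDS 2017 would need to be binned first), and the toy flows, attribute names, and the `select_top_k` helper are hypothetical. The gain of an attribute is the entropy of the class labels minus the expected entropy of the labels after splitting on that attribute.

```python
import math
from collections import Counter, defaultdict


def entropy(labels):
    """Shannon entropy (in bits) of a sequence of class labels."""
    total = len(labels)
    counts = Counter(labels)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def information_gain(feature_values, labels):
    """Entropy of labels minus expected entropy after splitting on the feature."""
    groups = defaultdict(list)
    for value, label in zip(feature_values, labels):
        groups[value].append(label)
    total = len(labels)
    conditional = sum(len(g) / total * entropy(g) for g in groups.values())
    return entropy(labels) - conditional


def select_top_k(rows, labels, feature_names, k):
    """Hypothetical helper: rank attributes by gain and keep the k best."""
    gains = {
        name: information_gain([row[i] for row in rows], labels)
        for i, name in enumerate(feature_names)
    }
    ranked = sorted(gains, key=gains.get, reverse=True)
    return ranked[:k], gains


# Toy, made-up flows (assumption): protocol, port class, flag-count bucket.
feature_names = ["protocol", "port_class", "flag_bucket"]
rows = [
    ("tcp", "web", "high"),
    ("tcp", "web", "low"),
    ("udp", "dns", "low"),
    ("tcp", "web", "high"),
    ("udp", "dns", "low"),
    ("tcp", "other", "high"),
]
labels = ["attack", "benign", "benign", "attack", "benign", "attack"]

selected, gains = select_top_k(rows, labels, feature_names, k=2)
print(selected)  # attributes kept for model building
```

Training a decision tree on only the selected attributes is what shortens model-building time; how many attributes to keep is a tuning choice that this sketch does not address.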