{"title":"A new features vector matching for big heterogeneous data in intrusion detection context","authors":"Marwa Elayni, F. Jemili, O. Korbaa, B. Solaiman","doi":"10.1109/ATSIP49331.2020.9231671","DOIUrl":null,"url":null,"abstract":"Nowadays, the volume of data considerably increasing, the data is exploding on the scale of the Exabyte and the Zettabyte at an exceptionally high rate. These can be characterized as big data. Hence, the security of the network, Internet, websites, Iot devices and the organizations, of this growth is indispensable. Detecting intrusions in such a big heterogeneous data environment is challenging. In this paper, we will present a new representation of data that can support this big heterogeneous environment. We will use three different datasets and propose an automatically matching algorithm that measures the semantic similarity between each two features existing on different datasets. Thereafter, an approximate vector is created that any type of coming data can be stored. With this representation, we can have subsequently an efficient intrusion detection system that can be able to acknowledge any instance of the existing data in the networks.","PeriodicalId":384018,"journal":{"name":"2020 5th International Conference on Advanced Technologies for Signal and Image Processing (ATSIP)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 5th International Conference on Advanced Technologies for Signal and Image Processing (ATSIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ATSIP49331.2020.9231671","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Nowadays, the volume of data considerably increasing, the data is exploding on the scale of the Exabyte and the Zettabyte at an exceptionally high rate. These can be characterized as big data. Hence, the security of the network, Internet, websites, Iot devices and the organizations, of this growth is indispensable. Detecting intrusions in such a big heterogeneous data environment is challenging. In this paper, we will present a new representation of data that can support this big heterogeneous environment. We will use three different datasets and propose an automatically matching algorithm that measures the semantic similarity between each two features existing on different datasets. Thereafter, an approximate vector is created that any type of coming data can be stored. With this representation, we can have subsequently an efficient intrusion detection system that can be able to acknowledge any instance of the existing data in the networks.