{"title":"Data Augmentation Methods and their Effects on Long-Range Dependence","authors":"M. Ghanbari, W. Kinsner","doi":"10.1109/ICCICC50026.2020.9450221","DOIUrl":null,"url":null,"abstract":"Data augmentation is a common method for expanding datasets to train machine learning models. In this paper, five different methods are used to obtain augmented sets. In addition, eight measures are used for experimental evaluation of datasets before and after data augmentation methods. The key requirement is that any data augmentation should not alter the fundamental properties and characteristics of the original dataset. This research shows how some data augmentation methods can destroy the long-range dependence of the Internet traffic data (ITD) with distributed denial of service (DDoS) attacks (DDoS ITD), and consequently alter the probability mass function (PMF) and data labelling (tagging) of the DDoS ITD.","PeriodicalId":212248,"journal":{"name":"2020 IEEE 19th International Conference on Cognitive Informatics & Cognitive Computing (ICCI*CC)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE 19th International Conference on Cognitive Informatics & Cognitive Computing (ICCI*CC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCICC50026.2020.9450221","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Data augmentation is a common method for expanding datasets to train machine learning models. In this paper, five different methods are used to obtain augmented sets. In addition, eight measures are used for experimental evaluation of datasets before and after data augmentation methods. The key requirement is that any data augmentation should not alter the fundamental properties and characteristics of the original dataset. This research shows how some data augmentation methods can destroy the long-range dependence of the Internet traffic data (ITD) with distributed denial of service (DDoS) attacks (DDoS ITD), and consequently alter the probability mass function (PMF) and data labelling (tagging) of the DDoS ITD.