{"title":"自然数据分布变化下的连续对比异常检测","authors":"J. Yang, Yi Shen, Linan Deng","doi":"10.1109/CACRE58689.2023.10208545","DOIUrl":null,"url":null,"abstract":"This article carries out a research on the topic of data-driven anomaly detection with a focus on its continual learning ability for nonstationary data streams. The study has two primary objectives: to process the precious label information in a semi-supervised paradigm, and to deal with the stability-plasticity dilemma caused by natural data distribution shifts. It means that the anomaly detector should adapt to the new data distribution while retaining and utilizing the previous knowledge. To address these objectives, a novel continual anomaly detection framework, called Continual Contrastive Anomaly Detection (CCAD), is proposed through the lens of contrastive learning. CCAD is highlighted by prototype learning through a continual contrastive loss function that includes both homologous sample-to-prototype and heterologous sample-to-sample contrastive components, which aim to find out the prototypical representation for new data and to constrain the updated representation space within the proximal region of previous ones, respectively. In addition, it provides a natural rehearsal selection method based on the affinity of samples to prototypes. The efficacy of CCAD is experimentally demonstrated through a case study over a network intrusion detection system spanning approximately a decade. The source code is available at https://github.com/JingyuYang1997/CCAD.","PeriodicalId":447007,"journal":{"name":"2023 8th International Conference on Automation, Control and Robotics Engineering (CACRE)","volume":"52 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Continual Contrastive Anomaly Detection under Natural Data Distribution Shifts\",\"authors\":\"J. Yang, Yi Shen, Linan Deng\",\"doi\":\"10.1109/CACRE58689.2023.10208545\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This article carries out a research on the topic of data-driven anomaly detection with a focus on its continual learning ability for nonstationary data streams. The study has two primary objectives: to process the precious label information in a semi-supervised paradigm, and to deal with the stability-plasticity dilemma caused by natural data distribution shifts. It means that the anomaly detector should adapt to the new data distribution while retaining and utilizing the previous knowledge. To address these objectives, a novel continual anomaly detection framework, called Continual Contrastive Anomaly Detection (CCAD), is proposed through the lens of contrastive learning. CCAD is highlighted by prototype learning through a continual contrastive loss function that includes both homologous sample-to-prototype and heterologous sample-to-sample contrastive components, which aim to find out the prototypical representation for new data and to constrain the updated representation space within the proximal region of previous ones, respectively. In addition, it provides a natural rehearsal selection method based on the affinity of samples to prototypes. The efficacy of CCAD is experimentally demonstrated through a case study over a network intrusion detection system spanning approximately a decade. The source code is available at https://github.com/JingyuYang1997/CCAD.\",\"PeriodicalId\":447007,\"journal\":{\"name\":\"2023 8th International Conference on Automation, Control and Robotics Engineering (CACRE)\",\"volume\":\"52 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 8th International Conference on Automation, Control and Robotics Engineering (CACRE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CACRE58689.2023.10208545\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 8th International Conference on Automation, Control and Robotics Engineering (CACRE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CACRE58689.2023.10208545","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Continual Contrastive Anomaly Detection under Natural Data Distribution Shifts
This article carries out a research on the topic of data-driven anomaly detection with a focus on its continual learning ability for nonstationary data streams. The study has two primary objectives: to process the precious label information in a semi-supervised paradigm, and to deal with the stability-plasticity dilemma caused by natural data distribution shifts. It means that the anomaly detector should adapt to the new data distribution while retaining and utilizing the previous knowledge. To address these objectives, a novel continual anomaly detection framework, called Continual Contrastive Anomaly Detection (CCAD), is proposed through the lens of contrastive learning. CCAD is highlighted by prototype learning through a continual contrastive loss function that includes both homologous sample-to-prototype and heterologous sample-to-sample contrastive components, which aim to find out the prototypical representation for new data and to constrain the updated representation space within the proximal region of previous ones, respectively. In addition, it provides a natural rehearsal selection method based on the affinity of samples to prototypes. The efficacy of CCAD is experimentally demonstrated through a case study over a network intrusion detection system spanning approximately a decade. The source code is available at https://github.com/JingyuYang1997/CCAD.