变化点检测的深度强化单次学习

2018 56th Annual Allerton Conference on Communication, Control, and Computing (Allerton) Pub Date : 2018-10-01 DOI:10.1109/ALLERTON.2018.8635928

A. Puzanov, Kobi Cohen

{"title":"变化点检测的深度强化单次学习","authors":"A. Puzanov, Kobi Cohen","doi":"10.1109/ALLERTON.2018.8635928","DOIUrl":null,"url":null,"abstract":"We consider the problem of detecting a change in a time series quickly and reliably, where only a few training instances are available. Examples include identifying changes in network traffic due to zero-day attacks, and computer vision applications where changes in series of images that represent significant events needed to be detected. These are known as cases of one-shot learning. We develop a novel Deep Reinforcement One-shot Learning (DeROL) framework to address this challenge. The basic idea of the DeROL algorithm is to train a deep-Q network to obtain a policy which is oblivious to the unseen classes in the testing data. Then, in real-time, DeROL maps the current state of the one-shot learning process to operational actions based on the trained deep-Q network, to maximize the objective function. We tested the algorithm using the OMNIGLOT dataset to demonstrate the efficiency of the DeROL framework.","PeriodicalId":299280,"journal":{"name":"2018 56th Annual Allerton Conference on Communication, Control, and Computing (Allerton)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Deep Reinforcement One-Shot Learning for Change Point Detection\",\"authors\":\"A. Puzanov, Kobi Cohen\",\"doi\":\"10.1109/ALLERTON.2018.8635928\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We consider the problem of detecting a change in a time series quickly and reliably, where only a few training instances are available. Examples include identifying changes in network traffic due to zero-day attacks, and computer vision applications where changes in series of images that represent significant events needed to be detected. These are known as cases of one-shot learning. We develop a novel Deep Reinforcement One-shot Learning (DeROL) framework to address this challenge. The basic idea of the DeROL algorithm is to train a deep-Q network to obtain a policy which is oblivious to the unseen classes in the testing data. Then, in real-time, DeROL maps the current state of the one-shot learning process to operational actions based on the trained deep-Q network, to maximize the objective function. We tested the algorithm using the OMNIGLOT dataset to demonstrate the efficiency of the DeROL framework.\",\"PeriodicalId\":299280,\"journal\":{\"name\":\"2018 56th Annual Allerton Conference on Communication, Control, and Computing (Allerton)\",\"volume\":\"27 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 56th Annual Allerton Conference on Communication, Control, and Computing (Allerton)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ALLERTON.2018.8635928\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 56th Annual Allerton Conference on Communication, Control, and Computing (Allerton)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ALLERTON.2018.8635928","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 6

摘要

我们考虑在只有少数训练实例可用的情况下快速可靠地检测时间序列中的变化的问题。示例包括识别由于零日攻击导致的网络流量变化，以及需要检测代表重要事件的一系列图像变化的计算机视觉应用程序。这些被称为一次性学习的案例。我们开发了一种新的深度强化单次学习(DeROL)框架来解决这一挑战。DeROL算法的基本思想是训练一个深度q网络来获得一个对测试数据中未见类无关的策略。然后，DeROL实时地将单次学习过程的当前状态映射到基于训练好的deep-Q网络的操作动作上，以最大化目标函数。我们使用OMNIGLOT数据集测试了该算法，以证明DeROL框架的效率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Deep Reinforcement One-Shot Learning for Change Point Detection

We consider the problem of detecting a change in a time series quickly and reliably, where only a few training instances are available. Examples include identifying changes in network traffic due to zero-day attacks, and computer vision applications where changes in series of images that represent significant events needed to be detected. These are known as cases of one-shot learning. We develop a novel Deep Reinforcement One-shot Learning (DeROL) framework to address this challenge. The basic idea of the DeROL algorithm is to train a deep-Q network to obtain a policy which is oblivious to the unseen classes in the testing data. Then, in real-time, DeROL maps the current state of the one-shot learning process to operational actions based on the trained deep-Q network, to maximize the objective function. We tested the algorithm using the OMNIGLOT dataset to demonstrate the efficiency of the DeROL framework.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2018 56th Annual Allerton Conference on Communication, Control, and Computing (Allerton)

自引率

0.00%

发文量