Keynote speaker 1: Active online learning

2015 IEEE International Conference on Evolving and Adaptive Intelligent Systems (EAIS) Pub Date : 2015-12-01 DOI:10.1109/EAIS.2015.7368771

A. Bouchachia

{"title":"Keynote speaker 1: Active online learning","authors":"A. Bouchachia","doi":"10.1109/EAIS.2015.7368771","DOIUrl":null,"url":null,"abstract":"Summary form only given. Over the recent years learning from data streams that evolve over time has been witnessing an ever-increasing interest within research and industry communities. Typically a wide range of applications exploit data streams for different sorts of decision making, including monitoring, industrial processes, internet traffic, surveillance, etc. By their very nature, data streams are usually unlabeled given the high velocity of their generation. Collecting labelled examples become very difficult, delayed, costly and sometimes prone to errors. It is therefore very important to devise mechanisms to optimize the labeling process. Active learning offers a principled and systematic way to selectively choose candidate data examples whose labels are to be queried. The overall goal of active learning is to provide, in the worst case, the same performance as that of passive learning (i.e., relying on random sampling) while using less labeled examples. Obviously, the learner should also be able to accommodate unlabeled and labeled data in an online manner. In this talk we will cover recent work on active learning for data stream classification, which is known as stream-based selective sampling. In this latter, the learner makes immediate query decision for each data example during a single scan of the data stream. Stream-based selective sampling is in particular suitable for applications that demand on-the-fly interactive labelling. It is however difficult, because the learner lacks complete knowledge of the underlying data distribution and because such distribution may suffer dynamic change over time. We will overview active learning for stationary as well as non-stationary evolving data streams. In particular, we will discuss multi-criteria active learning and methods for dealing with data drift using online active learning. We will also highlight some of the typical applications where online active learning is relevant.","PeriodicalId":325875,"journal":{"name":"2015 IEEE International Conference on Evolving and Adaptive Intelligent Systems (EAIS)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 IEEE International Conference on Evolving and Adaptive Intelligent Systems (EAIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/EAIS.2015.7368771","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Summary form only given. Over the recent years learning from data streams that evolve over time has been witnessing an ever-increasing interest within research and industry communities. Typically a wide range of applications exploit data streams for different sorts of decision making, including monitoring, industrial processes, internet traffic, surveillance, etc. By their very nature, data streams are usually unlabeled given the high velocity of their generation. Collecting labelled examples become very difficult, delayed, costly and sometimes prone to errors. It is therefore very important to devise mechanisms to optimize the labeling process. Active learning offers a principled and systematic way to selectively choose candidate data examples whose labels are to be queried. The overall goal of active learning is to provide, in the worst case, the same performance as that of passive learning (i.e., relying on random sampling) while using less labeled examples. Obviously, the learner should also be able to accommodate unlabeled and labeled data in an online manner. In this talk we will cover recent work on active learning for data stream classification, which is known as stream-based selective sampling. In this latter, the learner makes immediate query decision for each data example during a single scan of the data stream. Stream-based selective sampling is in particular suitable for applications that demand on-the-fly interactive labelling. It is however difficult, because the learner lacks complete knowledge of the underlying data distribution and because such distribution may suffer dynamic change over time. We will overview active learning for stationary as well as non-stationary evolving data streams. In particular, we will discuss multi-criteria active learning and methods for dealing with data drift using online active learning. We will also highlight some of the typical applications where online active learning is relevant.

查看原文本刊更多论文

主讲人1:积极的在线学习

只提供摘要形式。近年来，随着时间的推移，从数据流中学习已经见证了研究和行业社区日益增长的兴趣。通常，广泛的应用程序利用数据流进行不同类型的决策，包括监控、工业流程、互联网流量、监视等。就其本质而言，由于数据流的生成速度非常快，因此通常没有标记。收集标记的样本变得非常困难、延迟、昂贵，有时还容易出错。因此，设计机制来优化标签过程是非常重要的。主动学习提供了一种有原则和系统的方法来选择性地选择要查询标签的候选数据示例。主动学习的总体目标是在最坏的情况下提供与被动学习(即依赖随机抽样)相同的性能，同时使用较少标记的示例。显然，学习者还应该能够以在线方式容纳未标记和标记的数据。在这次演讲中，我们将介绍最近在数据流分类的主动学习方面的工作，即基于流的选择性采样。在后者中，学习器在对数据流进行一次扫描期间对每个数据示例立即做出查询决策。基于流的选择性采样特别适合于需要动态交互标签的应用。然而，这是困难的，因为学习者对底层数据分布缺乏完整的了解，而且这种分布可能会随着时间的推移而发生动态变化。我们将概述平稳和非平稳演化数据流的主动学习。特别是，我们将讨论多标准主动学习和使用在线主动学习处理数据漂移的方法。我们还将重点介绍一些与在线主动学习相关的典型应用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2015 IEEE International Conference on Evolving and Adaptive Intelligent Systems (EAIS)

自引率

0.00%

发文量