Open-world continual learning: Unifying novelty detection and continual learning

IF 4.6 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Artificial Intelligence Pub Date : 2024-10-31 DOI:10.1016/j.artint.2024.104237

Gyuhak Kim , Changnan Xiao , Tatsuya Konishi , Zixuan Ke , Bing Liu

{"title":"Open-world continual learning: Unifying novelty detection and continual learning","authors":"Gyuhak Kim , Changnan Xiao , Tatsuya Konishi , Zixuan Ke , Bing Liu","doi":"10.1016/j.artint.2024.104237","DOIUrl":null,"url":null,"abstract":"<div><div>As AI agents are increasingly used in the real open world with unknowns or novelties, they need the ability to (1) recognize objects that (a) they have learned before and (b) detect items that they have never seen or learned, and (2) learn the new items incrementally to become more and more knowledgeable and powerful. (1) is called novelty detection or out-of-distribution (OOD) detection and (2) is called class incremental learning (CIL), which is a setting of continual learning (CL). In existing research, OOD detection and CIL are regarded as two completely different problems. This paper first provides a theoretical proof that good OOD detection for each task within the set of learned tasks (called closed-world OOD detection) is necessary for successful CIL. We show this by decomposing CIL into two sub-problems: within-task prediction (WP) and task-id prediction (TP), and proving that TP is correlated with closed-world OOD detection. The key theoretical result is that regardless of whether WP and OOD detection (or TP) are defined explicitly or implicitly by a CIL algorithm, good WP and good closed-world OOD detection are necessary and sufficient conditions for good CIL, which unifies novelty or OOD detection and continual learning (CIL, in particular). We call this traditional CIL the closed-world CIL as it does not detect future OOD data in the open world. The paper then proves that the theory can be generalized or extended to open-world CIL, which is the proposed open-world continual learning, that can perform CIL in the open world and detect future or open-world OOD data. Based on the theoretical results, new CIL methods are also designed, which outperform strong baselines in CIL accuracy and in continual OOD detection by a large margin.</div></div>","PeriodicalId":8434,"journal":{"name":"Artificial Intelligence","volume":"338 ","pages":"Article 104237"},"PeriodicalIF":4.6000,"publicationDate":"2024-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0004370224001735","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

As AI agents are increasingly used in the real open world with unknowns or novelties, they need the ability to (1) recognize objects that (a) they have learned before and (b) detect items that they have never seen or learned, and (2) learn the new items incrementally to become more and more knowledgeable and powerful. (1) is called novelty detection or out-of-distribution (OOD) detection and (2) is called class incremental learning (CIL), which is a setting of continual learning (CL). In existing research, OOD detection and CIL are regarded as two completely different problems. This paper first provides a theoretical proof that good OOD detection for each task within the set of learned tasks (called closed-world OOD detection) is necessary for successful CIL. We show this by decomposing CIL into two sub-problems: within-task prediction (WP) and task-id prediction (TP), and proving that TP is correlated with closed-world OOD detection. The key theoretical result is that regardless of whether WP and OOD detection (or TP) are defined explicitly or implicitly by a CIL algorithm, good WP and good closed-world OOD detection are necessary and sufficient conditions for good CIL, which unifies novelty or OOD detection and continual learning (CIL, in particular). We call this traditional CIL the closed-world CIL as it does not detect future OOD data in the open world. The paper then proves that the theory can be generalized or extended to open-world CIL, which is the proposed open-world continual learning, that can perform CIL in the open world and detect future or open-world OOD data. Based on the theoretical results, new CIL methods are also designed, which outperform strong baselines in CIL accuracy and in continual OOD detection by a large margin.

查看原文本刊更多论文

开放世界持续学习：统一新奇事物检测和持续学习

随着人工智能代理越来越多地应用于未知或新奇的真实开放世界，它们需要具备以下能力：(1) 识别(a) 它们以前学习过的对象；(b) 检测它们从未见过或学习过的项目；(2) 逐步学习新项目，以变得越来越博学和强大。(1)被称为新颖性检测或分布外（OOD）检测，(2)被称为类增量学习（CIL），它是持续学习（CL）的一种设置。在现有研究中，OOD 检测和 CIL 被视为两个完全不同的问题。本文首先从理论上证明，要成功实现 CIL，就必须对已学任务集合中的每个任务进行良好的 OOD 检测（称为封闭世界 OOD 检测）。我们将 CIL 分解为两个子问题：任务内预测（WP）和任务 ID 预测（TP），并证明 TP 与封闭世界 OOD 检测相关。关键的理论结果是，无论 WP 和 OOD 检测（或 TP）是由 CIL 算法显式定义还是隐式定义，良好的 WP 和良好的封闭世界 OOD 检测都是良好 CIL 的必要条件和充分条件，它将新颖性或 OOD 检测与持续学习（尤其是 CIL）统一起来。我们称这种传统的 CIL 为封闭世界 CIL，因为它不能检测开放世界中的未来 OOD 数据。本文随后证明，该理论可以推广或扩展到开放世界 CIL，即所提出的开放世界持续学习，它可以在开放世界中执行 CIL 并检测未来或开放世界的 OOD 数据。基于理论结果，还设计了新的 CIL 方法，这些方法在 CIL 准确性和持续 OOD 检测方面大大优于强基准方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Artificial Intelligence 工程技术-计算机：人工智能

CiteScore

11.20

自引率

1.40%

发文量

118

审稿时长

8 months

期刊介绍： The Journal of Artificial Intelligence (AIJ) welcomes papers covering a broad spectrum of AI topics, including cognition, automated reasoning, computer vision, machine learning, and more. Papers should demonstrate advancements in AI and propose innovative approaches to AI problems. Additionally, the journal accepts papers describing AI applications, focusing on how new methods enhance performance rather than reiterating conventional approaches. In addition to regular papers, AIJ also accepts Research Notes, Research Field Reviews, Position Papers, Book Reviews, and summary papers on AI challenges and competitions.