开放世界持续学习:统一新奇事物检测和持续学习

IF 5.1 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Gyuhak Kim , Changnan Xiao , Tatsuya Konishi , Zixuan Ke , Bing Liu
{"title":"开放世界持续学习:统一新奇事物检测和持续学习","authors":"Gyuhak Kim ,&nbsp;Changnan Xiao ,&nbsp;Tatsuya Konishi ,&nbsp;Zixuan Ke ,&nbsp;Bing Liu","doi":"10.1016/j.artint.2024.104237","DOIUrl":null,"url":null,"abstract":"<div><div>As AI agents are increasingly used in the real open world with unknowns or novelties, they need the ability to (1) recognize objects that (a) they have learned before and (b) detect items that they have never seen or learned, and (2) learn the new items incrementally to become more and more knowledgeable and powerful. (1) is called <em>novelty detection</em> or <em>out-of-distribution</em> (OOD) <em>detection</em> and (2) is called <em>class incremental learning</em> (CIL), which is a setting of <em>continual learning</em> (CL). In existing research, OOD detection and CIL are regarded as two completely different problems. This paper first provides a theoretical proof that good OOD detection for each task within the set of learned tasks (called <em>closed-world OOD detection</em>) is <em>necessary</em> for successful CIL. We show this by decomposing CIL into two sub-problems: <em>within-task prediction</em> (WP) and <em>task-id prediction</em> (TP), and proving that TP is correlated with closed-world OOD detection. The <em>key theoretical result</em> is that regardless of whether WP and OOD detection (or TP) are defined explicitly or implicitly by a CIL algorithm, good WP and good closed-world OOD detection are <em>necessary</em> and <em>sufficient</em> conditions for good CIL, which unifies novelty or OOD detection and continual learning (CIL, in particular). We call this traditional CIL the <em>closed-world CIL</em> as it does not detect future OOD data in the open world. The paper then proves that the theory can be generalized or extended to <em>open-world CIL</em>, which is the proposed <em>open-world continual learning</em>, that can perform CIL in the open world and detect future or open-world OOD data. Based on the theoretical results, new CIL methods are also designed, which outperform strong baselines in CIL accuracy and in continual OOD detection by a large margin.</div></div>","PeriodicalId":8434,"journal":{"name":"Artificial Intelligence","volume":"338 ","pages":"Article 104237"},"PeriodicalIF":5.1000,"publicationDate":"2024-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Open-world continual learning: Unifying novelty detection and continual learning\",\"authors\":\"Gyuhak Kim ,&nbsp;Changnan Xiao ,&nbsp;Tatsuya Konishi ,&nbsp;Zixuan Ke ,&nbsp;Bing Liu\",\"doi\":\"10.1016/j.artint.2024.104237\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>As AI agents are increasingly used in the real open world with unknowns or novelties, they need the ability to (1) recognize objects that (a) they have learned before and (b) detect items that they have never seen or learned, and (2) learn the new items incrementally to become more and more knowledgeable and powerful. (1) is called <em>novelty detection</em> or <em>out-of-distribution</em> (OOD) <em>detection</em> and (2) is called <em>class incremental learning</em> (CIL), which is a setting of <em>continual learning</em> (CL). In existing research, OOD detection and CIL are regarded as two completely different problems. This paper first provides a theoretical proof that good OOD detection for each task within the set of learned tasks (called <em>closed-world OOD detection</em>) is <em>necessary</em> for successful CIL. We show this by decomposing CIL into two sub-problems: <em>within-task prediction</em> (WP) and <em>task-id prediction</em> (TP), and proving that TP is correlated with closed-world OOD detection. The <em>key theoretical result</em> is that regardless of whether WP and OOD detection (or TP) are defined explicitly or implicitly by a CIL algorithm, good WP and good closed-world OOD detection are <em>necessary</em> and <em>sufficient</em> conditions for good CIL, which unifies novelty or OOD detection and continual learning (CIL, in particular). We call this traditional CIL the <em>closed-world CIL</em> as it does not detect future OOD data in the open world. The paper then proves that the theory can be generalized or extended to <em>open-world CIL</em>, which is the proposed <em>open-world continual learning</em>, that can perform CIL in the open world and detect future or open-world OOD data. Based on the theoretical results, new CIL methods are also designed, which outperform strong baselines in CIL accuracy and in continual OOD detection by a large margin.</div></div>\",\"PeriodicalId\":8434,\"journal\":{\"name\":\"Artificial Intelligence\",\"volume\":\"338 \",\"pages\":\"Article 104237\"},\"PeriodicalIF\":5.1000,\"publicationDate\":\"2024-10-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Artificial Intelligence\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0004370224001735\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0004370224001735","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

摘要

随着人工智能代理越来越多地应用于未知或新奇的真实开放世界,它们需要具备以下能力:(1) 识别(a) 它们以前学习过的对象;(b) 检测它们从未见过或学习过的项目;(2) 逐步学习新项目,以变得越来越博学和强大。(1)被称为新颖性检测或分布外(OOD)检测,(2)被称为类增量学习(CIL),它是持续学习(CL)的一种设置。在现有研究中,OOD 检测和 CIL 被视为两个完全不同的问题。本文首先从理论上证明,要成功实现 CIL,就必须对已学任务集合中的每个任务进行良好的 OOD 检测(称为封闭世界 OOD 检测)。我们将 CIL 分解为两个子问题:任务内预测(WP)和任务 ID 预测(TP),并证明 TP 与封闭世界 OOD 检测相关。关键的理论结果是,无论 WP 和 OOD 检测(或 TP)是由 CIL 算法显式定义还是隐式定义,良好的 WP 和良好的封闭世界 OOD 检测都是良好 CIL 的必要条件和充分条件,它将新颖性或 OOD 检测与持续学习(尤其是 CIL)统一起来。我们称这种传统的 CIL 为封闭世界 CIL,因为它不能检测开放世界中的未来 OOD 数据。本文随后证明,该理论可以推广或扩展到开放世界 CIL,即所提出的开放世界持续学习,它可以在开放世界中执行 CIL 并检测未来或开放世界的 OOD 数据。基于理论结果,还设计了新的 CIL 方法,这些方法在 CIL 准确性和持续 OOD 检测方面大大优于强基准方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Open-world continual learning: Unifying novelty detection and continual learning
As AI agents are increasingly used in the real open world with unknowns or novelties, they need the ability to (1) recognize objects that (a) they have learned before and (b) detect items that they have never seen or learned, and (2) learn the new items incrementally to become more and more knowledgeable and powerful. (1) is called novelty detection or out-of-distribution (OOD) detection and (2) is called class incremental learning (CIL), which is a setting of continual learning (CL). In existing research, OOD detection and CIL are regarded as two completely different problems. This paper first provides a theoretical proof that good OOD detection for each task within the set of learned tasks (called closed-world OOD detection) is necessary for successful CIL. We show this by decomposing CIL into two sub-problems: within-task prediction (WP) and task-id prediction (TP), and proving that TP is correlated with closed-world OOD detection. The key theoretical result is that regardless of whether WP and OOD detection (or TP) are defined explicitly or implicitly by a CIL algorithm, good WP and good closed-world OOD detection are necessary and sufficient conditions for good CIL, which unifies novelty or OOD detection and continual learning (CIL, in particular). We call this traditional CIL the closed-world CIL as it does not detect future OOD data in the open world. The paper then proves that the theory can be generalized or extended to open-world CIL, which is the proposed open-world continual learning, that can perform CIL in the open world and detect future or open-world OOD data. Based on the theoretical results, new CIL methods are also designed, which outperform strong baselines in CIL accuracy and in continual OOD detection by a large margin.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Artificial Intelligence
Artificial Intelligence 工程技术-计算机:人工智能
CiteScore
11.20
自引率
1.40%
发文量
118
审稿时长
8 months
期刊介绍: The Journal of Artificial Intelligence (AIJ) welcomes papers covering a broad spectrum of AI topics, including cognition, automated reasoning, computer vision, machine learning, and more. Papers should demonstrate advancements in AI and propose innovative approaches to AI problems. Additionally, the journal accepts papers describing AI applications, focusing on how new methods enhance performance rather than reiterating conventional approaches. In addition to regular papers, AIJ also accepts Research Notes, Research Field Reviews, Position Papers, Book Reviews, and summary papers on AI challenges and competitions.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信