W-DOE: Wasserstein Distribution-Agnostic Outlier Exposure

IF 18.6
Qizhou Wang;Bo Han;Yang Liu;Chen Gong;Tongliang Liu;Jiming Liu
Journal: IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 47, no. 5, pp. 3530-3545
Published: 2025-01-17
Publication type: Journal Article
DOI: 10.1109/TPAMI.2025.3531000
Source: https://ieeexplore.ieee.org/document/10844561/
Citations: 0

Abstract

In open-world environments, classification models should be adept at identifying out-of-distribution (OOD) data whose semantics differ from in-distribution (ID) data, which has motivated research on OOD detection. As a promising learning scheme, outlier exposure (OE) enables models to learn from auxiliary OOD data, improving the representations they use to discern between ID and OOD patterns. However, these auxiliary OOD data often do not fully represent real OOD scenarios, potentially biasing the models in practical OOD detection. Hence, we propose a novel OE-based learning method termed Wasserstein Distribution-agnostic Outlier Exposure (W-DOE), which is both theoretically sound and experimentally superior to previous works. The intuition is that by expanding the coverage of training-time OOD data, the models will encounter fewer unseen OOD cases upon deployment. In W-DOE, we obtain additional OOD data to enlarge the OOD coverage through a new data synthesis approach called implicit data synthesis (IDS). It is driven by our new insight that perturbing model parameters can lead to implicit data transformation, which is simple to implement yet effective. Furthermore, we suggest a general learning framework that searches for the synthesized OOD data that benefit the models most, ensuring OOD performance over the enlarged OOD coverage, as measured by the Wasserstein metric. Our approach comes with provable guarantees for open-world settings, demonstrating that broader OOD coverage ensures reduced estimation errors and thereby improved generalization to real OOD cases. We conduct extensive experiments across a series of representative OOD detection setups, further validating the superiority of W-DOE over state-of-the-art counterparts in the field.
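
For readers unfamiliar with outlier exposure, the following is a minimal sketch of the commonly used OE training objective that the abstract builds on: cross-entropy on labeled ID samples plus a term that pushes predictions on auxiliary OOD samples toward the uniform distribution. It is not the W-DOE method itself and does not implement implicit data synthesis or the Wasserstein-based search. The function names and the weight `lam=0.5` are assumptions chosen for illustration.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(logits, label):
    """Standard cross-entropy for one ID sample with an integer label."""
    return -math.log(softmax(logits)[label])


def uniform_cross_entropy(logits):
    """Cross-entropy between a uniform target and the predicted distribution.

    It is smallest when the prediction is itself uniform, so minimizing it
    discourages confident predictions on OOD inputs.
    """
    probs = softmax(logits)
    return -sum(math.log(p) for p in probs) / len(probs)


def oe_loss(id_batch, ood_batch, lam=0.5):
    """Outlier-exposure style loss (illustrative; `lam` is an assumed weight).

    id_batch:  list of (logits, label) pairs for in-distribution samples
    ood_batch: list of logits for auxiliary OOD samples
    """
    id_term = sum(cross_entropy(l, y) for l, y in id_batch) / len(id_batch)
    ood_term = sum(uniform_cross_entropy(l) for l in ood_batch) / len(ood_batch)
    return id_term + lam * ood_term


if __name__ == "__main__":
    id_batch = [([2.0, 0.0, 0.0], 0)]
    flat_ood = [[0.0, 0.0, 0.0]]       # model is already uncertain on this OOD input
    confident_ood = [[5.0, 0.0, 0.0]]  # model is overconfident on this OOD input
    print(oe_loss(id_batch, flat_ood))
    print(oe_loss(id_batch, confident_ood))
```

In this sketch, the overconfident OOD prediction yields the larger loss, which is the pressure that OE applies during training. As the abstract describes, W-DOE additionally enlarges the set of OOD data seen in training by perturbing model parameters.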