{"title":"Bootstrap Your Own Prior: Towards Distribution-Agnostic Novel Class Discovery","authors":"Muli Yang, Liancheng Wang, Cheng Deng, Hanwang Zhang","doi":"10.1109/CVPR52729.2023.00337","DOIUrl":null,"url":null,"abstract":"Novel Class Discovery (NCD) aims to discover unknown classes without any annotation, by exploiting the transferable knowledge already learned from a base set of known classes. Existing works hold an impractical assumption that the novel class distribution prior is uniform, yet neglect the imbalanced nature of real-world data. In this paper, we relax this assumption by proposing a new challenging task: distribution-agnostic NCD, which allows data drawn from arbitrary unknown class distributions and thus renders existing methods useless or even harmful. We tackle this challenge by proposing a new method, dubbed “Boot-strapping Your Own Prior (BYOP)”, which iteratively estimates the class prior based on the model prediction it-self. At each iteration, we devise a dynamic temperature technique that better estimates the class prior by encouraging sharper predictions for less-confident samples. Thus, BYOP obtains more accurate pseudo-labels for the novel samples, which are beneficial for the next training iteration. Extensive experiments show that existing methods suffer from imbalanced class distributions, while BYOp11Code: https://github.com/muliyangm/BYOP. out-performs them by clear margins, demonstrating its effectiveness across various distribution scenarios.","PeriodicalId":376416,"journal":{"name":"2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CVPR52729.2023.00337","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
Novel Class Discovery (NCD) aims to discover unknown classes without any annotation, by exploiting the transferable knowledge already learned from a base set of known classes. Existing works hold an impractical assumption that the novel class distribution prior is uniform, yet neglect the imbalanced nature of real-world data. In this paper, we relax this assumption by proposing a new challenging task: distribution-agnostic NCD, which allows data drawn from arbitrary unknown class distributions and thus renders existing methods useless or even harmful. We tackle this challenge by proposing a new method, dubbed “Boot-strapping Your Own Prior (BYOP)”, which iteratively estimates the class prior based on the model prediction it-self. At each iteration, we devise a dynamic temperature technique that better estimates the class prior by encouraging sharper predictions for less-confident samples. Thus, BYOP obtains more accurate pseudo-labels for the novel samples, which are beneficial for the next training iteration. Extensive experiments show that existing methods suffer from imbalanced class distributions, while BYOp11Code: https://github.com/muliyangm/BYOP. out-performs them by clear margins, demonstrating its effectiveness across various distribution scenarios.
新类发现(NCD)的目的是利用从已知类的基本集合中已经学习到的可转移知识,在没有任何注释的情况下发现未知类。现有的工作不切实际地假设新的类分布先验是均匀的,而忽视了现实世界数据的不平衡性。在本文中,我们通过提出一个新的具有挑战性的任务来放松这个假设:分布不可知的NCD,它允许从任意未知的类分布中提取数据,从而使现有的方法无用甚至有害。为了解决这个问题,我们提出了一种新方法,称为“bootstrapping Your Own Prior (BYOP)”,该方法基于模型本身的预测迭代估计类先验。在每次迭代中,我们设计了一种动态温度技术,通过鼓励对不太自信的样本进行更精确的预测来更好地估计先验类。因此,BYOP为新样本获得了更准确的伪标签,这有利于下一次训练迭代。大量实验表明,现有方法存在类分布不平衡的问题,而BYOp11Code: https://github.com/muliyangm/BYOP。表现明显优于它们,证明其在各种分销场景中的有效性。