Prototype optimization and self-training for few-shot 3D point cloud semantic segmentation

IF 3.1 3区 计算机科学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING
Jie Zhou , Yong Zhao , Fan Zhong
{"title":"Prototype optimization and self-training for few-shot 3D point cloud semantic segmentation","authors":"Jie Zhou ,&nbsp;Yong Zhao ,&nbsp;Fan Zhong","doi":"10.1016/j.cad.2025.103976","DOIUrl":null,"url":null,"abstract":"<div><div>Few-shot point cloud segmentation aims to accurately decompose 3D point clouds into different semantic classes with few samples, and is crucial for subsequent tasks, such as analysis, modeling and editing. Despite the popularity of prototype-based approaches, prototypes often fail to adequately capture class-specific information. Therefore, for each class, a few points may exhibit significant differences from their prototype. And the lack of sufficient distinction between foreground and background prototypes presents a great challenge for precise segmentation. To address these issues, we propose a prototype optimization module to mitigate the interference among support prototypes, thereby generating prototypes of superior quality. These refined prototypes are capable of capturing the key characteristics of the data, which can prominently improve the generalization capability of our model. Then, we devise a self-training strategy that leverages pseudo query prototypes generated from high-confidence predicted labels. These prototypes are applied to query features to produce pseudo query labels and formulate a reconstruction constraint during training. By harnessing the contextual information embedded within query features, this approach significantly elevates segmentation performance. Extensive results on two popular benchmark datasets validate the superiority of our model, especially in the challenging 1-shot settings. Under the classic experimental setup, our method surpasses existing state-of-the-arts by 2.64% in 2-way 1-shot setting on the S3DIS dataset. On the ScanNet dataset, the improvements are 7.58% in 2-way 1-shot setting and 6.44% in 3-way 1-shot setting, respectively.</div></div>","PeriodicalId":50632,"journal":{"name":"Computer-Aided Design","volume":"190 ","pages":"Article 103976"},"PeriodicalIF":3.1000,"publicationDate":"2025-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer-Aided Design","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S001044852500137X","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
引用次数: 0

Abstract

Few-shot point cloud segmentation aims to accurately decompose 3D point clouds into different semantic classes with few samples, and is crucial for subsequent tasks, such as analysis, modeling and editing. Despite the popularity of prototype-based approaches, prototypes often fail to adequately capture class-specific information. Therefore, for each class, a few points may exhibit significant differences from their prototype. And the lack of sufficient distinction between foreground and background prototypes presents a great challenge for precise segmentation. To address these issues, we propose a prototype optimization module to mitigate the interference among support prototypes, thereby generating prototypes of superior quality. These refined prototypes are capable of capturing the key characteristics of the data, which can prominently improve the generalization capability of our model. Then, we devise a self-training strategy that leverages pseudo query prototypes generated from high-confidence predicted labels. These prototypes are applied to query features to produce pseudo query labels and formulate a reconstruction constraint during training. By harnessing the contextual information embedded within query features, this approach significantly elevates segmentation performance. Extensive results on two popular benchmark datasets validate the superiority of our model, especially in the challenging 1-shot settings. Under the classic experimental setup, our method surpasses existing state-of-the-arts by 2.64% in 2-way 1-shot setting on the S3DIS dataset. On the ScanNet dataset, the improvements are 7.58% in 2-way 1-shot setting and 6.44% in 3-way 1-shot setting, respectively.
少镜头三维点云语义分割的原型优化与自训练
少镜头点云分割旨在用较少的样本将三维点云准确分解为不同的语义类,对后续的分析、建模和编辑等任务至关重要。尽管基于原型的方法很流行,但原型常常不能充分捕获特定于类的信息。因此,对于每个类,可能会有一些点与其原型有显著差异。前景和背景原型之间缺乏足够的区分,这给精确分割带来了很大的挑战。为了解决这些问题,我们提出了一个原型优化模块,以减轻支持原型之间的干扰,从而产生高质量的原型。这些精细化的原型能够捕获数据的关键特征,这可以显著提高我们模型的泛化能力。然后,我们设计了一种自我训练策略,利用高置信度预测标签生成的伪查询原型。将这些原型应用于查询特征,生成伪查询标签,并在训练过程中形成重构约束。通过利用嵌入在查询特征中的上下文信息,这种方法显著提高了分割性能。在两个流行的基准数据集上的广泛结果验证了我们模型的优越性,特别是在具有挑战性的1次射击设置中。在经典的实验设置下,我们的方法在S3DIS数据集上的2-way 1-shot设置比现有的最先进的方法高出2.64%。在ScanNet数据集上,2-way 1-shot设置的改进率为7.58%,3-way 1-shot设置的改进率为6.44%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Computer-Aided Design
Computer-Aided Design 工程技术-计算机:软件工程
CiteScore
5.50
自引率
4.70%
发文量
117
审稿时长
4.2 months
期刊介绍: Computer-Aided Design is a leading international journal that provides academia and industry with key papers on research and developments in the application of computers to design. Computer-Aided Design invites papers reporting new research, as well as novel or particularly significant applications, within a wide range of topics, spanning all stages of design process from concept creation to manufacture and beyond.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信