{"title":"Prototype optimization and self-training for few-shot 3D point cloud semantic segmentation","authors":"Jie Zhou , Yong Zhao , Fan Zhong","doi":"10.1016/j.cad.2025.103976","DOIUrl":null,"url":null,"abstract":"<div><div>Few-shot point cloud segmentation aims to accurately decompose 3D point clouds into different semantic classes with few samples, and is crucial for subsequent tasks, such as analysis, modeling and editing. Despite the popularity of prototype-based approaches, prototypes often fail to adequately capture class-specific information. Therefore, for each class, a few points may exhibit significant differences from their prototype. And the lack of sufficient distinction between foreground and background prototypes presents a great challenge for precise segmentation. To address these issues, we propose a prototype optimization module to mitigate the interference among support prototypes, thereby generating prototypes of superior quality. These refined prototypes are capable of capturing the key characteristics of the data, which can prominently improve the generalization capability of our model. Then, we devise a self-training strategy that leverages pseudo query prototypes generated from high-confidence predicted labels. These prototypes are applied to query features to produce pseudo query labels and formulate a reconstruction constraint during training. By harnessing the contextual information embedded within query features, this approach significantly elevates segmentation performance. Extensive results on two popular benchmark datasets validate the superiority of our model, especially in the challenging 1-shot settings. Under the classic experimental setup, our method surpasses existing state-of-the-arts by 2.64% in 2-way 1-shot setting on the S3DIS dataset. On the ScanNet dataset, the improvements are 7.58% in 2-way 1-shot setting and 6.44% in 3-way 1-shot setting, respectively.</div></div>","PeriodicalId":50632,"journal":{"name":"Computer-Aided Design","volume":"190 ","pages":"Article 103976"},"PeriodicalIF":3.1000,"publicationDate":"2025-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer-Aided Design","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S001044852500137X","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
引用次数: 0
Abstract
Few-shot point cloud segmentation aims to accurately decompose 3D point clouds into different semantic classes with few samples, and is crucial for subsequent tasks, such as analysis, modeling and editing. Despite the popularity of prototype-based approaches, prototypes often fail to adequately capture class-specific information. Therefore, for each class, a few points may exhibit significant differences from their prototype. And the lack of sufficient distinction between foreground and background prototypes presents a great challenge for precise segmentation. To address these issues, we propose a prototype optimization module to mitigate the interference among support prototypes, thereby generating prototypes of superior quality. These refined prototypes are capable of capturing the key characteristics of the data, which can prominently improve the generalization capability of our model. Then, we devise a self-training strategy that leverages pseudo query prototypes generated from high-confidence predicted labels. These prototypes are applied to query features to produce pseudo query labels and formulate a reconstruction constraint during training. By harnessing the contextual information embedded within query features, this approach significantly elevates segmentation performance. Extensive results on two popular benchmark datasets validate the superiority of our model, especially in the challenging 1-shot settings. Under the classic experimental setup, our method surpasses existing state-of-the-arts by 2.64% in 2-way 1-shot setting on the S3DIS dataset. On the ScanNet dataset, the improvements are 7.58% in 2-way 1-shot setting and 6.44% in 3-way 1-shot setting, respectively.
期刊介绍:
Computer-Aided Design is a leading international journal that provides academia and industry with key papers on research and developments in the application of computers to design.
Computer-Aided Design invites papers reporting new research, as well as novel or particularly significant applications, within a wide range of topics, spanning all stages of design process from concept creation to manufacture and beyond.