Tichao Wang , Fusheng Hao , Qieshi Zhang , Jun Cheng
{"title":"基于渐进式背景前景差分增强的少镜头三维点云语义分割","authors":"Tichao Wang , Fusheng Hao , Qieshi Zhang , Jun Cheng","doi":"10.1016/j.imavis.2025.105656","DOIUrl":null,"url":null,"abstract":"<div><div>Few-shot 3D point cloud semantic segmentation aims to segment query point clouds given only few annotated support point clouds. Most existing methods focus on exploring the complex relationships between support data and query data within the prototype-based framework. However, the ignored background ambiguity issue, i.e., the foregrounds of a support class are treated as backgrounds by other support classes, severely limits few-shot models’ ability to distinguish foregrounds and backgrounds, resulting in biased prototypes. In this paper, we propose a progressive background–foreground difference enhancement method to eliminate background ambiguity. Firstly, based on the fact that the background ambiguity only affects background prototypes, we develop a background–foreground difference enhancement strategy, which eliminates background ambiguity via enhancing the difference between foregrounds and backgrounds in query data. Then, we present a geometric-guided feature aggregation module, which integrates geometrical information to improve the reliability of pseudo labels. Finally, we aggregate high-confidence query features as pseudo prototypes to refine the prototypes. The iteration of these steps further improves prototype quality. Comprehensive experiments suggest that our method achieves competing performance on both S3DIS and ScanNet datasets.</div></div>","PeriodicalId":50374,"journal":{"name":"Image and Vision Computing","volume":"162 ","pages":"Article 105656"},"PeriodicalIF":4.2000,"publicationDate":"2025-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Progressive background–foreground difference enhancement for few-shot 3D point cloud semantic segmentation\",\"authors\":\"Tichao Wang , Fusheng Hao , Qieshi Zhang , Jun Cheng\",\"doi\":\"10.1016/j.imavis.2025.105656\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Few-shot 3D point cloud semantic segmentation aims to segment query point clouds given only few annotated support point clouds. Most existing methods focus on exploring the complex relationships between support data and query data within the prototype-based framework. However, the ignored background ambiguity issue, i.e., the foregrounds of a support class are treated as backgrounds by other support classes, severely limits few-shot models’ ability to distinguish foregrounds and backgrounds, resulting in biased prototypes. In this paper, we propose a progressive background–foreground difference enhancement method to eliminate background ambiguity. Firstly, based on the fact that the background ambiguity only affects background prototypes, we develop a background–foreground difference enhancement strategy, which eliminates background ambiguity via enhancing the difference between foregrounds and backgrounds in query data. Then, we present a geometric-guided feature aggregation module, which integrates geometrical information to improve the reliability of pseudo labels. Finally, we aggregate high-confidence query features as pseudo prototypes to refine the prototypes. The iteration of these steps further improves prototype quality. Comprehensive experiments suggest that our method achieves competing performance on both S3DIS and ScanNet datasets.</div></div>\",\"PeriodicalId\":50374,\"journal\":{\"name\":\"Image and Vision Computing\",\"volume\":\"162 \",\"pages\":\"Article 105656\"},\"PeriodicalIF\":4.2000,\"publicationDate\":\"2025-07-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Image and Vision Computing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0262885625002446\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Image and Vision Computing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0262885625002446","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Progressive background–foreground difference enhancement for few-shot 3D point cloud semantic segmentation
Few-shot 3D point cloud semantic segmentation aims to segment query point clouds given only few annotated support point clouds. Most existing methods focus on exploring the complex relationships between support data and query data within the prototype-based framework. However, the ignored background ambiguity issue, i.e., the foregrounds of a support class are treated as backgrounds by other support classes, severely limits few-shot models’ ability to distinguish foregrounds and backgrounds, resulting in biased prototypes. In this paper, we propose a progressive background–foreground difference enhancement method to eliminate background ambiguity. Firstly, based on the fact that the background ambiguity only affects background prototypes, we develop a background–foreground difference enhancement strategy, which eliminates background ambiguity via enhancing the difference between foregrounds and backgrounds in query data. Then, we present a geometric-guided feature aggregation module, which integrates geometrical information to improve the reliability of pseudo labels. Finally, we aggregate high-confidence query features as pseudo prototypes to refine the prototypes. The iteration of these steps further improves prototype quality. Comprehensive experiments suggest that our method achieves competing performance on both S3DIS and ScanNet datasets.
期刊介绍:
Image and Vision Computing has as a primary aim the provision of an effective medium of interchange for the results of high quality theoretical and applied research fundamental to all aspects of image interpretation and computer vision. The journal publishes work that proposes new image interpretation and computer vision methodology or addresses the application of such methods to real world scenes. It seeks to strengthen a deeper understanding in the discipline by encouraging the quantitative comparison and performance evaluation of the proposed methodology. The coverage includes: image interpretation, scene modelling, object recognition and tracking, shape analysis, monitoring and surveillance, active vision and robotic systems, SLAM, biologically-inspired computer vision, motion analysis, stereo vision, document image understanding, character and handwritten text recognition, face and gesture recognition, biometrics, vision-based human-computer interaction, human activity and behavior understanding, data fusion from multiple sensor inputs, image databases.