基于渐进式背景前景差分增强的少镜头三维点云语义分割

IF 4.2 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Image and Vision Computing Pub Date : 2025-07-15 DOI:10.1016/j.imavis.2025.105656

Tichao Wang , Fusheng Hao , Qieshi Zhang , Jun Cheng

{"title":"基于渐进式背景前景差分增强的少镜头三维点云语义分割","authors":"Tichao Wang , Fusheng Hao , Qieshi Zhang , Jun Cheng","doi":"10.1016/j.imavis.2025.105656","DOIUrl":null,"url":null,"abstract":"<div><div>Few-shot 3D point cloud semantic segmentation aims to segment query point clouds given only few annotated support point clouds. Most existing methods focus on exploring the complex relationships between support data and query data within the prototype-based framework. However, the ignored background ambiguity issue, i.e., the foregrounds of a support class are treated as backgrounds by other support classes, severely limits few-shot models’ ability to distinguish foregrounds and backgrounds, resulting in biased prototypes. In this paper, we propose a progressive background–foreground difference enhancement method to eliminate background ambiguity. Firstly, based on the fact that the background ambiguity only affects background prototypes, we develop a background–foreground difference enhancement strategy, which eliminates background ambiguity via enhancing the difference between foregrounds and backgrounds in query data. Then, we present a geometric-guided feature aggregation module, which integrates geometrical information to improve the reliability of pseudo labels. Finally, we aggregate high-confidence query features as pseudo prototypes to refine the prototypes. The iteration of these steps further improves prototype quality. Comprehensive experiments suggest that our method achieves competing performance on both S3DIS and ScanNet datasets.</div></div>","PeriodicalId":50374,"journal":{"name":"Image and Vision Computing","volume":"162 ","pages":"Article 105656"},"PeriodicalIF":4.2000,"publicationDate":"2025-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Progressive background–foreground difference enhancement for few-shot 3D point cloud semantic segmentation\",\"authors\":\"Tichao Wang , Fusheng Hao , Qieshi Zhang , Jun Cheng\",\"doi\":\"10.1016/j.imavis.2025.105656\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Few-shot 3D point cloud semantic segmentation aims to segment query point clouds given only few annotated support point clouds. Most existing methods focus on exploring the complex relationships between support data and query data within the prototype-based framework. However, the ignored background ambiguity issue, i.e., the foregrounds of a support class are treated as backgrounds by other support classes, severely limits few-shot models’ ability to distinguish foregrounds and backgrounds, resulting in biased prototypes. In this paper, we propose a progressive background–foreground difference enhancement method to eliminate background ambiguity. Firstly, based on the fact that the background ambiguity only affects background prototypes, we develop a background–foreground difference enhancement strategy, which eliminates background ambiguity via enhancing the difference between foregrounds and backgrounds in query data. Then, we present a geometric-guided feature aggregation module, which integrates geometrical information to improve the reliability of pseudo labels. Finally, we aggregate high-confidence query features as pseudo prototypes to refine the prototypes. The iteration of these steps further improves prototype quality. Comprehensive experiments suggest that our method achieves competing performance on both S3DIS and ScanNet datasets.</div></div>\",\"PeriodicalId\":50374,\"journal\":{\"name\":\"Image and Vision Computing\",\"volume\":\"162 \",\"pages\":\"Article 105656\"},\"PeriodicalIF\":4.2000,\"publicationDate\":\"2025-07-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Image and Vision Computing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0262885625002446\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Image and Vision Computing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0262885625002446","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

摘要

少镜头三维点云语义分割的目的是在只有少量注释支持点云的情况下对查询点云进行分割。大多数现有方法都侧重于在基于原型的框架中探索支持数据和查询数据之间的复杂关系。然而，由于忽略了背景模糊问题，即一个支持类的前景被其他支持类视为背景，严重限制了少拍模型区分前景和背景的能力，从而导致原型的偏差。本文提出了一种渐进式背景前景差增强方法来消除背景模糊。首先，基于背景模糊只影响背景原型的事实，我们开发了一种背景前景差异增强策略，通过增强查询数据中前景和背景的差异来消除背景模糊。然后，我们提出了一个几何引导的特征聚合模块，该模块集成了几何信息，以提高伪标签的可靠性。最后，我们将高置信度的查询特征聚合为伪原型，以细化原型。这些步骤的迭代进一步提高了原型的质量。综合实验表明，我们的方法在S3DIS和ScanNet数据集上都取得了相当的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Progressive background–foreground difference enhancement for few-shot 3D point cloud semantic segmentation

Few-shot 3D point cloud semantic segmentation aims to segment query point clouds given only few annotated support point clouds. Most existing methods focus on exploring the complex relationships between support data and query data within the prototype-based framework. However, the ignored background ambiguity issue, i.e., the foregrounds of a support class are treated as backgrounds by other support classes, severely limits few-shot models’ ability to distinguish foregrounds and backgrounds, resulting in biased prototypes. In this paper, we propose a progressive background–foreground difference enhancement method to eliminate background ambiguity. Firstly, based on the fact that the background ambiguity only affects background prototypes, we develop a background–foreground difference enhancement strategy, which eliminates background ambiguity via enhancing the difference between foregrounds and backgrounds in query data. Then, we present a geometric-guided feature aggregation module, which integrates geometrical information to improve the reliability of pseudo labels. Finally, we aggregate high-confidence query features as pseudo prototypes to refine the prototypes. The iteration of these steps further improves prototype quality. Comprehensive experiments suggest that our method achieves competing performance on both S3DIS and ScanNet datasets.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Image and Vision Computing 工程技术-工程：电子与电气

CiteScore

8.50

自引率

8.50%

发文量

143

审稿时长

7.8 months

期刊介绍： Image and Vision Computing has as a primary aim the provision of an effective medium of interchange for the results of high quality theoretical and applied research fundamental to all aspects of image interpretation and computer vision. The journal publishes work that proposes new image interpretation and computer vision methodology or addresses the application of such methods to real world scenes. It seeks to strengthen a deeper understanding in the discipline by encouraging the quantitative comparison and performance evaluation of the proposed methodology. The coverage includes: image interpretation, scene modelling, object recognition and tracking, shape analysis, monitoring and surveillance, active vision and robotic systems, SLAM, biologically-inspired computer vision, motion analysis, stereo vision, document image understanding, character and handwritten text recognition, face and gesture recognition, biometrics, vision-based human-computer interaction, human activity and behavior understanding, data fusion from multiple sensor inputs, image databases.