HFA-Net: hybrid feature-aware network for large-scale point cloud semantic segmentation

IF 10.7 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Artificial Intelligence Review Pub Date : 2025-01-25 DOI:10.1007/s10462-025-11111-2

Changji Wen, Long Zhang, Junfeng Ren, Rundong Hong, Chenshuang Li, Ce Yang, Yanfeng Lv, Hongbing Chen, Ning yang

{"title":"HFA-Net: hybrid feature-aware network for large-scale point cloud semantic segmentation","authors":"Changji Wen, Long Zhang, Junfeng Ren, Rundong Hong, Chenshuang Li, Ce Yang, Yanfeng Lv, Hongbing Chen, Ning yang","doi":"10.1007/s10462-025-11111-2","DOIUrl":null,"url":null,"abstract":"<div><p>Semantic segmentation of large-scale point clouds in 3D computer vision is a challenging problem. Existing feature extraction modules often emphasize learning local geometry while not giving adequate consideration to the integration of color information. This limitation prevents the network from thoroughly learning local features, thereby impacting segmentation accuracy. In this study, we propose three modules for robust feature extraction and aggregation, forming a novel point cloud segmentation network (HFA-Net) for large-scale point cloud semantic segmentation. First, we introduce the Hybrid Feature Extraction Component (HFEC) and the Hybrid Bilateral Enhancement Component (HBAC) to comprehensively extract and enhance the geometric, color, and semantic information of point clouds. Second, we incorporate the Ternary-Distance Attention Pooling (TDAP) module, which leverages trilateral distances to further refine the network’s focus on various features, enabling it to emphasize both locally important features and broader local neighborhoods. These modules are stacked into dense residual components to expand the network’s receptive field. Our experiments on several large-scale benchmark datasets, including Semantic3D, Toronto3D, S3DIS and LASDU demonstrate the effectiveness of HFA-Net when compared to state-of-the-art networks.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"58 4","pages":""},"PeriodicalIF":10.7000,"publicationDate":"2025-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11111-2.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial Intelligence Review","FirstCategoryId":"94","ListUrlMain":"https://link.springer.com/article/10.1007/s10462-025-11111-2","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Semantic segmentation of large-scale point clouds in 3D computer vision is a challenging problem. Existing feature extraction modules often emphasize learning local geometry while not giving adequate consideration to the integration of color information. This limitation prevents the network from thoroughly learning local features, thereby impacting segmentation accuracy. In this study, we propose three modules for robust feature extraction and aggregation, forming a novel point cloud segmentation network (HFA-Net) for large-scale point cloud semantic segmentation. First, we introduce the Hybrid Feature Extraction Component (HFEC) and the Hybrid Bilateral Enhancement Component (HBAC) to comprehensively extract and enhance the geometric, color, and semantic information of point clouds. Second, we incorporate the Ternary-Distance Attention Pooling (TDAP) module, which leverages trilateral distances to further refine the network’s focus on various features, enabling it to emphasize both locally important features and broader local neighborhoods. These modules are stacked into dense residual components to expand the network’s receptive field. Our experiments on several large-scale benchmark datasets, including Semantic3D, Toronto3D, S3DIS and LASDU demonstrate the effectiveness of HFA-Net when compared to state-of-the-art networks.

查看原文本刊更多论文

HFA-Net：用于大规模点云语义分割的混合特征感知网络

大规模点云的语义分割是三维计算机视觉中一个具有挑战性的问题。现有的特征提取模块往往强调局部几何的学习，而没有充分考虑颜色信息的融合。这个限制阻碍了网络对局部特征的彻底学习，从而影响了分割的准确性。在这项研究中，我们提出了三个鲁棒特征提取和聚合模块，形成了一种新的点云分割网络（HFA-Net），用于大规模点云语义分割。首先，引入混合特征提取分量（HFEC）和混合双边增强分量（HBAC），对点云的几何、颜色和语义信息进行综合提取和增强；其次，我们整合了三边距离注意池（TDAP）模块，该模块利用三边距离进一步优化网络对各种特征的关注，使其能够强调本地重要特征和更广泛的本地社区。这些模块被堆叠成密集的残余组件，以扩大网络的接受域。我们在几个大型基准数据集上的实验，包括Semantic3D、Toronto3D、S3DIS和LASDU，与最先进的网络相比，证明了HFA-Net的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Artificial Intelligence Review 工程技术-计算机：人工智能

CiteScore

22.00

自引率

3.30%

发文量

194

审稿时长

5.3 months

期刊介绍： Artificial Intelligence Review, a fully open access journal, publishes cutting-edge research in artificial intelligence and cognitive science. It features critical evaluations of applications, techniques, and algorithms, providing a platform for both researchers and application developers. The journal includes refereed survey and tutorial articles, along with reviews and commentary on significant developments in the field.