HFA-Net: hybrid feature-aware network for large-scale point cloud semantic segmentation

IF 10.7 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Changji Wen, Long Zhang, Junfeng Ren, Rundong Hong, Chenshuang Li, Ce Yang, Yanfeng Lv, Hongbing Chen, Ning yang
{"title":"HFA-Net: hybrid feature-aware network for large-scale point cloud semantic segmentation","authors":"Changji Wen,&nbsp;Long Zhang,&nbsp;Junfeng Ren,&nbsp;Rundong Hong,&nbsp;Chenshuang Li,&nbsp;Ce Yang,&nbsp;Yanfeng Lv,&nbsp;Hongbing Chen,&nbsp;Ning yang","doi":"10.1007/s10462-025-11111-2","DOIUrl":null,"url":null,"abstract":"<div><p>Semantic segmentation of large-scale point clouds in 3D computer vision is a challenging problem. Existing feature extraction modules often emphasize learning local geometry while not giving adequate consideration to the integration of color information. This limitation prevents the network from thoroughly learning local features, thereby impacting segmentation accuracy. In this study, we propose three modules for robust feature extraction and aggregation, forming a novel point cloud segmentation network (HFA-Net) for large-scale point cloud semantic segmentation. First, we introduce the Hybrid Feature Extraction Component (HFEC) and the Hybrid Bilateral Enhancement Component (HBAC) to comprehensively extract and enhance the geometric, color, and semantic information of point clouds. Second, we incorporate the Ternary-Distance Attention Pooling (TDAP) module, which leverages trilateral distances to further refine the network’s focus on various features, enabling it to emphasize both locally important features and broader local neighborhoods. These modules are stacked into dense residual components to expand the network’s receptive field. Our experiments on several large-scale benchmark datasets, including Semantic3D, Toronto3D, S3DIS and LASDU demonstrate the effectiveness of HFA-Net when compared to state-of-the-art networks.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"58 4","pages":""},"PeriodicalIF":10.7000,"publicationDate":"2025-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11111-2.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial Intelligence Review","FirstCategoryId":"94","ListUrlMain":"https://link.springer.com/article/10.1007/s10462-025-11111-2","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Semantic segmentation of large-scale point clouds in 3D computer vision is a challenging problem. Existing feature extraction modules often emphasize learning local geometry while not giving adequate consideration to the integration of color information. This limitation prevents the network from thoroughly learning local features, thereby impacting segmentation accuracy. In this study, we propose three modules for robust feature extraction and aggregation, forming a novel point cloud segmentation network (HFA-Net) for large-scale point cloud semantic segmentation. First, we introduce the Hybrid Feature Extraction Component (HFEC) and the Hybrid Bilateral Enhancement Component (HBAC) to comprehensively extract and enhance the geometric, color, and semantic information of point clouds. Second, we incorporate the Ternary-Distance Attention Pooling (TDAP) module, which leverages trilateral distances to further refine the network’s focus on various features, enabling it to emphasize both locally important features and broader local neighborhoods. These modules are stacked into dense residual components to expand the network’s receptive field. Our experiments on several large-scale benchmark datasets, including Semantic3D, Toronto3D, S3DIS and LASDU demonstrate the effectiveness of HFA-Net when compared to state-of-the-art networks.

HFA-Net:用于大规模点云语义分割的混合特征感知网络
大规模点云的语义分割是三维计算机视觉中一个具有挑战性的问题。现有的特征提取模块往往强调局部几何的学习,而没有充分考虑颜色信息的融合。这个限制阻碍了网络对局部特征的彻底学习,从而影响了分割的准确性。在这项研究中,我们提出了三个鲁棒特征提取和聚合模块,形成了一种新的点云分割网络(HFA-Net),用于大规模点云语义分割。首先,引入混合特征提取分量(HFEC)和混合双边增强分量(HBAC),对点云的几何、颜色和语义信息进行综合提取和增强;其次,我们整合了三边距离注意池(TDAP)模块,该模块利用三边距离进一步优化网络对各种特征的关注,使其能够强调本地重要特征和更广泛的本地社区。这些模块被堆叠成密集的残余组件,以扩大网络的接受域。我们在几个大型基准数据集上的实验,包括Semantic3D、Toronto3D、S3DIS和LASDU,与最先进的网络相比,证明了HFA-Net的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Artificial Intelligence Review
Artificial Intelligence Review 工程技术-计算机:人工智能
CiteScore
22.00
自引率
3.30%
发文量
194
审稿时长
5.3 months
期刊介绍: Artificial Intelligence Review, a fully open access journal, publishes cutting-edge research in artificial intelligence and cognitive science. It features critical evaluations of applications, techniques, and algorithms, providing a platform for both researchers and application developers. The journal includes refereed survey and tutorial articles, along with reviews and commentary on significant developments in the field.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信