Domain-adaptive point cloud semantic segmentation via knowledge-augmented deep learning

IF 7.6 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Fengyu Xu , Yongxiong Xiao , Jilin Mei , Yu Hu , Qiang Fu , Hang Shi
{"title":"Domain-adaptive point cloud semantic segmentation via knowledge-augmented deep learning","authors":"Fengyu Xu ,&nbsp;Yongxiong Xiao ,&nbsp;Jilin Mei ,&nbsp;Yu Hu ,&nbsp;Qiang Fu ,&nbsp;Hang Shi","doi":"10.1016/j.patcog.2025.112528","DOIUrl":null,"url":null,"abstract":"<div><div>Owing to limitations in both the quality and quantity of training data, supervised deep learning methods for point cloud semantic segmentation often exhibit poor generalization and hallucinations, particularly when unstructured scenes are processed in autonomous driving applications. These challenges motivate us to develop a domain-adaptive point cloud semantic segmentation network based on knowledge-augmented deep learning, which can be applied to off-road scenes after being trained on urban datasets. Specifically, our method adopts the strategy of combining implicit and explicit knowledge augmentation. Moreover, network modulation based on attribution analysis is employed to integrate domain knowledge into the deep learning model, thereby mitigating the model’s susceptibility to spurious correlations. Additionally, we propose new network modulation performance metrics to evaluate the efficiency and benefit of modulation correction. For experimental validation, we use datasets with significant disparities between the urban and off-road environments in the training and testing phases, respectively. Furthermore, we release our self-collected off-road dataset PCSS-Gobi-3D, which is the first point cloud dataset of the Gobi Desert scene. Compared with other domain-adaptive methods, our method demonstrates promising cross-domain 3D semantic segmentation performance with excellent results in terms of mean Intersection over Union (mIoU) and overall accuracy.</div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"172 ","pages":"Article 112528"},"PeriodicalIF":7.6000,"publicationDate":"2025-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0031320325011914","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Owing to limitations in both the quality and quantity of training data, supervised deep learning methods for point cloud semantic segmentation often exhibit poor generalization and hallucinations, particularly when unstructured scenes are processed in autonomous driving applications. These challenges motivate us to develop a domain-adaptive point cloud semantic segmentation network based on knowledge-augmented deep learning, which can be applied to off-road scenes after being trained on urban datasets. Specifically, our method adopts the strategy of combining implicit and explicit knowledge augmentation. Moreover, network modulation based on attribution analysis is employed to integrate domain knowledge into the deep learning model, thereby mitigating the model’s susceptibility to spurious correlations. Additionally, we propose new network modulation performance metrics to evaluate the efficiency and benefit of modulation correction. For experimental validation, we use datasets with significant disparities between the urban and off-road environments in the training and testing phases, respectively. Furthermore, we release our self-collected off-road dataset PCSS-Gobi-3D, which is the first point cloud dataset of the Gobi Desert scene. Compared with other domain-adaptive methods, our method demonstrates promising cross-domain 3D semantic segmentation performance with excellent results in terms of mean Intersection over Union (mIoU) and overall accuracy.

Abstract Image

基于知识增强深度学习的域自适应点云语义分割
由于训练数据的质量和数量的限制,点云语义分割的监督深度学习方法往往表现出较差的泛化和幻觉,特别是在自动驾驶应用中处理非结构化场景时。这些挑战促使我们开发基于知识增强深度学习的领域自适应点云语义分割网络,该网络可以在城市数据集上进行训练后应用于越野场景。具体来说,我们的方法采用了隐式和显式知识增强相结合的策略。此外,利用基于归因分析的网络调制将领域知识整合到深度学习模型中,从而降低模型对虚假相关的敏感性。此外,我们提出了新的网络调制性能指标来评估调制校正的效率和效益。为了进行实验验证,我们分别在训练和测试阶段使用了城市和越野环境之间存在显著差异的数据集。此外,我们还发布了自采集的越野数据集PCSS-Gobi-3D,这是戈壁沙漠场景的第一个点云数据集。与其他领域自适应方法相比,我们的方法在平均交联数(Intersection over Union, mIoU)和总体精度方面表现出了良好的跨领域3D语义分割性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Pattern Recognition
Pattern Recognition 工程技术-工程:电子与电气
CiteScore
14.40
自引率
16.20%
发文量
683
审稿时长
5.6 months
期刊介绍: The field of Pattern Recognition is both mature and rapidly evolving, playing a crucial role in various related fields such as computer vision, image processing, text analysis, and neural networks. It closely intersects with machine learning and is being applied in emerging areas like biometrics, bioinformatics, multimedia data analysis, and data science. The journal Pattern Recognition, established half a century ago during the early days of computer science, has since grown significantly in scope and influence.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信