Urban multi-domain mixing (UMDMix) based unsupervised domain adaptation for LiDAR semantic segmentation

IF 6.5 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Neurocomputing Pub Date : 2025-09-16 DOI:10.1016/j.neucom.2025.131526

Anurag Nihal , Pyare Lal , Vaibhav Kumar

{"title":"Urban multi-domain mixing (UMDMix) based unsupervised domain adaptation for LiDAR semantic segmentation","authors":"Anurag Nihal , Pyare Lal , Vaibhav Kumar","doi":"10.1016/j.neucom.2025.131526","DOIUrl":null,"url":null,"abstract":"<div><div>3D semantic maps generated from Light Detection and Ranging (LiDAR) point clouds enable scene understanding in diverse applications such as autonomous driving and urban planning. However, existing deep learning models struggle when tested on different domains, worsened by limited labeled data. Unsupervised Domain Adaptation (UDA) can bridge this gap, but existing UDA methods often face adaptation challenges due to domain shifts arising from variations in the physical environment, data sparsity, and sensor differences. To address these limitations, we propose <em>UMDMix</em>, a novel UDA architecture that operates on the mixing of multiple labeled source domains with unlabeled target domains to make the predictive model robust to cross-domain variations. <em>UMDMix</em> integrates a teacher–student learning scheme to produce a robust teacher model and an adaptable student model. The performance of the teacher model in the source domain is further strengthened by a position-aware loss that assigns greater significance to semantically rich neighborhoods. A combination of entropy regularization and KL-divergence loss in the target domain updates the knowledge of the teacher model to the student model during adaptation. Our extensive experiments across diverse environments show that <em>UMDMix</em> achieves an average improvement of 13 % on minor classes such as bicycle, traffic sign, and person in target domain datasets, outperforming previous State-Of-The-Art (SOTA) UDA methods.</div></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":"657 ","pages":"Article 131526"},"PeriodicalIF":6.5000,"publicationDate":"2025-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neurocomputing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0925231225021988","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

3D semantic maps generated from Light Detection and Ranging (LiDAR) point clouds enable scene understanding in diverse applications such as autonomous driving and urban planning. However, existing deep learning models struggle when tested on different domains, worsened by limited labeled data. Unsupervised Domain Adaptation (UDA) can bridge this gap, but existing UDA methods often face adaptation challenges due to domain shifts arising from variations in the physical environment, data sparsity, and sensor differences. To address these limitations, we propose UMDMix, a novel UDA architecture that operates on the mixing of multiple labeled source domains with unlabeled target domains to make the predictive model robust to cross-domain variations. UMDMix integrates a teacher–student learning scheme to produce a robust teacher model and an adaptable student model. The performance of the teacher model in the source domain is further strengthened by a position-aware loss that assigns greater significance to semantically rich neighborhoods. A combination of entropy regularization and KL-divergence loss in the target domain updates the knowledge of the teacher model to the student model during adaptation. Our extensive experiments across diverse environments show that UMDMix achieves an average improvement of 13 % on minor classes such as bicycle, traffic sign, and person in target domain datasets, outperforming previous State-Of-The-Art (SOTA) UDA methods.

查看原文本刊更多论文

基于城市多域混合（UMDMix）的无监督域自适应激光雷达语义分割

由光探测和测距（LiDAR）点云生成的3D语义地图可以在自动驾驶和城市规划等各种应用中实现场景理解。然而，现有的深度学习模型在不同领域的测试中表现不佳，并且由于有限的标记数据而恶化。无监督域自适应（UDA）可以弥补这一差距，但由于物理环境的变化、数据稀疏性和传感器差异引起的域转移，现有的UDA方法经常面临适应性挑战。为了解决这些限制，我们提出了UMDMix，这是一种新颖的UDA架构，它可以混合多个标记的源域和未标记的目标域，使预测模型对跨域变化具有鲁棒性。UMDMix集成了一个师生学习方案，产生一个健壮的教师模型和一个适应性强的学生模型。教师模型在源域中的性能通过位置感知损失得到进一步加强，该损失对语义丰富的邻域赋予了更大的意义。在适应过程中，熵正则化和目标域kl -散度损失的结合将教师模型的知识更新到学生模型。我们在不同环境中的广泛实验表明，UMDMix在目标域数据集中，在自行车、交通标志和人等次要类别上实现了平均13%的改进，优于以前的最先进（SOTA） UDA方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Neurocomputing 工程技术-计算机：人工智能

CiteScore

13.10

自引率

10.00%

发文量

1382

审稿时长

70 days

期刊介绍： Neurocomputing publishes articles describing recent fundamental contributions in the field of neurocomputing. Neurocomputing theory, practice and applications are the essential topics being covered.