具有不同域偏移的地理空间点云语义分割的测试时间自适应

IF 12.2 1区 地球科学 Q1 GEOGRAPHY, PHYSICAL
Puzuo Wang , Wei Yao , Jie Shao , Zhiyi He
{"title":"具有不同域偏移的地理空间点云语义分割的测试时间自适应","authors":"Puzuo Wang ,&nbsp;Wei Yao ,&nbsp;Jie Shao ,&nbsp;Zhiyi He","doi":"10.1016/j.isprsjprs.2025.08.022","DOIUrl":null,"url":null,"abstract":"<div><div>Domain adaptation (DA) techniques aim to close the gap between source and target domains, enabling deep learning models to generalize across different data shift paradigms for point cloud semantic segmentation (PCSS). Among emerging DA schemes, test-time adaptation (TTA) facilitates direct adaptation of a pre-trained model to unlabeled data during the inference stage without access to source domain data and need for additional training process, which mitigates data privacy concerns and removes the requirement for substantial computational power. To fill the gap of leveraging TTA for geospatial PCSS, we introduce three typical domain shift paradigms in handling geospatial point clouds and construct three practical adaptation benchmarks, including photogrammetric point clouds to airborne LiDAR, airborne LiDAR to mobile LiDAR, and synthetic to mobile LiDAR. Then, a TTA method is proposed by exploiting the domain-specific knowledge embedded within the batch normalization (BN) layers. Given the pre-trained model, BN statistical information is progressively updated by fusing the statistics of each testing batch. Furthermore, we develop a self-supervised module to optimize the learnable BN affine parameters. Information maximization is used to generate confident and category-specific predictions, and reliability constrained pseudo-labeling is further incorporated to create supervisory signals. Extensive experimental analysis demonstrates that our proposed method significantly improves classification accuracy compared to directly applying the inference by up to 20% in terms of mIoU, which not only outperforms other popular counterparts but also maintains a high efficiency while avoiding retraining. In an adaptation of photogrammetric (SensatUrban) to airborne (Hessigheim 3D), our method achieves a mIoU of 59.46% and an OA of 85. 97%.</div></div>","PeriodicalId":50269,"journal":{"name":"ISPRS Journal of Photogrammetry and Remote Sensing","volume":"229 ","pages":"Pages 422-435"},"PeriodicalIF":12.2000,"publicationDate":"2025-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Test-time adaptation for geospatial point cloud semantic segmentation with distinct domain shifts\",\"authors\":\"Puzuo Wang ,&nbsp;Wei Yao ,&nbsp;Jie Shao ,&nbsp;Zhiyi He\",\"doi\":\"10.1016/j.isprsjprs.2025.08.022\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Domain adaptation (DA) techniques aim to close the gap between source and target domains, enabling deep learning models to generalize across different data shift paradigms for point cloud semantic segmentation (PCSS). Among emerging DA schemes, test-time adaptation (TTA) facilitates direct adaptation of a pre-trained model to unlabeled data during the inference stage without access to source domain data and need for additional training process, which mitigates data privacy concerns and removes the requirement for substantial computational power. To fill the gap of leveraging TTA for geospatial PCSS, we introduce three typical domain shift paradigms in handling geospatial point clouds and construct three practical adaptation benchmarks, including photogrammetric point clouds to airborne LiDAR, airborne LiDAR to mobile LiDAR, and synthetic to mobile LiDAR. Then, a TTA method is proposed by exploiting the domain-specific knowledge embedded within the batch normalization (BN) layers. Given the pre-trained model, BN statistical information is progressively updated by fusing the statistics of each testing batch. Furthermore, we develop a self-supervised module to optimize the learnable BN affine parameters. Information maximization is used to generate confident and category-specific predictions, and reliability constrained pseudo-labeling is further incorporated to create supervisory signals. Extensive experimental analysis demonstrates that our proposed method significantly improves classification accuracy compared to directly applying the inference by up to 20% in terms of mIoU, which not only outperforms other popular counterparts but also maintains a high efficiency while avoiding retraining. In an adaptation of photogrammetric (SensatUrban) to airborne (Hessigheim 3D), our method achieves a mIoU of 59.46% and an OA of 85. 97%.</div></div>\",\"PeriodicalId\":50269,\"journal\":{\"name\":\"ISPRS Journal of Photogrammetry and Remote Sensing\",\"volume\":\"229 \",\"pages\":\"Pages 422-435\"},\"PeriodicalIF\":12.2000,\"publicationDate\":\"2025-09-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ISPRS Journal of Photogrammetry and Remote Sensing\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0924271625003338\",\"RegionNum\":1,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"GEOGRAPHY, PHYSICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ISPRS Journal of Photogrammetry and Remote Sensing","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0924271625003338","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GEOGRAPHY, PHYSICAL","Score":null,"Total":0}
引用次数: 0

摘要

领域自适应(DA)技术旨在缩小源域和目标域之间的差距,使深度学习模型能够在点云语义分割(PCSS)中跨越不同的数据转移范式进行泛化。在新兴的数据分析方案中,测试时间自适应(TTA)有助于在推理阶段将预训练模型直接适应未标记的数据,而无需访问源域数据和需要额外的训练过程,从而减轻了数据隐私问题并消除了对大量计算能力的要求。为了填补利用TTA进行地理空间PCSS的空白,本文介绍了处理地理空间点云的三种典型域移范式,并构建了三个实用的自适应基准,包括摄影测量点云对机载激光雷达、机载激光雷达对移动激光雷达和合成激光雷达对移动激光雷达。然后,利用批归一化层中嵌入的领域特定知识,提出了一种TTA方法。给定预训练模型,通过融合每个测试批次的统计信息,逐步更新BN统计信息。此外,我们开发了一个自监督模块来优化可学习的BN仿射参数。信息最大化用于生成自信和特定类别的预测,并进一步结合可靠性约束伪标记来创建监督信号。大量的实验分析表明,与直接应用mIoU推理相比,我们提出的方法显著提高了分类准确率高达20%,不仅优于其他流行的同类方法,而且在避免再训练的同时保持了较高的效率。在将摄影测量(SensatUrban)应用于航空测量(hessighheim 3D)中,我们的方法实现了59.46%的mIoU和85的OA。97%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Test-time adaptation for geospatial point cloud semantic segmentation with distinct domain shifts
Domain adaptation (DA) techniques aim to close the gap between source and target domains, enabling deep learning models to generalize across different data shift paradigms for point cloud semantic segmentation (PCSS). Among emerging DA schemes, test-time adaptation (TTA) facilitates direct adaptation of a pre-trained model to unlabeled data during the inference stage without access to source domain data and need for additional training process, which mitigates data privacy concerns and removes the requirement for substantial computational power. To fill the gap of leveraging TTA for geospatial PCSS, we introduce three typical domain shift paradigms in handling geospatial point clouds and construct three practical adaptation benchmarks, including photogrammetric point clouds to airborne LiDAR, airborne LiDAR to mobile LiDAR, and synthetic to mobile LiDAR. Then, a TTA method is proposed by exploiting the domain-specific knowledge embedded within the batch normalization (BN) layers. Given the pre-trained model, BN statistical information is progressively updated by fusing the statistics of each testing batch. Furthermore, we develop a self-supervised module to optimize the learnable BN affine parameters. Information maximization is used to generate confident and category-specific predictions, and reliability constrained pseudo-labeling is further incorporated to create supervisory signals. Extensive experimental analysis demonstrates that our proposed method significantly improves classification accuracy compared to directly applying the inference by up to 20% in terms of mIoU, which not only outperforms other popular counterparts but also maintains a high efficiency while avoiding retraining. In an adaptation of photogrammetric (SensatUrban) to airborne (Hessigheim 3D), our method achieves a mIoU of 59.46% and an OA of 85. 97%.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
ISPRS Journal of Photogrammetry and Remote Sensing
ISPRS Journal of Photogrammetry and Remote Sensing 工程技术-成像科学与照相技术
CiteScore
21.00
自引率
6.30%
发文量
273
审稿时长
40 days
期刊介绍: The ISPRS Journal of Photogrammetry and Remote Sensing (P&RS) serves as the official journal of the International Society for Photogrammetry and Remote Sensing (ISPRS). It acts as a platform for scientists and professionals worldwide who are involved in various disciplines that utilize photogrammetry, remote sensing, spatial information systems, computer vision, and related fields. The journal aims to facilitate communication and dissemination of advancements in these disciplines, while also acting as a comprehensive source of reference and archive. P&RS endeavors to publish high-quality, peer-reviewed research papers that are preferably original and have not been published before. These papers can cover scientific/research, technological development, or application/practical aspects. Additionally, the journal welcomes papers that are based on presentations from ISPRS meetings, as long as they are considered significant contributions to the aforementioned fields. In particular, P&RS encourages the submission of papers that are of broad scientific interest, showcase innovative applications (especially in emerging fields), have an interdisciplinary focus, discuss topics that have received limited attention in P&RS or related journals, or explore new directions in scientific or professional realms. It is preferred that theoretical papers include practical applications, while papers focusing on systems and applications should include a theoretical background.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信