Gated multi-source fusion with geometric sequence modeling for novel urban structure discovery

IF 12.2 1区 地球科学 Q1 GEOGRAPHY, PHYSICAL
Jing Du, John Zelek, Dedong Zhang, Jonathan Li
{"title":"Gated multi-source fusion with geometric sequence modeling for novel urban structure discovery","authors":"Jing Du, John Zelek, Dedong Zhang, Jonathan Li","doi":"10.1016/j.isprsjprs.2025.09.017","DOIUrl":null,"url":null,"abstract":"Novel Class Discovery (NCD) in 3D point cloud semantic segmentation presents critical challenges for urban management systems, where models must segment previously unseen object classes in rapidly evolving urban environments. Traditional 3D semantic segmentation models struggle to adapt to heterogeneous spatial characteristics and complex geometric structures of urban point clouds, limiting their ability to handle novel objects without extensive retraining. This paper introduces Adaptive Geometric Discovery Network (AGDNet), a comprehensive framework enhancing NCD through three key innovations: Adaptive Geometric Sequence Modeling module (AGSM), Dynamic Gaussian Embedding module (DGE), and Gated Multi-Source Feature Fusion module (GMSFF). AGSM addresses heterogeneous spatial characteristics through density-aware adaptive sampling, dynamic grouping, and multi-aspect geometric feature encoding. DGE represents point clouds as learnable 3D Gaussians parameterized by position, scale, orientation, and features, providing continuous probabilistic representations capturing both local geometric details and global spatial contexts. GMSFF integrates features from AGSM, DGE, and MinkowskiNet through context-aware gating mechanisms. The framework introduces three specialized knowledge transfer objectives for NCD: Prototype Relation Loss establishes semantic connections between known and novel class prototypes; Contrastive Alignment Loss creates instance-level semantic bridges; and Semantic Transfer Loss enables distribution-based knowledge propagation. These objectives bridge the semantic gap between known and novel categories while mitigating class imbalance challenges. Comprehensive evaluation on Toronto-3D, SemanticSTF, and SemanticPOSS datasets demonstrates significant improvements over state-of-the-art methods NOPS and CHNCD. For novel class discovery, the framework achieves average improvements of 6.47%/3.48%, 4.61%/3.12%, and 6.64%/4.24% in novel class mean Intersection over Union (mIoU) over NOPS/CHNCD respectively. For overall performance, improvements reach 6.59%/3.88%, 7.32%/4.80%, and 7.27%/4.62% in overall mIoU. These results validate the framework’s effectiveness for urban management, environmental monitoring, and infrastructure planning applications.","PeriodicalId":50269,"journal":{"name":"ISPRS Journal of Photogrammetry and Remote Sensing","volume":"32 1","pages":""},"PeriodicalIF":12.2000,"publicationDate":"2025-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ISPRS Journal of Photogrammetry and Remote Sensing","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1016/j.isprsjprs.2025.09.017","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GEOGRAPHY, PHYSICAL","Score":null,"Total":0}
引用次数: 0

Abstract

Novel Class Discovery (NCD) in 3D point cloud semantic segmentation presents critical challenges for urban management systems, where models must segment previously unseen object classes in rapidly evolving urban environments. Traditional 3D semantic segmentation models struggle to adapt to heterogeneous spatial characteristics and complex geometric structures of urban point clouds, limiting their ability to handle novel objects without extensive retraining. This paper introduces Adaptive Geometric Discovery Network (AGDNet), a comprehensive framework enhancing NCD through three key innovations: Adaptive Geometric Sequence Modeling module (AGSM), Dynamic Gaussian Embedding module (DGE), and Gated Multi-Source Feature Fusion module (GMSFF). AGSM addresses heterogeneous spatial characteristics through density-aware adaptive sampling, dynamic grouping, and multi-aspect geometric feature encoding. DGE represents point clouds as learnable 3D Gaussians parameterized by position, scale, orientation, and features, providing continuous probabilistic representations capturing both local geometric details and global spatial contexts. GMSFF integrates features from AGSM, DGE, and MinkowskiNet through context-aware gating mechanisms. The framework introduces three specialized knowledge transfer objectives for NCD: Prototype Relation Loss establishes semantic connections between known and novel class prototypes; Contrastive Alignment Loss creates instance-level semantic bridges; and Semantic Transfer Loss enables distribution-based knowledge propagation. These objectives bridge the semantic gap between known and novel categories while mitigating class imbalance challenges. Comprehensive evaluation on Toronto-3D, SemanticSTF, and SemanticPOSS datasets demonstrates significant improvements over state-of-the-art methods NOPS and CHNCD. For novel class discovery, the framework achieves average improvements of 6.47%/3.48%, 4.61%/3.12%, and 6.64%/4.24% in novel class mean Intersection over Union (mIoU) over NOPS/CHNCD respectively. For overall performance, improvements reach 6.59%/3.88%, 7.32%/4.80%, and 7.27%/4.62% in overall mIoU. These results validate the framework’s effectiveness for urban management, environmental monitoring, and infrastructure planning applications.
基于几何序列建模的门控多源融合新城市结构发现
3D点云语义分割中的新型类发现(NCD)对城市管理系统提出了严峻的挑战,其中模型必须在快速发展的城市环境中分割以前未见过的对象类。传统的三维语义分割模型难以适应城市点云的异构空间特征和复杂的几何结构,限制了其处理新对象的能力,而不需要大量的再训练。本文介绍了自适应几何发现网络(AGDNet),这是一个通过自适应几何序列建模模块(AGSM)、动态高斯嵌入模块(DGE)和门控多源特征融合模块(GMSFF)三个关键创新来增强NCD的综合框架。AGSM通过密度感知的自适应采样、动态分组和多向几何特征编码来处理异构空间特征。DGE将点云表示为可学习的三维高斯数据,通过位置、规模、方向和特征参数化,提供捕获局部几何细节和全局空间背景的连续概率表示。GMSFF通过上下文感知门控机制集成了AGSM、DGE和MinkowskiNet的特性。该框架为NCD引入了三个专门的知识转移目标:原型关系损失建立了已知和新类原型之间的语义联系;对比对齐损失创建实例级语义桥;语义转移损失实现了基于分布的知识传播。这些目标弥合了已知和新类别之间的语义差距,同时减轻了类不平衡的挑战。对Toronto-3D、SemanticSTF和SemanticPOSS数据集的综合评估表明,与最先进的方法NOPS和CHNCD相比,有了显著的改进。对于新类发现,该框架比NOPS/CHNCD在新类平均交联(Intersection over Union, mIoU)上分别实现了6.47%/3.48%、4.61%/3.12%和6.64%/4.24%的平均改进。在总体性能方面,mIoU的改进幅度分别为6.59%/3.88%、7.32%/4.80%和7.27%/4.62%。这些结果验证了该框架在城市管理、环境监测和基础设施规划应用方面的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
ISPRS Journal of Photogrammetry and Remote Sensing
ISPRS Journal of Photogrammetry and Remote Sensing 工程技术-成像科学与照相技术
CiteScore
21.00
自引率
6.30%
发文量
273
审稿时长
40 days
期刊介绍: The ISPRS Journal of Photogrammetry and Remote Sensing (P&RS) serves as the official journal of the International Society for Photogrammetry and Remote Sensing (ISPRS). It acts as a platform for scientists and professionals worldwide who are involved in various disciplines that utilize photogrammetry, remote sensing, spatial information systems, computer vision, and related fields. The journal aims to facilitate communication and dissemination of advancements in these disciplines, while also acting as a comprehensive source of reference and archive. P&RS endeavors to publish high-quality, peer-reviewed research papers that are preferably original and have not been published before. These papers can cover scientific/research, technological development, or application/practical aspects. Additionally, the journal welcomes papers that are based on presentations from ISPRS meetings, as long as they are considered significant contributions to the aforementioned fields. In particular, P&RS encourages the submission of papers that are of broad scientific interest, showcase innovative applications (especially in emerging fields), have an interdisciplinary focus, discuss topics that have received limited attention in P&RS or related journals, or explore new directions in scientific or professional realms. It is preferred that theoretical papers include practical applications, while papers focusing on systems and applications should include a theoretical background.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信