面向遮挡感知立面解析的潜在空间特征学习

IF 7.1 1区工程技术 Q1 CONSTRUCTION & BUILDING TECHNOLOGY

Building and Environment Pub Date : 2025-04-12 DOI:10.1016/j.buildenv.2025.112955

Yi Zhong , Jie Jiang , Weize Quan , Mingyang Zhao , Dong-ming Yan

{"title":"面向遮挡感知立面解析的潜在空间特征学习","authors":"Yi Zhong , Jie Jiang , Weize Quan , Mingyang Zhao , Dong-ming Yan","doi":"10.1016/j.buildenv.2025.112955","DOIUrl":null,"url":null,"abstract":"<div><div>Significant strides in deep learning have propelled facade parsing in computer vision, a process that classifies architectural elements into semantic blocks. A key challenge is handling occlusions in facade images. Current methods struggle due to suboptimal use of multi-scale modules and insufficient differentiation of feature categories in latent space. Addressing this, we propose a novel multi-scale deep learning architecture, enhanced by a distinction loss function, to better capture multi-scale characteristics. This architecture includes a three-stream latent space feature enhancement structure: a main stream for primary processing and two auxiliary streams for feature refinement. Within the architecture, we utilize our designed dual-branch Context Aggregation Module to reconcile discrepancies between global and local features. We propose a distinction loss tailored to our network architecture, guiding the two auxiliary streams to concentrate on specific feature types, thereby enhancing discrimination and reducing confusion. On one hand, the proposed framework for distinct learning in the latent feature space introduces a novel learning paradigm for neural network training, where simple yet effective modifications to the loss function lead to performance optimization. On the other hand, the potential of our method to parse facades under occluded scenarios could be significantly impactful in engineering applications such as urban planning, architectural design, and autonomous driving. Our experiments on benchmark facade datasets demonstrate the superior performance of our approach in handling occlusions and effectively parsing facades, indicating the potential of our method to advance the application of facade parsing in increasingly complex scenarios.</div></div>","PeriodicalId":9273,"journal":{"name":"Building and Environment","volume":"279 ","pages":"Article 112955"},"PeriodicalIF":7.1000,"publicationDate":"2025-04-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Distinctive learning of latent space feature for occlusion-aware facade parsing\",\"authors\":\"Yi Zhong , Jie Jiang , Weize Quan , Mingyang Zhao , Dong-ming Yan\",\"doi\":\"10.1016/j.buildenv.2025.112955\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Significant strides in deep learning have propelled facade parsing in computer vision, a process that classifies architectural elements into semantic blocks. A key challenge is handling occlusions in facade images. Current methods struggle due to suboptimal use of multi-scale modules and insufficient differentiation of feature categories in latent space. Addressing this, we propose a novel multi-scale deep learning architecture, enhanced by a distinction loss function, to better capture multi-scale characteristics. This architecture includes a three-stream latent space feature enhancement structure: a main stream for primary processing and two auxiliary streams for feature refinement. Within the architecture, we utilize our designed dual-branch Context Aggregation Module to reconcile discrepancies between global and local features. We propose a distinction loss tailored to our network architecture, guiding the two auxiliary streams to concentrate on specific feature types, thereby enhancing discrimination and reducing confusion. On one hand, the proposed framework for distinct learning in the latent feature space introduces a novel learning paradigm for neural network training, where simple yet effective modifications to the loss function lead to performance optimization. On the other hand, the potential of our method to parse facades under occluded scenarios could be significantly impactful in engineering applications such as urban planning, architectural design, and autonomous driving. Our experiments on benchmark facade datasets demonstrate the superior performance of our approach in handling occlusions and effectively parsing facades, indicating the potential of our method to advance the application of facade parsing in increasingly complex scenarios.</div></div>\",\"PeriodicalId\":9273,\"journal\":{\"name\":\"Building and Environment\",\"volume\":\"279 \",\"pages\":\"Article 112955\"},\"PeriodicalIF\":7.1000,\"publicationDate\":\"2025-04-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Building and Environment\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0360132325004378\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CONSTRUCTION & BUILDING TECHNOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Building and Environment","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0360132325004378","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CONSTRUCTION & BUILDING TECHNOLOGY","Score":null,"Total":0}

引用次数: 0

摘要

深度学习的重大进展推动了计算机视觉中的立面解析，这是一个将建筑元素分类为语义块的过程。一个关键的挑战是处理立面图像中的遮挡。目前的方法由于多尺度模块的使用不理想和潜在空间中特征类别的区分不足而陷入困境。为了解决这个问题，我们提出了一种新的多尺度深度学习架构，通过区分损失函数增强，以更好地捕获多尺度特征。该体系结构包括一个三流潜空间特征增强结构：一个主流用于初级处理，两个辅助流用于特征细化。在体系结构中，我们利用我们设计的双分支上下文聚合模块来协调全局和局部特性之间的差异。我们提出了针对我们的网络架构量身定制的区分损失，引导两个辅助流专注于特定的特征类型，从而增强识别并减少混淆。一方面，提出的潜在特征空间中不同学习的框架为神经网络训练引入了一种新的学习范式，其中对损失函数的简单而有效的修改导致性能优化。另一方面，我们的方法在遮挡情况下解析立面的潜力可能会在城市规划、建筑设计和自动驾驶等工程应用中产生重大影响。我们在基准立面数据集上的实验证明了我们的方法在处理遮挡和有效解析立面方面的卓越性能，表明了我们的方法在日益复杂的场景中推进立面解析应用的潜力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Distinctive learning of latent space feature for occlusion-aware facade parsing

Significant strides in deep learning have propelled facade parsing in computer vision, a process that classifies architectural elements into semantic blocks. A key challenge is handling occlusions in facade images. Current methods struggle due to suboptimal use of multi-scale modules and insufficient differentiation of feature categories in latent space. Addressing this, we propose a novel multi-scale deep learning architecture, enhanced by a distinction loss function, to better capture multi-scale characteristics. This architecture includes a three-stream latent space feature enhancement structure: a main stream for primary processing and two auxiliary streams for feature refinement. Within the architecture, we utilize our designed dual-branch Context Aggregation Module to reconcile discrepancies between global and local features. We propose a distinction loss tailored to our network architecture, guiding the two auxiliary streams to concentrate on specific feature types, thereby enhancing discrimination and reducing confusion. On one hand, the proposed framework for distinct learning in the latent feature space introduces a novel learning paradigm for neural network training, where simple yet effective modifications to the loss function lead to performance optimization. On the other hand, the potential of our method to parse facades under occluded scenarios could be significantly impactful in engineering applications such as urban planning, architectural design, and autonomous driving. Our experiments on benchmark facade datasets demonstrate the superior performance of our approach in handling occlusions and effectively parsing facades, indicating the potential of our method to advance the application of facade parsing in increasingly complex scenarios.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Building and Environment 工程技术-工程：环境

CiteScore

12.50

自引率

23.00%

发文量

1130

审稿时长

27 days

期刊介绍： Building and Environment, an international journal, is dedicated to publishing original research papers, comprehensive review articles, editorials, and short communications in the fields of building science, urban physics, and human interaction with the indoor and outdoor built environment. The journal emphasizes innovative technologies and knowledge verified through measurement and analysis. It covers environmental performance across various spatial scales, from cities and communities to buildings and systems, fostering collaborative, multi-disciplinary research with broader significance.