QuadWindow：街景图像几何窗口提取的视角感知框架

IF 2 Q3 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Engineering reports : open access Pub Date : 2025-07-17 DOI:10.1002/eng2.70294

Zhuangqun Niu, Ke Xi, Yifan Liao, Pengjie Tao, Tao Ke

{"title":"QuadWindow：街景图像几何窗口提取的视角感知框架","authors":"Zhuangqun Niu, Ke Xi, Yifan Liao, Pengjie Tao, Tao Ke","doi":"10.1002/eng2.70294","DOIUrl":null,"url":null,"abstract":"<p>Rapid and reliable assessment of building damage is essential for post-disaster response and recovery. As windows often reflect critical structural changes, their automatic extraction from street-view images provides valuable insights for emergency assessment, urban risk modeling, and disaster database updates. Existing methods struggle to leverage the quadrilateral prior of windows due to two main issues: poor handling of perspective distortion and the lack of robust loss functions when precise vector annotations are unavailable. To overcome these challenges, we introduce QuadWindow, a framework specifically designed to handle perspective distortions through a perspective transformation sub-network that predicts transformations from street-view images to frontal views, significantly simplifying window extraction tasks without manual correction. Additionally, we propose a differentiable rendering loss that directly aligns predicted quadrangles with raster-based ground truth, bypassing the need for explicit corner-point annotations. Experimental results demonstrate that QuadWindow outperforms state-of-the-art methods across five façade datasets, with an average F1-score of 87.6% and Intersection over Union (IoU) of 78.03%, achieving 1.47% and 5.2% improvement, respectively.</p>","PeriodicalId":72922,"journal":{"name":"Engineering reports : open access","volume":"7 7","pages":""},"PeriodicalIF":2.0000,"publicationDate":"2025-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/eng2.70294","citationCount":"0","resultStr":"{\"title\":\"QuadWindow: A Perspective-Aware Framework for Geometric Window Extraction From Street-View Imagery\",\"authors\":\"Zhuangqun Niu, Ke Xi, Yifan Liao, Pengjie Tao, Tao Ke\",\"doi\":\"10.1002/eng2.70294\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Rapid and reliable assessment of building damage is essential for post-disaster response and recovery. As windows often reflect critical structural changes, their automatic extraction from street-view images provides valuable insights for emergency assessment, urban risk modeling, and disaster database updates. Existing methods struggle to leverage the quadrilateral prior of windows due to two main issues: poor handling of perspective distortion and the lack of robust loss functions when precise vector annotations are unavailable. To overcome these challenges, we introduce QuadWindow, a framework specifically designed to handle perspective distortions through a perspective transformation sub-network that predicts transformations from street-view images to frontal views, significantly simplifying window extraction tasks without manual correction. Additionally, we propose a differentiable rendering loss that directly aligns predicted quadrangles with raster-based ground truth, bypassing the need for explicit corner-point annotations. Experimental results demonstrate that QuadWindow outperforms state-of-the-art methods across five façade datasets, with an average F1-score of 87.6% and Intersection over Union (IoU) of 78.03%, achieving 1.47% and 5.2% improvement, respectively.</p>\",\"PeriodicalId\":72922,\"journal\":{\"name\":\"Engineering reports : open access\",\"volume\":\"7 7\",\"pages\":\"\"},\"PeriodicalIF\":2.0000,\"publicationDate\":\"2025-07-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1002/eng2.70294\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Engineering reports : open access\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/eng2.70294\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering reports : open access","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/eng2.70294","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}

引用次数: 0

摘要

快速、可靠地评估建筑物受损情况对于灾后应对和恢复至关重要。由于窗户经常反映关键的结构变化，因此从街景图像中自动提取窗户可为紧急情况评估、城市风险建模和灾害数据库更新提供有价值的见解。现有的方法难以利用窗口的四边形先验，主要有两个问题：当无法获得精确的矢量注释时，透视失真处理不良和缺乏鲁棒损失函数。为了克服这些挑战，我们引入了QuadWindow，这是一个专门设计用于处理视角失真的框架，通过一个视角转换子网络来预测从街景图像到正面视图的转换，大大简化了窗口提取任务，无需手动校正。此外，我们提出了一种可微分的渲染损失，它直接将预测的四边形与基于栅格的地面真值对齐，而不需要显式的角点注释。实验结果表明，在5个farade数据集上，QuadWindow优于最先进的方法，平均f1得分为87.6%，交叉比联合（IoU）得分为78.03%，分别提高了1.47%和5.2%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

QuadWindow: A Perspective-Aware Framework for Geometric Window Extraction From Street-View Imagery

查看原文本刊更多论文

QuadWindow: A Perspective-Aware Framework for Geometric Window Extraction From Street-View Imagery

Rapid and reliable assessment of building damage is essential for post-disaster response and recovery. As windows often reflect critical structural changes, their automatic extraction from street-view images provides valuable insights for emergency assessment, urban risk modeling, and disaster database updates. Existing methods struggle to leverage the quadrilateral prior of windows due to two main issues: poor handling of perspective distortion and the lack of robust loss functions when precise vector annotations are unavailable. To overcome these challenges, we introduce QuadWindow, a framework specifically designed to handle perspective distortions through a perspective transformation sub-network that predicts transformations from street-view images to frontal views, significantly simplifying window extraction tasks without manual correction. Additionally, we propose a differentiable rendering loss that directly aligns predicted quadrangles with raster-based ground truth, bypassing the need for explicit corner-point annotations. Experimental results demonstrate that QuadWindow outperforms state-of-the-art methods across five façade datasets, with an average F1-score of 87.6% and Intersection over Union (IoU) of 78.03%, achieving 1.47% and 5.2% improvement, respectively.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Engineering reports : open access

CiteScore

5.10

自引率

0.00%

发文量

审稿时长

19 weeks