QuadWindow:街景图像几何窗口提取的视角感知框架

IF 2 Q3 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS
Zhuangqun Niu, Ke Xi, Yifan Liao, Pengjie Tao, Tao Ke
{"title":"QuadWindow:街景图像几何窗口提取的视角感知框架","authors":"Zhuangqun Niu,&nbsp;Ke Xi,&nbsp;Yifan Liao,&nbsp;Pengjie Tao,&nbsp;Tao Ke","doi":"10.1002/eng2.70294","DOIUrl":null,"url":null,"abstract":"<p>Rapid and reliable assessment of building damage is essential for post-disaster response and recovery. As windows often reflect critical structural changes, their automatic extraction from street-view images provides valuable insights for emergency assessment, urban risk modeling, and disaster database updates. Existing methods struggle to leverage the quadrilateral prior of windows due to two main issues: poor handling of perspective distortion and the lack of robust loss functions when precise vector annotations are unavailable. To overcome these challenges, we introduce QuadWindow, a framework specifically designed to handle perspective distortions through a perspective transformation sub-network that predicts transformations from street-view images to frontal views, significantly simplifying window extraction tasks without manual correction. Additionally, we propose a differentiable rendering loss that directly aligns predicted quadrangles with raster-based ground truth, bypassing the need for explicit corner-point annotations. Experimental results demonstrate that QuadWindow outperforms state-of-the-art methods across five façade datasets, with an average F1-score of 87.6% and Intersection over Union (IoU) of 78.03%, achieving 1.47% and 5.2% improvement, respectively.</p>","PeriodicalId":72922,"journal":{"name":"Engineering reports : open access","volume":"7 7","pages":""},"PeriodicalIF":2.0000,"publicationDate":"2025-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/eng2.70294","citationCount":"0","resultStr":"{\"title\":\"QuadWindow: A Perspective-Aware Framework for Geometric Window Extraction From Street-View Imagery\",\"authors\":\"Zhuangqun Niu,&nbsp;Ke Xi,&nbsp;Yifan Liao,&nbsp;Pengjie Tao,&nbsp;Tao Ke\",\"doi\":\"10.1002/eng2.70294\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Rapid and reliable assessment of building damage is essential for post-disaster response and recovery. As windows often reflect critical structural changes, their automatic extraction from street-view images provides valuable insights for emergency assessment, urban risk modeling, and disaster database updates. Existing methods struggle to leverage the quadrilateral prior of windows due to two main issues: poor handling of perspective distortion and the lack of robust loss functions when precise vector annotations are unavailable. To overcome these challenges, we introduce QuadWindow, a framework specifically designed to handle perspective distortions through a perspective transformation sub-network that predicts transformations from street-view images to frontal views, significantly simplifying window extraction tasks without manual correction. Additionally, we propose a differentiable rendering loss that directly aligns predicted quadrangles with raster-based ground truth, bypassing the need for explicit corner-point annotations. Experimental results demonstrate that QuadWindow outperforms state-of-the-art methods across five façade datasets, with an average F1-score of 87.6% and Intersection over Union (IoU) of 78.03%, achieving 1.47% and 5.2% improvement, respectively.</p>\",\"PeriodicalId\":72922,\"journal\":{\"name\":\"Engineering reports : open access\",\"volume\":\"7 7\",\"pages\":\"\"},\"PeriodicalIF\":2.0000,\"publicationDate\":\"2025-07-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1002/eng2.70294\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Engineering reports : open access\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/eng2.70294\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering reports : open access","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/eng2.70294","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

摘要

快速、可靠地评估建筑物受损情况对于灾后应对和恢复至关重要。由于窗户经常反映关键的结构变化,因此从街景图像中自动提取窗户可为紧急情况评估、城市风险建模和灾害数据库更新提供有价值的见解。现有的方法难以利用窗口的四边形先验,主要有两个问题:当无法获得精确的矢量注释时,透视失真处理不良和缺乏鲁棒损失函数。为了克服这些挑战,我们引入了QuadWindow,这是一个专门设计用于处理视角失真的框架,通过一个视角转换子网络来预测从街景图像到正面视图的转换,大大简化了窗口提取任务,无需手动校正。此外,我们提出了一种可微分的渲染损失,它直接将预测的四边形与基于栅格的地面真值对齐,而不需要显式的角点注释。实验结果表明,在5个farade数据集上,QuadWindow优于最先进的方法,平均f1得分为87.6%,交叉比联合(IoU)得分为78.03%,分别提高了1.47%和5.2%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

QuadWindow: A Perspective-Aware Framework for Geometric Window Extraction From Street-View Imagery

QuadWindow: A Perspective-Aware Framework for Geometric Window Extraction From Street-View Imagery

Rapid and reliable assessment of building damage is essential for post-disaster response and recovery. As windows often reflect critical structural changes, their automatic extraction from street-view images provides valuable insights for emergency assessment, urban risk modeling, and disaster database updates. Existing methods struggle to leverage the quadrilateral prior of windows due to two main issues: poor handling of perspective distortion and the lack of robust loss functions when precise vector annotations are unavailable. To overcome these challenges, we introduce QuadWindow, a framework specifically designed to handle perspective distortions through a perspective transformation sub-network that predicts transformations from street-view images to frontal views, significantly simplifying window extraction tasks without manual correction. Additionally, we propose a differentiable rendering loss that directly aligns predicted quadrangles with raster-based ground truth, bypassing the need for explicit corner-point annotations. Experimental results demonstrate that QuadWindow outperforms state-of-the-art methods across five façade datasets, with an average F1-score of 87.6% and Intersection over Union (IoU) of 78.03%, achieving 1.47% and 5.2% improvement, respectively.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
5.10
自引率
0.00%
发文量
0
审稿时长
19 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信