BFA-YOLO:用于建筑立面元素检测的平衡多尺度物体检测网络

IF 8 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Yangguang Chen , Tong Wang , Guanzhou Chen , Kun Zhu , Xiaoliang Tan , Jiaqi Wang , Wenchao Guo , Qing Wang , Xiaolong Luo , Xiaodong Zhang
{"title":"BFA-YOLO:用于建筑立面元素检测的平衡多尺度物体检测网络","authors":"Yangguang Chen ,&nbsp;Tong Wang ,&nbsp;Guanzhou Chen ,&nbsp;Kun Zhu ,&nbsp;Xiaoliang Tan ,&nbsp;Jiaqi Wang ,&nbsp;Wenchao Guo ,&nbsp;Qing Wang ,&nbsp;Xiaolong Luo ,&nbsp;Xiaodong Zhang","doi":"10.1016/j.aei.2025.103289","DOIUrl":null,"url":null,"abstract":"<div><div>The detection of façade elements on buildings, such as doors, windows, balconies, air conditioning units, billboards, and glass curtain walls, is a critical step in automating the creation of Building Information Modeling (BIM). However, this field faces significant challenges, including the uneven distribution of façade elements, the presence of small objects, and substantial background noise, which hamper detection accuracy. To address these issues, we developed the BFA-YOLO model and the BFA-3D dataset in this study. The BFA-YOLO model is an advanced architecture designed specifically for analyzing multi-view images of façade elements. It integrates three novel components: the Feature Balanced Spindle Module (FBSM) that tackles the issue of uneven object distribution; the Target Dynamic Alignment Task Detection Head (TDATH) that enhances the detection of small objects; and the Position Memory Enhanced Self-Attention Mechanism (PMESA), aimed at reducing the impact of background noise. These elements collectively enable BFA-YOLO to effectively address each challenge, thereby improving model robustness and detection precision. The BFA-3D dataset offers multi-view images with precise annotations across a wide range of façade element categories. This dataset is developed to address the limitations present in existing façade detection datasets, which often feature a single perspective and insufficient category coverage. Through comparative analysis, BFA-YOLO demonstrated improvements of 1.8% and 2.9% in mAP<sub>50</sub> on the BFA-3D dataset and the public Façade-WHU dataset, respectively, when compared to the baseline YOLOv8 model. These results highlight the superior performance of BFA-YOLO in façade element detection and the advancement of intelligent BIM technologies. The dataset and code are available at <span><span>https://github.com/CVEO/BFA-YOLO</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"65 ","pages":"Article 103289"},"PeriodicalIF":8.0000,"publicationDate":"2025-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"BFA-YOLO: A balanced multiscale object detection network for building façade elements detection\",\"authors\":\"Yangguang Chen ,&nbsp;Tong Wang ,&nbsp;Guanzhou Chen ,&nbsp;Kun Zhu ,&nbsp;Xiaoliang Tan ,&nbsp;Jiaqi Wang ,&nbsp;Wenchao Guo ,&nbsp;Qing Wang ,&nbsp;Xiaolong Luo ,&nbsp;Xiaodong Zhang\",\"doi\":\"10.1016/j.aei.2025.103289\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The detection of façade elements on buildings, such as doors, windows, balconies, air conditioning units, billboards, and glass curtain walls, is a critical step in automating the creation of Building Information Modeling (BIM). However, this field faces significant challenges, including the uneven distribution of façade elements, the presence of small objects, and substantial background noise, which hamper detection accuracy. To address these issues, we developed the BFA-YOLO model and the BFA-3D dataset in this study. The BFA-YOLO model is an advanced architecture designed specifically for analyzing multi-view images of façade elements. It integrates three novel components: the Feature Balanced Spindle Module (FBSM) that tackles the issue of uneven object distribution; the Target Dynamic Alignment Task Detection Head (TDATH) that enhances the detection of small objects; and the Position Memory Enhanced Self-Attention Mechanism (PMESA), aimed at reducing the impact of background noise. These elements collectively enable BFA-YOLO to effectively address each challenge, thereby improving model robustness and detection precision. The BFA-3D dataset offers multi-view images with precise annotations across a wide range of façade element categories. This dataset is developed to address the limitations present in existing façade detection datasets, which often feature a single perspective and insufficient category coverage. Through comparative analysis, BFA-YOLO demonstrated improvements of 1.8% and 2.9% in mAP<sub>50</sub> on the BFA-3D dataset and the public Façade-WHU dataset, respectively, when compared to the baseline YOLOv8 model. These results highlight the superior performance of BFA-YOLO in façade element detection and the advancement of intelligent BIM technologies. The dataset and code are available at <span><span>https://github.com/CVEO/BFA-YOLO</span><svg><path></path></svg></span>.</div></div>\",\"PeriodicalId\":50941,\"journal\":{\"name\":\"Advanced Engineering Informatics\",\"volume\":\"65 \",\"pages\":\"Article 103289\"},\"PeriodicalIF\":8.0000,\"publicationDate\":\"2025-04-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Advanced Engineering Informatics\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S147403462500182X\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advanced Engineering Informatics","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S147403462500182X","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

摘要

检测建筑物上的立面元素,如门、窗、阳台、空调机组、广告牌和玻璃幕墙,是自动化创建建筑信息模型(BIM)的关键步骤。然而,这一领域面临着巨大的挑战,包括表面元素分布不均匀、小物体的存在以及大量背景噪声,这些都会影响检测的准确性。为了解决这些问题,本研究开发了BFA-YOLO模型和BFA-3D数据集。BFA-YOLO模型是一种先进的体系结构,专门用于分析farade元素的多视图图像。它集成了三个新颖的组件:特征平衡主轴模块(FBSM),解决了物体分布不均匀的问题;目标动态对准任务检测头(TDATH),增强对小目标的检测;以及旨在降低背景噪声影响的位置记忆增强自我注意机制(PMESA)。这些元素共同使BFA-YOLO能够有效地应对每个挑战,从而提高模型的鲁棒性和检测精度。BFA-3D数据集在广泛的farade元素类别中提供具有精确注释的多视图图像。开发此数据集是为了解决现有farade检测数据集存在的局限性,这些数据集通常具有单一视角和类别覆盖不足的特点。通过对比分析,BFA-YOLO在BFA-3D数据集和公开farade - whu数据集上的mAP50分别比基线YOLOv8模型提高了1.8%和2.9%。这些结果凸显了BFA-YOLO在立面元素检测方面的优越性能和智能BIM技术的先进性。数据集和代码可在https://github.com/CVEO/BFA-YOLO上获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
BFA-YOLO: A balanced multiscale object detection network for building façade elements detection
The detection of façade elements on buildings, such as doors, windows, balconies, air conditioning units, billboards, and glass curtain walls, is a critical step in automating the creation of Building Information Modeling (BIM). However, this field faces significant challenges, including the uneven distribution of façade elements, the presence of small objects, and substantial background noise, which hamper detection accuracy. To address these issues, we developed the BFA-YOLO model and the BFA-3D dataset in this study. The BFA-YOLO model is an advanced architecture designed specifically for analyzing multi-view images of façade elements. It integrates three novel components: the Feature Balanced Spindle Module (FBSM) that tackles the issue of uneven object distribution; the Target Dynamic Alignment Task Detection Head (TDATH) that enhances the detection of small objects; and the Position Memory Enhanced Self-Attention Mechanism (PMESA), aimed at reducing the impact of background noise. These elements collectively enable BFA-YOLO to effectively address each challenge, thereby improving model robustness and detection precision. The BFA-3D dataset offers multi-view images with precise annotations across a wide range of façade element categories. This dataset is developed to address the limitations present in existing façade detection datasets, which often feature a single perspective and insufficient category coverage. Through comparative analysis, BFA-YOLO demonstrated improvements of 1.8% and 2.9% in mAP50 on the BFA-3D dataset and the public Façade-WHU dataset, respectively, when compared to the baseline YOLOv8 model. These results highlight the superior performance of BFA-YOLO in façade element detection and the advancement of intelligent BIM technologies. The dataset and code are available at https://github.com/CVEO/BFA-YOLO.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Advanced Engineering Informatics
Advanced Engineering Informatics 工程技术-工程:综合
CiteScore
12.40
自引率
18.20%
发文量
292
审稿时长
45 days
期刊介绍: Advanced Engineering Informatics is an international Journal that solicits research papers with an emphasis on 'knowledge' and 'engineering applications'. The Journal seeks original papers that report progress in applying methods of engineering informatics. These papers should have engineering relevance and help provide a scientific base for more reliable, spontaneous, and creative engineering decision-making. Additionally, papers should demonstrate the science of supporting knowledge-intensive engineering tasks and validate the generality, power, and scalability of new methods through rigorous evaluation, preferably both qualitatively and quantitatively. Abstracting and indexing for Advanced Engineering Informatics include Science Citation Index Expanded, Scopus and INSPEC.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信