AdaLN:一种多领域学习和灾前建筑信息提取的视觉转换器

IF 4.7 2区 工程技术 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS
Yunhui Guo, Chaofeng Wang, Stella X. Yu, F. McKenna, K. Law
{"title":"AdaLN:一种多领域学习和灾前建筑信息提取的视觉转换器","authors":"Yunhui Guo, Chaofeng Wang, Stella X. Yu, F. McKenna, K. Law","doi":"10.1061/(asce)cp.1943-5487.0001034","DOIUrl":null,"url":null,"abstract":": Satellite and street view images are widely used in various disciplines as a source of information for understanding the built environment. In natural hazard engineering, high-quality building inventory data sets are crucial for the simulation of hazard impacts and for supporting decision-making. Screening the building stocks to gather the information for simulation and to detect potential structural defects that are vulnerable to natural hazards is a time-consuming and labor-intensive task. This paper presents an automated method for extracting building information through the use of satellite and street view images. The method is built upon a novel transformer-based deep neural network we developed. Specifically, a multidomain learning approach is employed to develop a single compact model for multiple image-based deep learning information extraction tasks using multiple data sources (e.g., satellite and street view images). Our multidomain Vision Transformer is designed as a unified architecture that can be effectively deployed for multiple classification tasks. The effectiveness of the proposed approach is demonstrated in a case study in which we use pretrained models to collect regional-scale building information that is related to natural hazard risks. DOI: 10.1061/(ASCE)CP.1943-5487.0001034. © 2022 American Society of Civil Engineers.","PeriodicalId":50221,"journal":{"name":"Journal of Computing in Civil Engineering","volume":"241 1","pages":""},"PeriodicalIF":4.7000,"publicationDate":"2022-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"AdaLN: A Vision Transformer for Multidomain Learning and Predisaster Building Information Extraction from Images\",\"authors\":\"Yunhui Guo, Chaofeng Wang, Stella X. Yu, F. McKenna, K. Law\",\"doi\":\"10.1061/(asce)cp.1943-5487.0001034\",\"DOIUrl\":null,\"url\":null,\"abstract\":\": Satellite and street view images are widely used in various disciplines as a source of information for understanding the built environment. In natural hazard engineering, high-quality building inventory data sets are crucial for the simulation of hazard impacts and for supporting decision-making. Screening the building stocks to gather the information for simulation and to detect potential structural defects that are vulnerable to natural hazards is a time-consuming and labor-intensive task. This paper presents an automated method for extracting building information through the use of satellite and street view images. The method is built upon a novel transformer-based deep neural network we developed. Specifically, a multidomain learning approach is employed to develop a single compact model for multiple image-based deep learning information extraction tasks using multiple data sources (e.g., satellite and street view images). Our multidomain Vision Transformer is designed as a unified architecture that can be effectively deployed for multiple classification tasks. The effectiveness of the proposed approach is demonstrated in a case study in which we use pretrained models to collect regional-scale building information that is related to natural hazard risks. DOI: 10.1061/(ASCE)CP.1943-5487.0001034. © 2022 American Society of Civil Engineers.\",\"PeriodicalId\":50221,\"journal\":{\"name\":\"Journal of Computing in Civil Engineering\",\"volume\":\"241 1\",\"pages\":\"\"},\"PeriodicalIF\":4.7000,\"publicationDate\":\"2022-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Computing in Civil Engineering\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1061/(asce)cp.1943-5487.0001034\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Computing in Civil Engineering","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1061/(asce)cp.1943-5487.0001034","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 5

摘要

:卫星和街景图像被广泛应用于各个学科,作为了解建筑环境的信息来源。在自然灾害工程中,高质量的建筑清单数据集对于模拟灾害影响和支持决策至关重要。筛选建筑物库存以收集模拟信息并检测易受自然灾害影响的潜在结构缺陷是一项耗时且劳动密集型的任务。本文提出了一种利用卫星和街景图像自动提取建筑物信息的方法。该方法是建立在我们开发的一种新颖的基于变压器的深度神经网络之上的。具体来说,采用多领域学习方法开发了一个单一的紧凑模型,用于使用多个数据源(例如卫星和街景图像)的多个基于图像的深度学习信息提取任务。我们的多域视觉转换器被设计成一个统一的体系结构,可以有效地部署在多个分类任务中。在一个案例研究中,我们使用预训练模型来收集与自然灾害风险相关的区域尺度建筑信息,证明了所提出方法的有效性。DOI: 10.1061 /(第3期)cp.1943 - 5487.0001034。©2022美国土木工程师学会。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
AdaLN: A Vision Transformer for Multidomain Learning and Predisaster Building Information Extraction from Images
: Satellite and street view images are widely used in various disciplines as a source of information for understanding the built environment. In natural hazard engineering, high-quality building inventory data sets are crucial for the simulation of hazard impacts and for supporting decision-making. Screening the building stocks to gather the information for simulation and to detect potential structural defects that are vulnerable to natural hazards is a time-consuming and labor-intensive task. This paper presents an automated method for extracting building information through the use of satellite and street view images. The method is built upon a novel transformer-based deep neural network we developed. Specifically, a multidomain learning approach is employed to develop a single compact model for multiple image-based deep learning information extraction tasks using multiple data sources (e.g., satellite and street view images). Our multidomain Vision Transformer is designed as a unified architecture that can be effectively deployed for multiple classification tasks. The effectiveness of the proposed approach is demonstrated in a case study in which we use pretrained models to collect regional-scale building information that is related to natural hazard risks. DOI: 10.1061/(ASCE)CP.1943-5487.0001034. © 2022 American Society of Civil Engineers.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Computing in Civil Engineering
Journal of Computing in Civil Engineering 工程技术-工程:土木
CiteScore
11.90
自引率
7.20%
发文量
58
审稿时长
6 months
期刊介绍: The Journal of Computing in Civil Engineering serves as a resource to researchers, practitioners, and students on advances and innovative ideas in computing as applicable to the engineering profession. Many such ideas emerge from recent developments in computer science, information science, computer engineering, knowledge engineering, and other technical fields. Some examples are innovations in artificial intelligence, parallel processing, distributed computing, graphics and imaging, and information technology. The journal publishes research, implementation, and applications in cross-disciplinary areas including software, such as new programming languages, database-management systems, computer-aided design systems, and expert systems; hardware for robotics, bar coding, remote sensing, data mining, and knowledge acquisition; and strategic issues such as the management of computing resources, implementation strategies, and organizational impacts.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信