AdaLN:一种多领域学习和灾前建筑信息提取的视觉转换器

IF 4.7 2区工程技术 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Journal of Computing in Civil Engineering Pub Date : 2022-09-01 DOI:10.1061/(asce)cp.1943-5487.0001034

Yunhui Guo, Chaofeng Wang, Stella X. Yu, F. McKenna, K. Law

{"title":"AdaLN:一种多领域学习和灾前建筑信息提取的视觉转换器","authors":"Yunhui Guo, Chaofeng Wang, Stella X. Yu, F. McKenna, K. Law","doi":"10.1061/(asce)cp.1943-5487.0001034","DOIUrl":null,"url":null,"abstract":": Satellite and street view images are widely used in various disciplines as a source of information for understanding the built environment. In natural hazard engineering, high-quality building inventory data sets are crucial for the simulation of hazard impacts and for supporting decision-making. Screening the building stocks to gather the information for simulation and to detect potential structural defects that are vulnerable to natural hazards is a time-consuming and labor-intensive task. This paper presents an automated method for extracting building information through the use of satellite and street view images. The method is built upon a novel transformer-based deep neural network we developed. Specifically, a multidomain learning approach is employed to develop a single compact model for multiple image-based deep learning information extraction tasks using multiple data sources (e.g., satellite and street view images). Our multidomain Vision Transformer is designed as a unified architecture that can be effectively deployed for multiple classification tasks. The effectiveness of the proposed approach is demonstrated in a case study in which we use pretrained models to collect regional-scale building information that is related to natural hazard risks. DOI: 10.1061/(ASCE)CP.1943-5487.0001034. © 2022 American Society of Civil Engineers.","PeriodicalId":50221,"journal":{"name":"Journal of Computing in Civil Engineering","volume":"241 1","pages":""},"PeriodicalIF":4.7000,"publicationDate":"2022-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"AdaLN: A Vision Transformer for Multidomain Learning and Predisaster Building Information Extraction from Images\",\"authors\":\"Yunhui Guo, Chaofeng Wang, Stella X. Yu, F. McKenna, K. Law\",\"doi\":\"10.1061/(asce)cp.1943-5487.0001034\",\"DOIUrl\":null,\"url\":null,\"abstract\":\": Satellite and street view images are widely used in various disciplines as a source of information for understanding the built environment. In natural hazard engineering, high-quality building inventory data sets are crucial for the simulation of hazard impacts and for supporting decision-making. Screening the building stocks to gather the information for simulation and to detect potential structural defects that are vulnerable to natural hazards is a time-consuming and labor-intensive task. This paper presents an automated method for extracting building information through the use of satellite and street view images. The method is built upon a novel transformer-based deep neural network we developed. Specifically, a multidomain learning approach is employed to develop a single compact model for multiple image-based deep learning information extraction tasks using multiple data sources (e.g., satellite and street view images). Our multidomain Vision Transformer is designed as a unified architecture that can be effectively deployed for multiple classification tasks. The effectiveness of the proposed approach is demonstrated in a case study in which we use pretrained models to collect regional-scale building information that is related to natural hazard risks. DOI: 10.1061/(ASCE)CP.1943-5487.0001034. © 2022 American Society of Civil Engineers.\",\"PeriodicalId\":50221,\"journal\":{\"name\":\"Journal of Computing in Civil Engineering\",\"volume\":\"241 1\",\"pages\":\"\"},\"PeriodicalIF\":4.7000,\"publicationDate\":\"2022-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Computing in Civil Engineering\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1061/(asce)cp.1943-5487.0001034\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Computing in Civil Engineering","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1061/(asce)cp.1943-5487.0001034","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}

引用次数: 5

摘要

:卫星和街景图像被广泛应用于各个学科，作为了解建筑环境的信息来源。在自然灾害工程中，高质量的建筑清单数据集对于模拟灾害影响和支持决策至关重要。筛选建筑物库存以收集模拟信息并检测易受自然灾害影响的潜在结构缺陷是一项耗时且劳动密集型的任务。本文提出了一种利用卫星和街景图像自动提取建筑物信息的方法。该方法是建立在我们开发的一种新颖的基于变压器的深度神经网络之上的。具体来说，采用多领域学习方法开发了一个单一的紧凑模型，用于使用多个数据源(例如卫星和街景图像)的多个基于图像的深度学习信息提取任务。我们的多域视觉转换器被设计成一个统一的体系结构，可以有效地部署在多个分类任务中。在一个案例研究中，我们使用预训练模型来收集与自然灾害风险相关的区域尺度建筑信息，证明了所提出方法的有效性。DOI: 10.1061 /(第3期)cp.1943 - 5487.0001034。©2022美国土木工程师学会。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

AdaLN: A Vision Transformer for Multidomain Learning and Predisaster Building Information Extraction from Images

: Satellite and street view images are widely used in various disciplines as a source of information for understanding the built environment. In natural hazard engineering, high-quality building inventory data sets are crucial for the simulation of hazard impacts and for supporting decision-making. Screening the building stocks to gather the information for simulation and to detect potential structural defects that are vulnerable to natural hazards is a time-consuming and labor-intensive task. This paper presents an automated method for extracting building information through the use of satellite and street view images. The method is built upon a novel transformer-based deep neural network we developed. Specifically, a multidomain learning approach is employed to develop a single compact model for multiple image-based deep learning information extraction tasks using multiple data sources (e.g., satellite and street view images). Our multidomain Vision Transformer is designed as a unified architecture that can be effectively deployed for multiple classification tasks. The effectiveness of the proposed approach is demonstrated in a case study in which we use pretrained models to collect regional-scale building information that is related to natural hazard risks. DOI: 10.1061/(ASCE)CP.1943-5487.0001034. © 2022 American Society of Civil Engineers.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Journal of Computing in Civil Engineering 工程技术-工程：土木

CiteScore

11.90

自引率

7.20%

发文量

审稿时长

6 months

期刊介绍： The Journal of Computing in Civil Engineering serves as a resource to researchers, practitioners, and students on advances and innovative ideas in computing as applicable to the engineering profession. Many such ideas emerge from recent developments in computer science, information science, computer engineering, knowledge engineering, and other technical fields. Some examples are innovations in artificial intelligence, parallel processing, distributed computing, graphics and imaging, and information technology. The journal publishes research, implementation, and applications in cross-disciplinary areas including software, such as new programming languages, database-management systems, computer-aided design systems, and expert systems; hardware for robotics, bar coding, remote sensing, data mining, and knowledge acquisition; and strategic issues such as the management of computing resources, implementation strategies, and organizational impacts.