{"title":"磁悬浮铁路系统智能维修的可视化问答","authors":"Gao-Feng Jiang , Su-Mei Wang , Yi-Qing Ni","doi":"10.1016/j.autcon.2025.106559","DOIUrl":null,"url":null,"abstract":"<div><div>Traditional methodologies effectively assess the railway components by categorizing them as either normal or abnormal. However, these methods provide limited insight into underlying conditions and the reasons for abnormalities, often lacking a comprehensive explanation. With advancements in multimodal feature learning, multimodal data are potentially integrated for various downstream tasks, such as visual question answering (VQA). This paper proposes a three-phase procedure for VQA-guided maintenance of maglev conditions, aiming to automatically recognize evidence of damage and failure using accumulated multimodal knowledge. As one of the early VQA datasets designed for railway condition monitoring, it is organized as image-question-answer tuples, where images are generated from time-frequency spectrograms, questions and answers are formulated based on maglev structural dynamic characteristics. The results indicate that the proposed model is reliable in answer accuracy and expression quality. This advancement contributes to forming intelligent decision-making processes in railway infrastructure, ultimately promoting safer and more efficient railway operations.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"180 ","pages":"Article 106559"},"PeriodicalIF":11.5000,"publicationDate":"2025-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Visual question answering for intelligent maintenance of maglev railway systems\",\"authors\":\"Gao-Feng Jiang , Su-Mei Wang , Yi-Qing Ni\",\"doi\":\"10.1016/j.autcon.2025.106559\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Traditional methodologies effectively assess the railway components by categorizing them as either normal or abnormal. However, these methods provide limited insight into underlying conditions and the reasons for abnormalities, often lacking a comprehensive explanation. With advancements in multimodal feature learning, multimodal data are potentially integrated for various downstream tasks, such as visual question answering (VQA). This paper proposes a three-phase procedure for VQA-guided maintenance of maglev conditions, aiming to automatically recognize evidence of damage and failure using accumulated multimodal knowledge. As one of the early VQA datasets designed for railway condition monitoring, it is organized as image-question-answer tuples, where images are generated from time-frequency spectrograms, questions and answers are formulated based on maglev structural dynamic characteristics. The results indicate that the proposed model is reliable in answer accuracy and expression quality. This advancement contributes to forming intelligent decision-making processes in railway infrastructure, ultimately promoting safer and more efficient railway operations.</div></div>\",\"PeriodicalId\":8660,\"journal\":{\"name\":\"Automation in Construction\",\"volume\":\"180 \",\"pages\":\"Article 106559\"},\"PeriodicalIF\":11.5000,\"publicationDate\":\"2025-09-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Automation in Construction\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0926580525005990\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CONSTRUCTION & BUILDING TECHNOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Automation in Construction","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0926580525005990","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CONSTRUCTION & BUILDING TECHNOLOGY","Score":null,"Total":0}
Visual question answering for intelligent maintenance of maglev railway systems
Traditional methodologies effectively assess the railway components by categorizing them as either normal or abnormal. However, these methods provide limited insight into underlying conditions and the reasons for abnormalities, often lacking a comprehensive explanation. With advancements in multimodal feature learning, multimodal data are potentially integrated for various downstream tasks, such as visual question answering (VQA). This paper proposes a three-phase procedure for VQA-guided maintenance of maglev conditions, aiming to automatically recognize evidence of damage and failure using accumulated multimodal knowledge. As one of the early VQA datasets designed for railway condition monitoring, it is organized as image-question-answer tuples, where images are generated from time-frequency spectrograms, questions and answers are formulated based on maglev structural dynamic characteristics. The results indicate that the proposed model is reliable in answer accuracy and expression quality. This advancement contributes to forming intelligent decision-making processes in railway infrastructure, ultimately promoting safer and more efficient railway operations.
期刊介绍:
Automation in Construction is an international journal that focuses on publishing original research papers related to the use of Information Technologies in various aspects of the construction industry. The journal covers topics such as design, engineering, construction technologies, and the maintenance and management of constructed facilities.
The scope of Automation in Construction is extensive and covers all stages of the construction life cycle. This includes initial planning and design, construction of the facility, operation and maintenance, as well as the eventual dismantling and recycling of buildings and engineering structures.