Shuoshuo Xu, Kai Zhao, James Loney, Zili Li, Andrea Visentin
{"title":"Image-based large language model approach to road pavement monitoring","authors":"Shuoshuo Xu, Kai Zhao, James Loney, Zili Li, Andrea Visentin","doi":"10.1111/mice.70075","DOIUrl":null,"url":null,"abstract":"Accurate and rapid assessment of pavement surface condition is essential for maintaining transportation safety and minimizing vehicle wear. Manual pavement inspections are subjective and time-consuming, and machine learning methods typically require large labeled datasets. This study introduces an innovative zero-shot learning method that leverages large language models’ (LLMs) image analysis and natural-language understanding capabilities for accurate road condition assessment. Prompts were designed in alignment with the pavement surface condition index criteria to generate multiple evaluation models, which were then compared against official scores to identify an optimized configuration. Tests conducted using Google Street View imagery indicate that the optimized LLM-based model achieves a mean absolute error of 1.07 on a 0–10 scale, outperforming expert evaluations. The proposed approach enables rapid, accurate, and consistent assessments without the need for labeled data, demonstrating the transformative role of LLMs in automating infrastructure monitoring and emphasizing the importance of structured prompt engineering for reliable performance.","PeriodicalId":156,"journal":{"name":"Computer-Aided Civil and Infrastructure Engineering","volume":"4 1","pages":""},"PeriodicalIF":9.1000,"publicationDate":"2025-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer-Aided Civil and Infrastructure Engineering","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1111/mice.70075","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Accurate and rapid assessment of pavement surface condition is essential for maintaining transportation safety and minimizing vehicle wear. Manual pavement inspections are subjective and time-consuming, and machine learning methods typically require large labeled datasets. This study introduces an innovative zero-shot learning method that leverages large language models’ (LLMs) image analysis and natural-language understanding capabilities for accurate road condition assessment. Prompts were designed in alignment with the pavement surface condition index criteria to generate multiple evaluation models, which were then compared against official scores to identify an optimized configuration. Tests conducted using Google Street View imagery indicate that the optimized LLM-based model achieves a mean absolute error of 1.07 on a 0–10 scale, outperforming expert evaluations. The proposed approach enables rapid, accurate, and consistent assessments without the need for labeled data, demonstrating the transformative role of LLMs in automating infrastructure monitoring and emphasizing the importance of structured prompt engineering for reliable performance.
期刊介绍:
Computer-Aided Civil and Infrastructure Engineering stands as a scholarly, peer-reviewed archival journal, serving as a vital link between advancements in computer technology and civil and infrastructure engineering. The journal serves as a distinctive platform for the publication of original articles, spotlighting novel computational techniques and inventive applications of computers. Specifically, it concentrates on recent progress in computer and information technologies, fostering the development and application of emerging computing paradigms.
Encompassing a broad scope, the journal addresses bridge, construction, environmental, highway, geotechnical, structural, transportation, and water resources engineering. It extends its reach to the management of infrastructure systems, covering domains such as highways, bridges, pavements, airports, and utilities. The journal delves into areas like artificial intelligence, cognitive modeling, concurrent engineering, database management, distributed computing, evolutionary computing, fuzzy logic, genetic algorithms, geometric modeling, internet-based technologies, knowledge discovery and engineering, machine learning, mobile computing, multimedia technologies, networking, neural network computing, optimization and search, parallel processing, robotics, smart structures, software engineering, virtual reality, and visualization techniques.