Advanced Engineering Informatics最新文献

筛选
英文 中文
MAPPO-ITD3-IMLFQ algorithm for multi-mobile robot path planning 多移动机器人路径规划的MAPPO-ITD3-IMLFQ算法
IF 8 1区 工程技术
Advanced Engineering Informatics Pub Date : 2025-05-01 DOI: 10.1016/j.aei.2025.103398
Likun Hu, Chunyou Wei, Linfei Yin
{"title":"MAPPO-ITD3-IMLFQ algorithm for multi-mobile robot path planning","authors":"Likun Hu,&nbsp;Chunyou Wei,&nbsp;Linfei Yin","doi":"10.1016/j.aei.2025.103398","DOIUrl":"10.1016/j.aei.2025.103398","url":null,"abstract":"<div><div>With the development of robotics, mobile robots (MRs) are widely applied in industrial and agricultural production. Reasonable path planning (PP) algorithms are the prerequisite for multi-mobile robot (MMR) systems to accomplish tasks. However, the existing PP algorithms of MMR systems still have the problems of being unable to dynamically assign tasks, not comprehensively considering the needs of kinematic constraints and dynamic obstacle avoidance, and poorly coordinating path conflicts. This study proposes a multi-agent proximal policy optimization-artificial potential field twin delayed deep deterministic policy gradient-improved multi-level feedback queue (MAPPO-ITD3-IMLFQ) algorithm for the PP of MMR systems. The proposed MAPPO-ITD3-IMLFQ algorithm combines the multi-agent proximal policy optimization (MAPPO) algorithm, the improved twin delayed deep deterministic policy gradient (ITD3) algorithm, and the improved multi-level feedback queue (IMLFQ) algorithm to form a PP algorithm for MMR system. The MRs apply the MAPPO algorithm to calculate task assignment (TA) schemes and provide sub-goal points for ITD3 algorithm. The MRs apply the ITD3 algorithm to calculate the path of the MRs. When the paths of different MRs conflict, the MR applies the IMLFQ algorithm to coordinate the movement of the MRs. The proposed MAPPO-ITD3-IMLFQ algorithm realizes the dynamic TA of the MMR system, meets the kinematic constraints and dynamic obstacle avoidance requirements of MRs, and coordinates path conflicts among the MRs. In this study, the proposed MAPPO-ITD3-IMLFQ algorithm is applied to different environments for the PP of MMRs. Experimental results show that: compared to the Hungarian algorithm and the genetic algorithm, the proposed MAPPO-ITD3-IMLFQ algorithm reduces the time spent on assigned tasks by 75.25 % and 77.44 %, respectively. Compared to the PP algorithms for reinforcement learning, the proposed MAPPO-ITD3-IMLFQ algorithm reduces the length of the planned path by 23.57 % on average.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"65 ","pages":"Article 103398"},"PeriodicalIF":8.0,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143887535","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Lightweight segmentation model for automated facade installation in high-rise buildings 高层建筑立面自动化安装的轻量化分段模型
IF 8 1区 工程技术
Advanced Engineering Informatics Pub Date : 2025-05-01 DOI: 10.1016/j.aei.2025.103374
Ahmed Yimam Hassen , Mehrdad Arashpour , Elahe Abdi
{"title":"Lightweight segmentation model for automated facade installation in high-rise buildings","authors":"Ahmed Yimam Hassen ,&nbsp;Mehrdad Arashpour ,&nbsp;Elahe Abdi","doi":"10.1016/j.aei.2025.103374","DOIUrl":"10.1016/j.aei.2025.103374","url":null,"abstract":"<div><div>The installation of curtain wall modules (CWM) in high-rise buildings is a complex task that poses significant safety risks due to manual labor, especially when working at great heights. Traditional methods are labor-intensive, time-consuming, and expose workers to hazards such as falls and equipment malfunctions. To mitigate these risks and enhance operational efficiency, automation and precise positioning of CWMs are essential. Accurate detection of installation locations becomes critical, as it enables crane operators or autonomous robots to position CWMs safely and precisely. This study introduces a novel approach utilizing semantic segmentation for detecting CWM installation locations. To address the challenges of deploying deep learning models on edge devices in construction environments, we propose Lightweight Attention Network (LANet), a lightweight, single-stream semantic segmentation architecture. LANet incorporates an optimized transformer module for global context modeling with linear complexity, enabling efficient feature extraction while maintaining computational efficiency. Additionally, we have developed a custom curtain wall dataset tailored for automating CWM installation, which was used to train and evaluate LANet. Experimental results demonstrate that LANet achieves competitive segmentation accuracy with only 1.92 million parameters, delivering real-time performance at 262 FPS on an RTX 3090 GPU and 19 FPS on a standard Intel i7 CPU. These results make LANet highly suitable for deployment in resource-constrained environments.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"65 ","pages":"Article 103374"},"PeriodicalIF":8.0,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143891487","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Font conversion for steel product number recognition: A conditioned diffusion model approach 钢产品编号识别的字体转换:条件扩散模型方法
IF 8 1区 工程技术
Advanced Engineering Informatics Pub Date : 2025-04-29 DOI: 10.1016/j.aei.2025.103368
Taehan Lee , Hyeyeon Choi , Bum Jun Kim , Hyeonah Jang , Donggeon Lee , Sang Woo Kim
{"title":"Font conversion for steel product number recognition: A conditioned diffusion model approach","authors":"Taehan Lee ,&nbsp;Hyeyeon Choi ,&nbsp;Bum Jun Kim ,&nbsp;Hyeonah Jang ,&nbsp;Donggeon Lee ,&nbsp;Sang Woo Kim","doi":"10.1016/j.aei.2025.103368","DOIUrl":"10.1016/j.aei.2025.103368","url":null,"abstract":"<div><div>In the steel manufacturing industry, it is crucial to automatically recognize semi-finished product numbers to avoid mix-ups and ensure that each product is processed according to its specific material properties. The advancement of deep learning has significantly improved the recognition of steel product numbers, particularly those printed by machines with consistent thickness and spacing, resulting in high recognition accuracy. Conversely, handwritten numbers by workers are often challenging to recognize due to varying thickness, spacing, being too thin, partially erased, or overwritten with scribbles. This inconsistency causes low recognition accuracy of steel product number recognition models for fonts with insufficient training data or fonts not seen during training. The models must be updated periodically whenever a new font is used and remain vulnerable to new fonts until sufficient data is accumulated and updated. In this paper, we propose a Font Changer that converts various fonts into a representative font to address these issues. Font Changer is designed to learn the trajectory from a Gaussian distribution to the data distribution of images generated in a representative font with clean background. Font Changer, composed of a conditional image encoder and a diffusion model, extracts location, size, and number information from the original image containing the steel product number. The extracted information is then used as a condition for the diffusion model, allowing it to generate the closest sample within the data distribution. Images processed by the Font Changer exhibit uniformity, ensuring the consistency of steel product number images. Experiments demonstrate that the Font Changer enhances number recognition by removing background noise and converting even messy and damaged images into a consistent representative font. Our proposed method advances the steel manufacturing industry by standardizing fonts in work environments with diverse handwritten fonts.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"65 ","pages":"Article 103368"},"PeriodicalIF":8.0,"publicationDate":"2025-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143882540","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Robust support vector machine based on the bounded asymmetric least squares loss function and its applications in noise corrupted data 基于有界非对称最小二乘损失函数的鲁棒支持向量机及其在噪声损坏数据中的应用
IF 8 1区 工程技术
Advanced Engineering Informatics Pub Date : 2025-04-29 DOI: 10.1016/j.aei.2025.103371
Jiaqi Zhang, Hu Yang
{"title":"Robust support vector machine based on the bounded asymmetric least squares loss function and its applications in noise corrupted data","authors":"Jiaqi Zhang,&nbsp;Hu Yang","doi":"10.1016/j.aei.2025.103371","DOIUrl":"10.1016/j.aei.2025.103371","url":null,"abstract":"<div><div>The support vector machine (SVM) is a popular machine learning tool that has achieved great success in various fields, but its performance is significantly disturbed on noise corrupted datasets. In this paper, motivated by the bounded quantile loss function, based on the relationship of the expectile and asymmetric least squares loss function, we propose the bounded asymmetric least squares loss function (<span><math><msub><mrow><mi>L</mi></mrow><mrow><mi>b</mi><mi>a</mi><mi>l</mi><mi>s</mi></mrow></msub></math></span> loss function). The <span><math><msub><mrow><mi>L</mi></mrow><mrow><mi>b</mi><mi>a</mi><mi>l</mi><mi>s</mi></mrow></msub></math></span> loss function is an extension of the asymmetric least squares loss function. <span><math><msub><mrow><mi>L</mi></mrow><mrow><mi>b</mi><mi>a</mi><mi>l</mi><mi>s</mi></mrow></msub></math></span> loss function inherits the good properties from the asymmetric least squares loss function, such as asymmetric and differentiable. Further, <span><math><msub><mrow><mi>L</mi></mrow><mrow><mi>b</mi><mi>a</mi><mi>l</mi><mi>s</mi></mrow></msub></math></span> loss function is more robust to noise in classification and regression problems. Next, we propose two models based on <span><math><msub><mrow><mi>L</mi></mrow><mrow><mi>b</mi><mi>a</mi><mi>l</mi><mi>s</mi></mrow></msub></math></span> loss function, namely, BALS-SVM and BALS-SVR. The <span><math><msub><mrow><mi>L</mi></mrow><mrow><mi>b</mi><mi>a</mi><mi>l</mi><mi>s</mi></mrow></msub></math></span> loss function is a non-convex loss function which makes it difficult to optimize. Thus, we design a clipping dual coordinate descent (clipDCD) based half-quadratic algorithm for solving the proposed models. We further find that BALS-SVM and BALS-SVR can be viewed as iterative weighted asymmetric least squares loss function based support vector machines and support vector regression, which enhances the interpretability of the models. Finally, we provide a theoretical analysis of the model based on a general framework of bounded loss function, mainly including Fisher consistency and noise insensitivity. Meanwhile, theoretical guarantees are provided for the proposed models. The results on the simulated dataset and the 14 classification and 11 regression benchmark dataset show that our method is superior compared to the classical methods and some state-of-the-art methods, especially on the noise corrupted dataset. The statistical tests further confirm this fact. Experiments on the Fashion MNIST dataset and gene expression dataset further illustrate that our proposed model also performs well in real environments.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"65 ","pages":"Article 103371"},"PeriodicalIF":8.0,"publicationDate":"2025-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143882538","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Climate-resilient epoxy asphalt mixture design: An intelligent framework 耐候性环氧沥青混合料设计:智能框架
IF 8 1区 工程技术
Advanced Engineering Informatics Pub Date : 2025-04-29 DOI: 10.1016/j.aei.2025.103395
Ke Zhang , Zhaohui Min , Wei Huang , Theunis F.P. Henning , Xiatong Hao , Kaimo Shao
{"title":"Climate-resilient epoxy asphalt mixture design: An intelligent framework","authors":"Ke Zhang ,&nbsp;Zhaohui Min ,&nbsp;Wei Huang ,&nbsp;Theunis F.P. Henning ,&nbsp;Xiatong Hao ,&nbsp;Kaimo Shao","doi":"10.1016/j.aei.2025.103395","DOIUrl":"10.1016/j.aei.2025.103395","url":null,"abstract":"<div><div>Epoxy asphalt mixture exhibits excellent durability, crack resistance and high-temperature stability, making it an ideal choice for climate-resilient pavement materials. In order to expand its application scope and maximize its advantage, it is necessary to propose more advanced mixture design method. This study proposed an intelligent design framework combining machine learning and metaheuristic algorithms for developing epoxy asphalt mixture. First, high-accuracy prediction models of the performance of epoxy asphalt mixture under high and low-temperature environments were established using Extreme Gradient Boosting optimized by Particle Swarm Optimization (PSO-XGBoost). Then, interpretability analysis, including feature importance and accumulated local effects, was conducted based on these models to identify the key design features of epoxy asphalt mixture and determine their empirical value ranges to achieve satisfactory mixture performance. Next, diversified strategies were determined to meet engineering needs, including high performance, low cost and carbon emissions, as well as a comprehensive strategy that incorporates all these objectives. Subsequently, multi-objective optimization models considering these strategies were established, and the optimal solutions were generated based on the Third Generation of Non-dominated Sorting Genetic Algorithm (NSGA-III) and TOPSIS. Finally, the practical feasibility of these solutions was confirmed through laboratory tests. Based on the proposed framework, high-performance, cost-effective, and environmentally sustainable epoxy asphalt mixtures can be obtained. This study sets a new benchmark for future research in the intelligent design of sustainable pavement materials, emphasizing the practical and theoretical implications of integrating advanced computational tools in pavement material science.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"65 ","pages":"Article 103395"},"PeriodicalIF":8.0,"publicationDate":"2025-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143882537","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Coupled prediction method for assembly precision and performance of composite structures based on a hybrid saint-venant’s principle and neural network approach 基于混合圣维南原理和神经网络的复合材料结构装配精度与性能耦合预测方法
IF 8 1区 工程技术
Advanced Engineering Informatics Pub Date : 2025-04-28 DOI: 10.1016/j.aei.2025.103401
Xin Tong, Jianfeng Yu, Dong Xue, He Zhang, baihui Gao, Jie Zhang, Yuan Li
{"title":"Coupled prediction method for assembly precision and performance of composite structures based on a hybrid saint-venant’s principle and neural network approach","authors":"Xin Tong,&nbsp;Jianfeng Yu,&nbsp;Dong Xue,&nbsp;He Zhang,&nbsp;baihui Gao,&nbsp;Jie Zhang,&nbsp;Yuan Li","doi":"10.1016/j.aei.2025.103401","DOIUrl":"10.1016/j.aei.2025.103401","url":null,"abstract":"<div><div>The application of composite materials and interference fit technology in aerospace products presents new challenges to assembly quality requirements: specifically, the demand for higher assembly precision and reduced assembly stress, as these factors directly impact the aerodynamic performance and service life of the product. Consequently, a large number of assembly deviation and stress predictions are necessary during the aircraft structure design process. To meet the requirements for prediction accuracy and efficiency under the constraints of large data volumes and high computational costs, this study proposes an innovative method for the rapid prediction of assembly precision and performance coupling (CPAP) in composite structures. This method combines Saint-Venant’s principle with finite element analysis (FEA) to create an efficient sample generation technique that can quickly provide key data on assembly deviations and stress around the interference fit holes (SAH). Additionally, dimensionality reduction techniques are incorporated into the metamodel (MM), effectively capturing the nonlinear relationships between assembly process parameters and both assembly precision and performance. This results in a predictive coupling model with statistical analysis capabilities. Case studies demonstrate that the method proposed in this study significantly improves prediction efficiency compared to traditional approaches. Furthermore, the results highlight the substantial influence of interference fit process parameters on the assembly accuracy and performance of single longitudinal splicing (SLS) joint structures. This research offers an effective tool for controlling the assembly quality of aerospace products, contributing to technological innovation and advancements in the aerospace industry.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"65 ","pages":"Article 103401"},"PeriodicalIF":8.0,"publicationDate":"2025-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143878895","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Concise belief rule base with credibility decay for system performance prediction 用于系统性能预测的具有可信度衰减的简明信念规则库
IF 8 1区 工程技术
Advanced Engineering Informatics Pub Date : 2025-04-27 DOI: 10.1016/j.aei.2025.103385
Jie Wang , Yaqian You , Zhijie Zhou , Peng Zhang
{"title":"Concise belief rule base with credibility decay for system performance prediction","authors":"Jie Wang ,&nbsp;Yaqian You ,&nbsp;Zhijie Zhou ,&nbsp;Peng Zhang","doi":"10.1016/j.aei.2025.103385","DOIUrl":"10.1016/j.aei.2025.103385","url":null,"abstract":"<div><div>In engineering scenarios, the performance of industrial systems varies continuously, making it necessary to develop a prediction model to track system performance. Recently, a modeling approach known as the concise belief rule base (CBRB) has provided an effective reference for performance prediction. However, CBRB ignores the decay phenomenon of information credibility during the prediction process, leading to suboptimal output accuracy. To address this limitation, a novel performance prediction model based on the concise belief rule base with credibility decay (CBRB-CD) is put forward. The proposed model incorporates a decay factor to reflect the property that the credibility of belief rules decays over time. Meanwhile, the decay factor is aggregated into the fusion process of belief rules, from which the prediction results are generated. Furthermore, a stability analysis of the prediction model is carried out by introducing external perturbations to validate the prediction results. The analysis results quantitatively reveal the changing patterns of prediction results under perturbed environments. Finally, real-world experiments on aerospace relays demonstrate the feasibility of the proposed model.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"65 ","pages":"Article 103385"},"PeriodicalIF":8.0,"publicationDate":"2025-04-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143877322","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Collaborative garment design through group chatting with generative industrial large models 通过生成性工业大模特群聊协同服装设计
IF 8 1区 工程技术
Advanced Engineering Informatics Pub Date : 2025-04-27 DOI: 10.1016/j.aei.2025.103366
Arjun Rachana Harish , Zhaolin Yuan , Ming Li , Hongxia Yang , George Q. Huang
{"title":"Collaborative garment design through group chatting with generative industrial large models","authors":"Arjun Rachana Harish ,&nbsp;Zhaolin Yuan ,&nbsp;Ming Li ,&nbsp;Hongxia Yang ,&nbsp;George Q. Huang","doi":"10.1016/j.aei.2025.103366","DOIUrl":"10.1016/j.aei.2025.103366","url":null,"abstract":"<div><div>The collaborative garment designing lifecycle involves stages such as designing, styling, and patterning. Some of these stages can be partially or fully automated using industrial large models (LMs), such as generative and large language models. The key to quick and cost-effective order fulfillment is the orchestration of group interactions, or a group chat, between the stakeholders and LMs in garment design. However, certain unaddressed aspects, such as knowledge retention, generalization, and complexity of group interaction, are critical to realizing group chat for garment design. This study proposes a framework called ChatFashion for group chat in garment design. Transformer, a core construct of the proposed framework, orchestrates interaction among stakeholders and industrial LMs. It undergoes an evolution with the intelligence it picks up from its interaction with diverse stakeholders and industrial LMs, allowing it to act as a one-stop solution for multidisciplinary design needs. This study contributes to theory in the following aspects. First, it proposes transformers to eliminate concerns about knowledge retention by industrial LMs. Second, while other studies focus on the benefits of industrial LMs to simplify individual stages in garment design, this study introduces the design and demonstration of a ChatFashion framework for collaborative garment designing using industrial LMs. Finally, this study advances the literature on prompt engineering of industrial LMs by utilizing collaborative learning (or models learning from each other) to capture and orchestrate the group chat among stakeholders, signifying its practicality and value for research in garment design.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"65 ","pages":"Article 103366"},"PeriodicalIF":8.0,"publicationDate":"2025-04-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143877324","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
GLoU-MiT: Lightweight Global-Local Mamba-Guided U-mix transformer for UAV-based pavement crack segmentation GLoU-MiT:用于无人机路面裂缝分割的轻型全局-局部曼巴制导U-mix变压器
IF 8 1区 工程技术
Advanced Engineering Informatics Pub Date : 2025-04-27 DOI: 10.1016/j.aei.2025.103384
Jinhuan Shan , Yue Huang , Wei Jiang , Dongdong Yuan , Feiyang Guo
{"title":"GLoU-MiT: Lightweight Global-Local Mamba-Guided U-mix transformer for UAV-based pavement crack segmentation","authors":"Jinhuan Shan ,&nbsp;Yue Huang ,&nbsp;Wei Jiang ,&nbsp;Dongdong Yuan ,&nbsp;Feiyang Guo","doi":"10.1016/j.aei.2025.103384","DOIUrl":"10.1016/j.aei.2025.103384","url":null,"abstract":"<div><div>The utility of Unmanned Aerial Vehicles (UAVs) for routine pavement distresses inspection has been increasingly recognized due to their efficiency, flexibility, safety, and low-cost automation. However, UAV-acquired high-altitude images present unique challenges for deep learning-based semantic segmentation models, such as minute crack details, blurred boundaries, and high levels of environmental noise. We propose GLoU-MiT, a lightweight segmentation model designed to address the difficulties of UAV-based pavement crack segmentation. Our model integrates a U-shaped Mix Transformer architecture for efficient hierarchical feature extraction, a Global-Local Mamba-Guided Skip Connection for improved feature alignment and computational efficiency, and a Boundary / Semantic Deep Supervision Refinement Module to enhance segmentation precision in complex scenarios. Extensive experiments on UAV-Crack500, CrackSC and Crack500 datasets demonstrate that GLoU-MiT effectively improves segmentation accuracy, particularly in low-contrast and complex background environments, making it a robust solution for UAV-based pavement crack inspection tasks. Furthermore, inference speed and energy consumption evaluations conducted on the Jetson Orin Nano (8 GB) show that our model achieves an excellent balance between accuracy, energy efficiency, and speed. The code will be released at: <span><span>https://github.com/SHAN-JH/GLoU-MiT</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"65 ","pages":"Article 103384"},"PeriodicalIF":8.0,"publicationDate":"2025-04-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143877323","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ThermoNeRF: A multimodal Neural Radiance Field for joint RGB-thermal novel view synthesis of building facades ThermoNeRF:用于建筑立面联合rgb -热新视图合成的多模态神经辐射场
IF 8 1区 工程技术
Advanced Engineering Informatics Pub Date : 2025-04-26 DOI: 10.1016/j.aei.2025.103345
Mariam Hassan , Florent Forest , Olga Fink , Malcolm Mielle
{"title":"ThermoNeRF: A multimodal Neural Radiance Field for joint RGB-thermal novel view synthesis of building facades","authors":"Mariam Hassan ,&nbsp;Florent Forest ,&nbsp;Olga Fink ,&nbsp;Malcolm Mielle","doi":"10.1016/j.aei.2025.103345","DOIUrl":"10.1016/j.aei.2025.103345","url":null,"abstract":"<div><div>Thermal scene reconstruction holds great potential for various applications, such as building energy analysis and non-destructive infrastructure testing. However, existing methods rely on dense scene measurements and use RGB images for 3D reconstruction, incorporating thermal data only through a post-hoc projection. Due to the lower resolution of thermal cameras and the challenges of RGB/Thermal camera calibration, this post-hoc projection often results in spatial discrepancies between temperatures projected onto the 3D model and real temperatures at the surface. We propose ThermoNeRF, a novel multimodal Neural Radiance Fields (NerF) that renders new RGB and thermal views of a scene with joint optimization of the geometry and thermal information while preventing cross-modal interference. To compensate for the lack of texture in thermal images, ThermoNeRF leverages paired RGB and thermal images to learn scene geometry while maintaining separate networks for reconstructing RGB color and temperature values, ensuring accurate and modality-specific representations. We also introduce ThermoScenes, a dataset of paired RGB+thermal images comprising 8 scenes of building facades and 8 scenes of everyday objects enabling evaluation in diverse scenarios. On ThermoScenes, ThermoNeRF achieves an average mean absolute error of 1.13 °C for buildings and 0.41 °C for other scenes when predicting temperatures of previously unobserved views. This improves accuracy by over 50% compared to concatenated RGB+thermal input in standard NeRF. While ThermoNeRF performs well on aligned RGB-thermal images, future work could address misaligned or unpaired data for better generalization. <span><span>Code</span><svg><path></path></svg></span> and <span><span>dataset</span><svg><path></path></svg></span> are available online.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"65 ","pages":"Article 103345"},"PeriodicalIF":8.0,"publicationDate":"2025-04-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143874436","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信