Advanced Engineering Informatics最新文献

筛选
英文 中文
IndVisSGG: VLM-based scene graph generation for industrial spatial intelligence
IF 8 1区 工程技术
Advanced Engineering Informatics Pub Date : 2025-01-11 DOI: 10.1016/j.aei.2024.103107
Zuoxu Wang , Zhijie Yan , Shufei Li , Jihong Liu
{"title":"IndVisSGG: VLM-based scene graph generation for industrial spatial intelligence","authors":"Zuoxu Wang ,&nbsp;Zhijie Yan ,&nbsp;Shufei Li ,&nbsp;Jihong Liu","doi":"10.1016/j.aei.2024.103107","DOIUrl":"10.1016/j.aei.2024.103107","url":null,"abstract":"<div><div>Industrial spatial intelligence enables robots and machine tools to understand environmental settings and their relationships, allowing them to manipulate target components. A crucial aspect of this process is scene graph generation (SGG). Previous research on SGG primarily focuses on detection and panoptic segmentation of objects, followed by the prediction of their pairwise relationships. However, these approaches struggle with generalization and transferability when encountering new scenarios. To tackle this problem, we propose the <em>Industrial Visual Scene Graph Generation</em> (IndVisSGG) method, which parses spatial and interactive relationships between objects in temporal industrial settings. This approach leverages the capabilities of <em>Vision-Language Models</em> (VLMs) to generate scene graphs quickly and accurately without any additional object annotations. Furthermore, leveraging the IndVisSGG method, we have implemented a meticulous annotation procedure to compile a high-quality <em>industrial scene graph generation</em> (ISG) dataset, comprising 10,000 images of manufacturing and related industrial scenes. Through comparisons with various scene graph generation methods and benchmarks across two other datasets, we have showcased the superiority of the IndVisSGG method and underscored the benefits of the ISG dataset over existing datasets.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"65 ","pages":"Article 103107"},"PeriodicalIF":8.0,"publicationDate":"2025-01-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143136490","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Collaborative path planning of multi-unmanned surface vehicles via multi-stage constrained multi-objective optimization
IF 8 1区 工程技术
Advanced Engineering Informatics Pub Date : 2025-01-10 DOI: 10.1016/j.aei.2025.103115
Shihong Yin , Ningjun Xu , Zhangsong Shi , Zhengrong Xiang
{"title":"Collaborative path planning of multi-unmanned surface vehicles via multi-stage constrained multi-objective optimization","authors":"Shihong Yin ,&nbsp;Ningjun Xu ,&nbsp;Zhangsong Shi ,&nbsp;Zhengrong Xiang","doi":"10.1016/j.aei.2025.103115","DOIUrl":"10.1016/j.aei.2025.103115","url":null,"abstract":"<div><div>A collaborative path planning algorithm based on a multi-stage constraint processing strategy is proposed for the task of unmanned surface vehicle (USV) cluster operation in complex water environments. The algorithm takes into account the distinct advantages of different USVs, the collaborative task time, and collision avoidance. Firstly, the objectives and constraints of the collaborative path planning problem for the USV cluster are modeled. Next, a path representation method with an adaptive number of waypoints is designed to improve the smoothness of the USV paths. Subsequently, a multi-stage constrained multi-objective optimization (MSCMO) algorithm is proposed to deal with the cooperative time and collision avoidance constraints of the USV cluster through a multi-stage strategy. Finally, eight collaborative operation scenarios for the USV cluster are designed to verify the performance of MSCMO. The simulation results demonstrate that MSCMO outperforms seven state-of-the-art constrained multi-objective algorithms, exhibiting a strong competitive advantage and superior overall performance. MSCMO enables USV clusters to perform collaborative tasks faster, safer, and smoother without violating any maneuvering constraints, while providing a variety of trade-off solutions for decision-makers. The source code is available at <span><span>https://github.com/Shihong-Yin/MSCMO-MUCP</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"65 ","pages":"Article 103115"},"PeriodicalIF":8.0,"publicationDate":"2025-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143136812","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Human–Robot collaboration in construction: Robot design, perception and Interaction, and task allocation and execution
IF 8 1区 工程技术
Advanced Engineering Informatics Pub Date : 2025-01-10 DOI: 10.1016/j.aei.2025.103109
Jiajing Liu , Hanbin Luo , Dongrui Wu
{"title":"Human–Robot collaboration in construction: Robot design, perception and Interaction, and task allocation and execution","authors":"Jiajing Liu ,&nbsp;Hanbin Luo ,&nbsp;Dongrui Wu","doi":"10.1016/j.aei.2025.103109","DOIUrl":"10.1016/j.aei.2025.103109","url":null,"abstract":"<div><div>Human–robot collaboration (HRC) is a vital area for enhancing safety and productivity in the construction industry. Despite its growing importance, current literature lacks comprehensive reviews on the technological and methodological advancements supporting the design and deployment of HRC systems in construction industry. This review aims to fill this gap by providing a detailed examination of recent progress in HRC technologies and methods within the construction industry, with a focus on three main areas: (1) collaborative robot design, (2) perception and interaction, and (3) task allocation and execution of HRC systems. The review highlights significant challenges in current construction research and underscores the necessity for future research to prioritize the development of: (1) exoskeleton robots designed for construction trades and non-exoskeleton collaborative robots with rigid-flexible-soft configurations; (2) multimodal perception and interactive methods; and (3) digital twin-based HRC systems. This review study not only addresses the existing gap but also identifies promising research avenues to promote the development of safe and efficient HRC systems in the construction industry.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"65 ","pages":"Article 103109"},"PeriodicalIF":8.0,"publicationDate":"2025-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143137303","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhancing understanding of asphalt mixture dynamic modulus prediction through interpretable machine learning method
IF 8 1区 工程技术
Advanced Engineering Informatics Pub Date : 2025-01-10 DOI: 10.1016/j.aei.2025.103111
Ke Zhang , Zhaohui Min , Xiatong Hao , Theunis F.P. Henning , Wei Huang
{"title":"Enhancing understanding of asphalt mixture dynamic modulus prediction through interpretable machine learning method","authors":"Ke Zhang ,&nbsp;Zhaohui Min ,&nbsp;Xiatong Hao ,&nbsp;Theunis F.P. Henning ,&nbsp;Wei Huang","doi":"10.1016/j.aei.2025.103111","DOIUrl":"10.1016/j.aei.2025.103111","url":null,"abstract":"<div><div>Dynamic modulus is a key parameter in pavement design and pavement mechanics analysis. It is essential to accurately predict dynamic modulus and study the relationships between influencing factors and dynamic modulus. In this study, a hybrid prediction model is developed based on Extreme Gradient Boosting (XGBoost) and Whale Optimization Algorithm (WOA). Based on this model, the effects of asphalt binder properties, test condition, asphalt mixture volume parameters, and asphalt mixture gradation on dynamic modulus are analyzed. The contribution of each variable to the model predictions is quantified through Shapley Additive Explanations (SHAP), and the interaction between dynamic modulus and influencing factors is evaluated by Partial Dependence Plot (PDP). The results indicate that the WOA-XGBoost model has excellent accuracy and robustness in predicting dynamic modulus. The three most important factors affecting dynamic modulus prediction results are the complex shear modulus of binder, the test temperature and the asphalt binder viscosity. The increase in dynamic modulus can be achieved through the utilization of asphalt binders characterized by relatively large complex modulus, high viscosity, small phase angle, and high asphalt PG indexes. Reducing the effective binder volume and air voids of the mixture, optimizing the mixture gradation to a suitable level, and increasing the mineral powder content can also lead to the increase of dynamic modulus. Besides, low test temperature and high frequency generally mean a large value of dynamic modulus. This study clarifies the impact of influencing factors on the performance of asphalt mixtures based on machine learning, which lay a foundation for the intelligent design of asphalt mixtures.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"65 ","pages":"Article 103111"},"PeriodicalIF":8.0,"publicationDate":"2025-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143136813","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimising predictive accuracy in sheet metal stamping with advanced machine learning: A LightGBM and neural network ensemble approach
IF 8 1区 工程技术
Advanced Engineering Informatics Pub Date : 2025-01-09 DOI: 10.1016/j.aei.2024.103103
Ema Stefanovska, Tomaž Pepelnjak
{"title":"Optimising predictive accuracy in sheet metal stamping with advanced machine learning: A LightGBM and neural network ensemble approach","authors":"Ema Stefanovska,&nbsp;Tomaž Pepelnjak","doi":"10.1016/j.aei.2024.103103","DOIUrl":"10.1016/j.aei.2024.103103","url":null,"abstract":"<div><div>This article presents an innovative ensemble model that integrates advanced machine learning techniques to enhance the precision of sheet metal stamping processes. By combining a light gradient boosting machine (LightGBM) with deep neural networks (DNNs), the model achieves high accuracy in predicting the final geometry of stamped sheet metal parts, and proactively identifies potential deviations to guarantee strict compliance to geometrical tolerances. In a comprehensive evaluation based on diverse performance metrics, the ensemble model demonstrates substantial improvements over the individual models, achieving a high coefficient of determination <em>R</em><sup>2</sup> of 0.951. Significantly, an extensive dataset derived from finite element method simulations is found to facilitate the training of our models in a variety of stamping scenarios, giving superior generalisability and reliability in terms of predictions. In addition, the integration of the ensemble model into an interactive web platform for real-time predictive analytics underscores its practical application in manufacturing settings, as it can optimise decision-making and operational efficiency. The predictive power of the ensemble model and its integration into a real-time framework provide a solid foundation for further advancements in developing a digital twin of the sheet metal stamping process. Our findings highlight the transformative potential of combining diverse machine learning techniques to revolutionise manufacturing processes, thus ensuring higher quality, adaptability, and cost efficiency.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"65 ","pages":"Article 103103"},"PeriodicalIF":8.0,"publicationDate":"2025-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143137302","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Digital twin-based smart shop-floor management and control: A review
IF 8 1区 工程技术
Advanced Engineering Informatics Pub Date : 2025-01-09 DOI: 10.1016/j.aei.2024.103102
Cunbo Zhuang , Lei Zhang , Shimin Liu , Jiewu Leng , Jianhua Liu , Fengque Pei
{"title":"Digital twin-based smart shop-floor management and control: A review","authors":"Cunbo Zhuang ,&nbsp;Lei Zhang ,&nbsp;Shimin Liu ,&nbsp;Jiewu Leng ,&nbsp;Jianhua Liu ,&nbsp;Fengque Pei","doi":"10.1016/j.aei.2024.103102","DOIUrl":"10.1016/j.aei.2024.103102","url":null,"abstract":"<div><div>Propelled by the latest advancements in information technology, shop-floor management and control (SMC) is transitioning towards a more intelligent paradigm, predominantly marked by data-driven insights and the integration of virtual reality. The digital twin (DT) stands out as a pivotal technology for the realization of cyber-physical systems, and its role in smart shop-floor management and control (SSMC) has attracted significant interest from both the industrial sector and academic circles. However, the application of DT in achieving SSMC remains diverse and lacks a structured methodology. In light of this, this review provides an in-depth analysis and discussion of the current state, limitations, and prospective trends of DT in SSMC. Initially, a DT-based SSMC framework is introduced to guide the subsequent literature review and thematic discussions. This is followed by an examination of DT-based SSMC research across four key dimensions: the development of shop-floor DT models, dynamic monitoring and forecasting of the shop-floor leveraging DT, DT-assisted shop-floor scheduling, and DT-driven production process control. The review culminates with an outline of challenges and future research directions for DT-based SSMC. This comprehensive review not only enhances researchers’ comprehension of SSMC but also offers a valuable reference for the continued application and integration of DT within this domain.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"65 ","pages":"Article 103102"},"PeriodicalIF":8.0,"publicationDate":"2025-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143136972","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Advancements in AI-Driven detection and localisation of solar panel defects
IF 8 1区 工程技术
Advanced Engineering Informatics Pub Date : 2025-01-09 DOI: 10.1016/j.aei.2024.103104
Ali Ghahremani, Scott D. Adams, Michael Norton, Sui Yang Khoo, Abbas Z. Kouzani
{"title":"Advancements in AI-Driven detection and localisation of solar panel defects","authors":"Ali Ghahremani,&nbsp;Scott D. Adams,&nbsp;Michael Norton,&nbsp;Sui Yang Khoo,&nbsp;Abbas Z. Kouzani","doi":"10.1016/j.aei.2024.103104","DOIUrl":"10.1016/j.aei.2024.103104","url":null,"abstract":"<div><div>Renewable energy production has experienced rapid growth over the past three decades and is projected to triple its global capacity by 2030. Given that the utilisation of solar photovoltaic (PV) technology plays a vital role in generating renewable electricity, it is crucial to continuously monitor the condition of solar panels because a variety of defects can significantly reduce their power production. In this paper, we review the latest artificial intelligence (AI) algorithms developed for inspecting solar panels. We also discuss various low-resource hardware systems used to execute these algorithms. AI algorithms are trained using datasets and images, including optical, infrared, and electroluminescence images of solar panels. These images can be captured by unmanned aerial vehicles (UAVs), ground vehicles, and fixed cameras. In this paper, we compare the precision, accuracy, and recall rates of a selection of reviewed AI algorithms. To gain a deeper understanding of these AI algorithms, we introduce a generic framework of AI-driven systems that can autonomously detect and localise solar panel defects and we analyse the literature based on this framework. Some of the main AI and image processing algorithms reviewed are YOLO V5 BDL, weight imprinting, custom-designed CNN, modified edge detection, fuzzy-based edge detection, and the modified Canny algorithm. We also discuss the main hardware systems used to execute image processing algorithms to localise and detect defects in solar panels: the central processing unit (CPU), field programmable gate array (FPGA), and graphics processing unit (GPU). Finally, as a future direction, we suggest developing image processing algorithms specifically designed for hardware systems tailored for machine learning, such as tensor processing units (TPUs). This development would further enhance the capabilities of solar panel inspection and defect detection.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"64 ","pages":"Article 103104"},"PeriodicalIF":8.0,"publicationDate":"2025-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143130278","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Identification and precise optimization of key assembly error links for complex aviation components driven by mechanism and data fusion model
IF 8 1区 工程技术
Advanced Engineering Informatics Pub Date : 2025-01-07 DOI: 10.1016/j.aei.2024.103059
Feiyan Guo , Zhang Yongliang , Song Changjie , Sha Xiliang
{"title":"Identification and precise optimization of key assembly error links for complex aviation components driven by mechanism and data fusion model","authors":"Feiyan Guo ,&nbsp;Zhang Yongliang ,&nbsp;Song Changjie ,&nbsp;Sha Xiliang","doi":"10.1016/j.aei.2024.103059","DOIUrl":"10.1016/j.aei.2024.103059","url":null,"abstract":"<div><div>As assembling complex aviation products, due to factors such as part deformation under loads, numerous process parameters, and complex error transmission path, the effective identification and optimization of key error links that affecting assembly accuracy significantly is challenging. In this paper, a mechanism and data fusion method for solving this problem was proposed. Firstly, the geometric-physical coupling relationship among composite thin-walled parts and the entire locating/clamping/joining/rebounding operations was analyzed. Then with the actual error information, the Jacobian-torsor matrix that representing error accumulation relationship was modified, and assembly error was calculated with the mechanism model. Secondly, with actual data processing solution to obtain the deviation of theoretical calculation results, the fusion model of integrating mechanism and data analysis results was proposed for predicting the final assembly accuracy. Subsequently, with massive data samples from the fusion model, the Sobol method was adopted to gain the global sensitivity coefficients of different error elements, and the key error links could be identified. Thirdly, with the accurate error fusion results, three single tolerance optimization models for the entire production process were established, i.e. manufacturing cost, assembly quality loss and repair cost. Then a weight parameters design method was proposed, which can avoid the conflict phenomena of data imbalance and optimization deviation problems among different goals, and the multi-objective tolerance allocation model was solved with intelligent algorithm. Finally, for the assembly work of wing-box component, key error links that having an obvious impact on the profile gap and step difference accuracy were identified and optimized, and beneficial quality/efficiency results were gained. This research could provide a strong interpretability for assembly accuracy analysis results, and a good applicability to practical assembly site.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"64 ","pages":"Article 103059"},"PeriodicalIF":8.0,"publicationDate":"2025-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143129625","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Robust operating performance assessment of flotation processes using convolutional neural networks and feature learning
IF 8 1区 工程技术
Advanced Engineering Informatics Pub Date : 2025-01-07 DOI: 10.1016/j.aei.2024.103087
Runda Jia , Mingxuan Ren , Jinglong Wang , Feng Yu , Dakuo He
{"title":"Robust operating performance assessment of flotation processes using convolutional neural networks and feature learning","authors":"Runda Jia ,&nbsp;Mingxuan Ren ,&nbsp;Jinglong Wang ,&nbsp;Feng Yu ,&nbsp;Dakuo He","doi":"10.1016/j.aei.2024.103087","DOIUrl":"10.1016/j.aei.2024.103087","url":null,"abstract":"<div><div>The use of computer vision, rather than manual observation, to assess flotation performance based on froth characteristics is crucial for optimizing and controlling the flotation process. Convolutional neural networks (CNNs) are widely employed for image recognition tasks related to evaluating flotation operating performance. However, previous studies have often overlooked the quality of feature learning within these networks, resulting in limited robustness, especially when industrial applications encounter image distortions that challenge network performance.</div><div>To address this issue, this paper proposes a CNN-based algorithm for robust assessment of flotation operating performance, focusing on learning features that accurately reflect froth characteristics. The network is guided through regression training to prioritize froth-specific features, while classification training enhances its ability to evaluate flotation performance. Iterative optimization is achieved by adjusting the regression training loss using feedback from classification results and expert knowledge, thereby refining the network’s performance.</div><div>Experimental results from industrial applications validate the effectiveness of the proposed algorithm, demonstrating its ability to learn key features of froth images and showing high robustness under various types and levels of image distortion.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"64 ","pages":"Article 103087"},"PeriodicalIF":8.0,"publicationDate":"2025-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143129698","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
LGGFormer: A dual-branch local-guided global self-attention network for surface defect segmentation
IF 8 1区 工程技术
Advanced Engineering Informatics Pub Date : 2025-01-07 DOI: 10.1016/j.aei.2024.103099
Gaowei Zhang , Yang Lu , Xiaoheng Jiang , Shaohui Jin , Shupan Li , Mingliang Xu
{"title":"LGGFormer: A dual-branch local-guided global self-attention network for surface defect segmentation","authors":"Gaowei Zhang ,&nbsp;Yang Lu ,&nbsp;Xiaoheng Jiang ,&nbsp;Shaohui Jin ,&nbsp;Shupan Li ,&nbsp;Mingliang Xu","doi":"10.1016/j.aei.2024.103099","DOIUrl":"10.1016/j.aei.2024.103099","url":null,"abstract":"<div><div>In industrial manufacturing, efficient and accurate surface defect detection is paramount. Recently, CNN-based defect segmentation networks have achieved significant success but have limitations in capturing global contextual information. Although Transformer models excel in global modeling, they often lack sufficient attention to local details. To combine the advantages of CNN and Transformer, this paper proposes a dual-branch local-guided global self-attention network (LGGFormer) for Surface Defect Segmentation. Considering the unique characteristics and computational differences between CNN and Transformer, we propose Local-Guided Global Attention Self-Attention (LGGSA) for extracting global and local information. LGGSA computes localized attention through a sliding window to capture rich contextual details. These local features are then aggregated for global attention computation, enabling the model to focus on areas signified as important by local information. To address the problems of tiny defects and low background contrast, we enhance the learning process by adding supervision to the CNN branch, forcing the branch to learn detailed boundary information. In addition, to take full advantage of the different modeling potentials of CNN and Transformer, we designed the Cross-Branch Feature Interaction Module (CBFI), which achieves a deep interaction between the two features through correlation-weighted integration to optimize feature extraction and representation. Finally, the edge-guided decoder (EGD) utilizes the boundary information extracted by the CNN to guide feature fusion to compensate for the loss of detail information. Experimental results on three public defect datasets demonstrate that our method exhibits promising performance.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"64 ","pages":"Article 103099"},"PeriodicalIF":8.0,"publicationDate":"2025-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143129636","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信