{"title":"基于自然语言处理的语义感知迁移学习框架在有限数据下预测复合材料棒的跨材料结合强度","authors":"Pei-Fu Zhang , Daxu Zhang , Qi Zhao , Xuan Zhao , Yiliyaer Tuerxunmaimaiti","doi":"10.1016/j.engappai.2025.112011","DOIUrl":null,"url":null,"abstract":"<div><div>This study develops a novel transfer learning (TL) framework integrating natural language processing (NLP) and deep learning (DL) to predict the bond strength of fiber-reinforced polymer (FRP) composite bars in ultra-high-performance seawater sea-sand concrete (UHPSSC) using limited experimental data. The framework incorporates an NLP data processor to handle textual variables, a gated transformer to enhance parameter representations, and DL models for regression. Through leveraging the characteristics of seawater sea-sand concrete (SWSSC) and ultra-high-performance concrete (UHPC), it enables effective TL from large FRP–SWSSC/UHPC datasets to smaller FRP–UHPSSC datasets. Three pre-trained models were developed using FRP–SWSSC, FRP–UHPC, and combined datasets, respectively, and then fine-tuned on FRP–UHPSSC data to create three TL models. Compared with traditional machine learning (ML) models such as XGBoost and random forest (RF) trained directly on FRP–UHPSSC data, the proposed TL models achieved significantly higher predictive accuracy. Specifically, the optimal-performing TL model fine-tuned using combined FRP–SWSSC/UHPC data, achieved an R<sup>2</sup> of 0.914 and root mean square error (RMSE) of 0.75 on the test set, corresponding to a 13 % increase in R<sup>2</sup> and a 47.6 % reduction in RMSE relative to the optimal ML model. Furthermore, NLP-based textual representation improved the TL model's test R<sup>2</sup> from 0.876 to 0.914, highlighting the effectiveness in handling tabular data. The TL model also demonstrates semantic recognition capabilities, addressing variations in parameter expressions and dimensionality, enhancing robustness across scenarios. Overall, the proposed NLP-based TL framework effectively integrates heterogeneous material knowledge and substantially enhances predictive performance and generalization, offering a promising approach for data-limited engineering applications.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"160 ","pages":"Article 112011"},"PeriodicalIF":8.0000,"publicationDate":"2025-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A semantic-aware transfer learning framework via natural language processing to predict the cross-material bond strength of composite bars with limited data\",\"authors\":\"Pei-Fu Zhang , Daxu Zhang , Qi Zhao , Xuan Zhao , Yiliyaer Tuerxunmaimaiti\",\"doi\":\"10.1016/j.engappai.2025.112011\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>This study develops a novel transfer learning (TL) framework integrating natural language processing (NLP) and deep learning (DL) to predict the bond strength of fiber-reinforced polymer (FRP) composite bars in ultra-high-performance seawater sea-sand concrete (UHPSSC) using limited experimental data. The framework incorporates an NLP data processor to handle textual variables, a gated transformer to enhance parameter representations, and DL models for regression. Through leveraging the characteristics of seawater sea-sand concrete (SWSSC) and ultra-high-performance concrete (UHPC), it enables effective TL from large FRP–SWSSC/UHPC datasets to smaller FRP–UHPSSC datasets. Three pre-trained models were developed using FRP–SWSSC, FRP–UHPC, and combined datasets, respectively, and then fine-tuned on FRP–UHPSSC data to create three TL models. Compared with traditional machine learning (ML) models such as XGBoost and random forest (RF) trained directly on FRP–UHPSSC data, the proposed TL models achieved significantly higher predictive accuracy. Specifically, the optimal-performing TL model fine-tuned using combined FRP–SWSSC/UHPC data, achieved an R<sup>2</sup> of 0.914 and root mean square error (RMSE) of 0.75 on the test set, corresponding to a 13 % increase in R<sup>2</sup> and a 47.6 % reduction in RMSE relative to the optimal ML model. Furthermore, NLP-based textual representation improved the TL model's test R<sup>2</sup> from 0.876 to 0.914, highlighting the effectiveness in handling tabular data. The TL model also demonstrates semantic recognition capabilities, addressing variations in parameter expressions and dimensionality, enhancing robustness across scenarios. Overall, the proposed NLP-based TL framework effectively integrates heterogeneous material knowledge and substantially enhances predictive performance and generalization, offering a promising approach for data-limited engineering applications.</div></div>\",\"PeriodicalId\":50523,\"journal\":{\"name\":\"Engineering Applications of Artificial Intelligence\",\"volume\":\"160 \",\"pages\":\"Article 112011\"},\"PeriodicalIF\":8.0000,\"publicationDate\":\"2025-08-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Engineering Applications of Artificial Intelligence\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0952197625020196\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197625020196","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
A semantic-aware transfer learning framework via natural language processing to predict the cross-material bond strength of composite bars with limited data
This study develops a novel transfer learning (TL) framework integrating natural language processing (NLP) and deep learning (DL) to predict the bond strength of fiber-reinforced polymer (FRP) composite bars in ultra-high-performance seawater sea-sand concrete (UHPSSC) using limited experimental data. The framework incorporates an NLP data processor to handle textual variables, a gated transformer to enhance parameter representations, and DL models for regression. Through leveraging the characteristics of seawater sea-sand concrete (SWSSC) and ultra-high-performance concrete (UHPC), it enables effective TL from large FRP–SWSSC/UHPC datasets to smaller FRP–UHPSSC datasets. Three pre-trained models were developed using FRP–SWSSC, FRP–UHPC, and combined datasets, respectively, and then fine-tuned on FRP–UHPSSC data to create three TL models. Compared with traditional machine learning (ML) models such as XGBoost and random forest (RF) trained directly on FRP–UHPSSC data, the proposed TL models achieved significantly higher predictive accuracy. Specifically, the optimal-performing TL model fine-tuned using combined FRP–SWSSC/UHPC data, achieved an R2 of 0.914 and root mean square error (RMSE) of 0.75 on the test set, corresponding to a 13 % increase in R2 and a 47.6 % reduction in RMSE relative to the optimal ML model. Furthermore, NLP-based textual representation improved the TL model's test R2 from 0.876 to 0.914, highlighting the effectiveness in handling tabular data. The TL model also demonstrates semantic recognition capabilities, addressing variations in parameter expressions and dimensionality, enhancing robustness across scenarios. Overall, the proposed NLP-based TL framework effectively integrates heterogeneous material knowledge and substantially enhances predictive performance and generalization, offering a promising approach for data-limited engineering applications.
期刊介绍:
Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.