Yuqing Shang , Zhijiang Ke , Peng Lin , Qianhui Ren , Wenshan Zhang , Xiongwu Wang , Xiaotao Li , Fuyuan Gong , Shiqi Wang , Baofa Wang , Zhengkui Xu , Minglun Sun , Shunli Tan
{"title":"Empowering knowledge graphs with hybrid retrieval-augmented generation for the intelligent mix scheme of mass concrete","authors":"Yuqing Shang , Zhijiang Ke , Peng Lin , Qianhui Ren , Wenshan Zhang , Xiongwu Wang , Xiaotao Li , Fuyuan Gong , Shiqi Wang , Baofa Wang , Zhengkui Xu , Minglun Sun , Shunli Tan","doi":"10.1016/j.cscm.2025.e04979","DOIUrl":null,"url":null,"abstract":"<div><div>To achieve precise multi-objective mix design of dam concrete-targeting strength, durability, and crack resistance-this study proposes a Dam Concrete Mix Proportioning intelligent generation System (DCMPS) based on a Retrieval-Augmented Generation (RAG) framework. Firstly, a total of 1723 mix proportioning records collected from 15 hydropower stations were structurally extracted using a customized data transformation tool. A knowledge graph (KG) was then constructed on the Neo4j platform, encompassing entities such as environmental conditions, material compositions, and performance indicators. Secondly, the Moka Massive Mixed Embedding (M3E) model was employed to encode the KG into a vector database, enabling accurate retrieval of relevant mix proportioning cases via cosine similarity calculation. Finally, based on a hierarchical reasoning architecture driven by prompt engineering, DCMPS retrieves relevant cases from the vector database using a semantic search mechanism. A large language model (LLM) conducts comparative analysis on the retrieved cases to generate mix design schemes tailored to specific requirements. The DCMPS’s performance was validated using a test dataset, The results demonstrate that the system outperforms the larger-parameter native LLM in overall evaluation on the test dataset, confirming that knowledge augmentation significantly enhances the performance of smaller models. Case studies indicate that the recommended mix designs closely align with the target performance requirements, and the comparison of relevant cases along with the analysis of material influence mechanisms enhances the interpretability of the results. This DCMPS provides effective technical support for intelligent analysis and engineering decision-making in concrete mix proportioning.</div></div>","PeriodicalId":9641,"journal":{"name":"Case Studies in Construction Materials","volume":"23 ","pages":"Article e04979"},"PeriodicalIF":6.6000,"publicationDate":"2025-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Case Studies in Construction Materials","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2214509525007776","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CONSTRUCTION & BUILDING TECHNOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
To achieve precise multi-objective mix design of dam concrete-targeting strength, durability, and crack resistance-this study proposes a Dam Concrete Mix Proportioning intelligent generation System (DCMPS) based on a Retrieval-Augmented Generation (RAG) framework. Firstly, a total of 1723 mix proportioning records collected from 15 hydropower stations were structurally extracted using a customized data transformation tool. A knowledge graph (KG) was then constructed on the Neo4j platform, encompassing entities such as environmental conditions, material compositions, and performance indicators. Secondly, the Moka Massive Mixed Embedding (M3E) model was employed to encode the KG into a vector database, enabling accurate retrieval of relevant mix proportioning cases via cosine similarity calculation. Finally, based on a hierarchical reasoning architecture driven by prompt engineering, DCMPS retrieves relevant cases from the vector database using a semantic search mechanism. A large language model (LLM) conducts comparative analysis on the retrieved cases to generate mix design schemes tailored to specific requirements. The DCMPS’s performance was validated using a test dataset, The results demonstrate that the system outperforms the larger-parameter native LLM in overall evaluation on the test dataset, confirming that knowledge augmentation significantly enhances the performance of smaller models. Case studies indicate that the recommended mix designs closely align with the target performance requirements, and the comparison of relevant cases along with the analysis of material influence mechanisms enhances the interpretability of the results. This DCMPS provides effective technical support for intelligent analysis and engineering decision-making in concrete mix proportioning.
期刊介绍:
Case Studies in Construction Materials provides a forum for the rapid publication of short, structured Case Studies on construction materials. In addition, the journal also publishes related Short Communications, Full length research article and Comprehensive review papers (by invitation).
The journal will provide an essential compendium of case studies for practicing engineers, designers, researchers and other practitioners who are interested in all aspects construction materials. The journal will publish new and novel case studies, but will also provide a forum for the publication of high quality descriptions of classic construction material problems and solutions.