Authors: Na Zhou, Yuan Yuan, Lei Chen
Journal: Applied Intelligence, vol. 55, no. 7 (Impact Factor 3.4; JCR Q2, Computer Science, Artificial Intelligence)
Publication date: 2025-04-16
Publication type: Journal Article
DOI: 10.1007/s10489-025-06556-5
URL: https://link.springer.com/article/10.1007/s10489-025-06556-5
A two-stage knowledge graph completion based on LLMs’ data augmentation and atrous spatial pyramid pooling
With the development of information technology, large amounts of unstructured and fragmented data are being generated. Knowledge graphs can effectively integrate such fragmented data. However, because domain knowledge is difficult to mine, knowledge graphs suffer from data sparsity and missing data. In addition, standard convolutional neural networks have limited capability in capturing feature interactions. To address data sparsity and the limitations of standard convolutional models, we propose DA-ARKGC, a two-stage knowledge graph completion model, using wheat as a case study. In the first stage, to address the data sparsity problem, a rule-mining data augmentation module (DA) based on large language models expands the wheat knowledge graph. In the second stage, a knowledge completion module (ARKGC) based on atrous spatial pyramid pooling with residual connections performs knowledge completion. The DA-ARKGC model was validated on the constructed wheat knowledge graph (Wheat_KG). Compared with ConvE, its MRR, Hits@1, Hits@3, and Hits@10 increased by 10%, 10.2%, 10.1%, and 9.3%, respectively. To verify the effectiveness and generalization of the ARKGC module, experiments were also conducted on the open-source datasets WN18 and FB15k. The results demonstrated that the model achieved optimal or sub-optimal performance compared with other baseline models.
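The abstract reports results as MRR and Hits@k, the standard ranking metrics for link prediction. As a minimal sketch of how these are usually computed (not the authors' evaluation code), the Python below assumes each test triple has already been scored and that the 1-based rank of the correct entity is known. The function names and the example ranks are hypothetical and are not taken from the paper.

```python
from typing import Dict, List, Sequence


def mean_reciprocal_rank(ranks: Sequence[int]) -> float:
    """Average of 1/rank over all test triples (ranks are 1-based)."""
    if not ranks:
        raise ValueError("ranks must not be empty")
    return sum(1.0 / r for r in ranks) / len(ranks)


def hits_at_k(ranks: Sequence[int], k: int) -> float:
    """Fraction of test triples whose correct entity is ranked in the top k."""
    if not ranks:
        raise ValueError("ranks must not be empty")
    return sum(1 for r in ranks if r <= k) / len(ranks)


def evaluate(ranks: Sequence[int], ks: Sequence[int] = (1, 3, 10)) -> Dict[str, float]:
    """Collect MRR and Hits@k for each k into one dictionary."""
    metrics = {"MRR": mean_reciprocal_rank(ranks)}
    for k in ks:
        metrics[f"Hits@{k}"] = hits_at_k(ranks, k)
    return metrics


# Hypothetical ranks of the correct entity for five test triples.
example_ranks: List[int] = [1, 2, 4, 10, 50]
metrics = evaluate(example_ranks)

for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

Both metrics lie in [0, 1], and higher is better. MRR rewards placing the correct entity near the top of the ranking, while Hits@k only checks whether it falls within the top k.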
About the journal:
With a focus on research in artificial intelligence and neural networks, this journal addresses issues involving solutions of real-life manufacturing, defense, management, government and industrial problems which are too complex to be solved through conventional approaches and require the simulation of intelligent thought processes, heuristics, applications of knowledge, and distributed and parallel processing. The integration of these multiple approaches in solving complex problems is of particular importance.
The journal presents new and original research and technological developments, addressing real and complex issues applicable to difficult problems. It provides a medium for exchanging scientific research and technological achievements accomplished by the international community.