Joint extraction method for entity relations from mineral resources reports integrating dependency parsing and improved graph convolutional networks

IF 3.2 2区 地球科学 Q1 GEOLOGY
Qinjun Qiu , Xiangguo Jin , Miao Tian , Qirui Wu , Liufeng Tao , Jianguo Chen , Zhong Xie
{"title":"Joint extraction method for entity relations from mineral resources reports integrating dependency parsing and improved graph convolutional networks","authors":"Qinjun Qiu ,&nbsp;Xiangguo Jin ,&nbsp;Miao Tian ,&nbsp;Qirui Wu ,&nbsp;Liufeng Tao ,&nbsp;Jianguo Chen ,&nbsp;Zhong Xie","doi":"10.1016/j.oregeorev.2025.106640","DOIUrl":null,"url":null,"abstract":"<div><div>Geological reports, as crucial technical documents reflecting the outcomes of geological survey work, encapsulate extensive expert and domain knowledge. Geological knowledge graphs integrate vast amounts of data, facilitating efficient and rapid extraction of knowledge embedded within geoscientific data. The extraction of geological entity relations is a key method in creating these knowledge graphs. Existing techniques for extracting geological entities and their relations encounter difficulties such entity overlap, relation overlap, and the challenge of obtaining deep semantic information because of the vastness and complexity of geological data. Our study suggests a collaborative extraction model for entity relations that integrates dependency syntactic relations with a graph convolutional network (GCN) in order to address these problems. This model learns dependency syntactic structures and deep semantic information by building a GCN that includes dependency syntactic relations. A pointer network decoder is then added to increase entity relation extraction efficiency. Dependencies between words in a phrase, such as subject-verb and verb-object relations, are revealed via dependency syntactic analysis. By structuring these dependencies into a graph, the model captures syntactic structural information. Through operations involving adjacency matrices and feature matrices, the model effectively propagates and aggregates node information, thereby capturing the global dependency syntactic structure and deep semantic information of sentences. The integration of dependency syntactic relations with GCN processing enables the model to more accurately comprehend entity relations within sentences. Results from experiments show that this model successfully tackles problems like overlapping entity relations and the challenge of gleaning deep semantic information from geological texts. It achieves a 79.73% accuracy rate and a 77.98% F1 score on geological text datasets.</div></div>","PeriodicalId":19644,"journal":{"name":"Ore Geology Reviews","volume":"182 ","pages":"Article 106640"},"PeriodicalIF":3.2000,"publicationDate":"2025-05-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ore Geology Reviews","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0169136825002008","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GEOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Geological reports, as crucial technical documents reflecting the outcomes of geological survey work, encapsulate extensive expert and domain knowledge. Geological knowledge graphs integrate vast amounts of data, facilitating efficient and rapid extraction of knowledge embedded within geoscientific data. The extraction of geological entity relations is a key method in creating these knowledge graphs. Existing techniques for extracting geological entities and their relations encounter difficulties such entity overlap, relation overlap, and the challenge of obtaining deep semantic information because of the vastness and complexity of geological data. Our study suggests a collaborative extraction model for entity relations that integrates dependency syntactic relations with a graph convolutional network (GCN) in order to address these problems. This model learns dependency syntactic structures and deep semantic information by building a GCN that includes dependency syntactic relations. A pointer network decoder is then added to increase entity relation extraction efficiency. Dependencies between words in a phrase, such as subject-verb and verb-object relations, are revealed via dependency syntactic analysis. By structuring these dependencies into a graph, the model captures syntactic structural information. Through operations involving adjacency matrices and feature matrices, the model effectively propagates and aggregates node information, thereby capturing the global dependency syntactic structure and deep semantic information of sentences. The integration of dependency syntactic relations with GCN processing enables the model to more accurately comprehend entity relations within sentences. Results from experiments show that this model successfully tackles problems like overlapping entity relations and the challenge of gleaning deep semantic information from geological texts. It achieves a 79.73% accuracy rate and a 77.98% F1 score on geological text datasets.

Abstract Image

结合依赖关系分析和改进图卷积网络的矿产资源报告实体关系联合提取方法
地质报告是反映地质调查工作成果的重要技术文件,包含了广泛的专家知识和领域知识。地质知识图集成了大量的数据,便于高效、快速地提取嵌入在地球科学数据中的知识。地质实体关系的提取是构建这些知识图谱的关键方法。由于地质数据的庞大和复杂,现有的地质实体及其关系提取技术存在实体重叠、关系重叠以及获取深层语义信息等困难。我们的研究提出了一种实体关系的协作提取模型,该模型将依赖句法关系与图卷积网络(GCN)集成在一起,以解决这些问题。该模型通过构建包含依赖句法关系的GCN来学习依赖句法结构和深层语义信息。然后增加一个指针网络解码器,以提高实体关系提取效率。短语中词之间的依存关系,如主谓关系、动宾关系等,是通过依存句法分析来揭示的。通过将这些依赖关系构建成一个图,模型捕获语法结构信息。该模型通过涉及邻接矩阵和特征矩阵的运算,有效地传播和聚合节点信息,从而捕获句子的全局依赖句法结构和深层语义信息。依赖句法关系与GCN处理的集成使模型能够更准确地理解句子内的实体关系。实验结果表明,该模型成功地解决了实体关系重叠和地质文本深层语义信息收集等问题。该方法在地质文本数据集上的准确率为79.73%,F1分数为77.98%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Ore Geology Reviews
Ore Geology Reviews 地学-地质学
CiteScore
6.50
自引率
27.30%
发文量
546
审稿时长
22.9 weeks
期刊介绍: Ore Geology Reviews aims to familiarize all earth scientists with recent advances in a number of interconnected disciplines related to the study of, and search for, ore deposits. The reviews range from brief to longer contributions, but the journal preferentially publishes manuscripts that fill the niche between the commonly shorter journal articles and the comprehensive book coverages, and thus has a special appeal to many authors and readers.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信