Qinjun Qiu , Xiangguo Jin , Miao Tian , Qirui Wu , Liufeng Tao , Jianguo Chen , Zhong Xie
{"title":"Joint extraction method for entity relations from mineral resources reports integrating dependency parsing and improved graph convolutional networks","authors":"Qinjun Qiu , Xiangguo Jin , Miao Tian , Qirui Wu , Liufeng Tao , Jianguo Chen , Zhong Xie","doi":"10.1016/j.oregeorev.2025.106640","DOIUrl":null,"url":null,"abstract":"<div><div>Geological reports, as crucial technical documents reflecting the outcomes of geological survey work, encapsulate extensive expert and domain knowledge. Geological knowledge graphs integrate vast amounts of data, facilitating efficient and rapid extraction of knowledge embedded within geoscientific data. The extraction of geological entity relations is a key method in creating these knowledge graphs. Existing techniques for extracting geological entities and their relations encounter difficulties such entity overlap, relation overlap, and the challenge of obtaining deep semantic information because of the vastness and complexity of geological data. Our study suggests a collaborative extraction model for entity relations that integrates dependency syntactic relations with a graph convolutional network (GCN) in order to address these problems. This model learns dependency syntactic structures and deep semantic information by building a GCN that includes dependency syntactic relations. A pointer network decoder is then added to increase entity relation extraction efficiency. Dependencies between words in a phrase, such as subject-verb and verb-object relations, are revealed via dependency syntactic analysis. By structuring these dependencies into a graph, the model captures syntactic structural information. Through operations involving adjacency matrices and feature matrices, the model effectively propagates and aggregates node information, thereby capturing the global dependency syntactic structure and deep semantic information of sentences. The integration of dependency syntactic relations with GCN processing enables the model to more accurately comprehend entity relations within sentences. Results from experiments show that this model successfully tackles problems like overlapping entity relations and the challenge of gleaning deep semantic information from geological texts. It achieves a 79.73% accuracy rate and a 77.98% F1 score on geological text datasets.</div></div>","PeriodicalId":19644,"journal":{"name":"Ore Geology Reviews","volume":"182 ","pages":"Article 106640"},"PeriodicalIF":3.2000,"publicationDate":"2025-05-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ore Geology Reviews","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0169136825002008","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GEOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Geological reports, as crucial technical documents reflecting the outcomes of geological survey work, encapsulate extensive expert and domain knowledge. Geological knowledge graphs integrate vast amounts of data, facilitating efficient and rapid extraction of knowledge embedded within geoscientific data. The extraction of geological entity relations is a key method in creating these knowledge graphs. Existing techniques for extracting geological entities and their relations encounter difficulties such entity overlap, relation overlap, and the challenge of obtaining deep semantic information because of the vastness and complexity of geological data. Our study suggests a collaborative extraction model for entity relations that integrates dependency syntactic relations with a graph convolutional network (GCN) in order to address these problems. This model learns dependency syntactic structures and deep semantic information by building a GCN that includes dependency syntactic relations. A pointer network decoder is then added to increase entity relation extraction efficiency. Dependencies between words in a phrase, such as subject-verb and verb-object relations, are revealed via dependency syntactic analysis. By structuring these dependencies into a graph, the model captures syntactic structural information. Through operations involving adjacency matrices and feature matrices, the model effectively propagates and aggregates node information, thereby capturing the global dependency syntactic structure and deep semantic information of sentences. The integration of dependency syntactic relations with GCN processing enables the model to more accurately comprehend entity relations within sentences. Results from experiments show that this model successfully tackles problems like overlapping entity relations and the challenge of gleaning deep semantic information from geological texts. It achieves a 79.73% accuracy rate and a 77.98% F1 score on geological text datasets.
期刊介绍:
Ore Geology Reviews aims to familiarize all earth scientists with recent advances in a number of interconnected disciplines related to the study of, and search for, ore deposits. The reviews range from brief to longer contributions, but the journal preferentially publishes manuscripts that fill the niche between the commonly shorter journal articles and the comprehensive book coverages, and thus has a special appeal to many authors and readers.