{"title":"基于改进机器阅读理解的中文实体关系抽取方法","authors":"Tianci Shang, Baosong Deng, Tingsong Jiang","doi":"10.1109/iip57348.2022.00043","DOIUrl":null,"url":null,"abstract":"As the downstream task of building a knowledge graph, Chinese entity relationship extraction from unstructured texts plays an important role in the field of natural language processing. There are two main ways for Chinese entity relation extraction: joint extraction method and pipeline extraction method. The joint extraction method outputs the relation triples contained in the texts directly in a row, which causes two problems: the lack of external knowledge and the nesting of entities. This article proposes a method to take the advantage of the similarity between the span extraction task and the information extraction task, and transforms entity relation extraction problem into a task similar to machine reading comprehension. This method first uses the Roberta pre-training model to obtain word representation with relation information, and then identifies entity pairs that may exists under per relation through a global pointer network, which outperforms better than the normal pointer network. By comparing different models’ performance on the same dataset, the results show the accuracy, recall and F1 scores of our method are higher than other methods, which proves the effectiveness of our method.","PeriodicalId":412907,"journal":{"name":"2022 4th International Conference on Intelligent Information Processing (IIP)","volume":"451 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Chinese Entity-Relation Extraction Method Via Improved Machine Reading Comprehension\",\"authors\":\"Tianci Shang, Baosong Deng, Tingsong Jiang\",\"doi\":\"10.1109/iip57348.2022.00043\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"As the downstream task of building a knowledge graph, Chinese entity relationship extraction from unstructured texts plays an important role in the field of natural language processing. There are two main ways for Chinese entity relation extraction: joint extraction method and pipeline extraction method. The joint extraction method outputs the relation triples contained in the texts directly in a row, which causes two problems: the lack of external knowledge and the nesting of entities. This article proposes a method to take the advantage of the similarity between the span extraction task and the information extraction task, and transforms entity relation extraction problem into a task similar to machine reading comprehension. This method first uses the Roberta pre-training model to obtain word representation with relation information, and then identifies entity pairs that may exists under per relation through a global pointer network, which outperforms better than the normal pointer network. By comparing different models’ performance on the same dataset, the results show the accuracy, recall and F1 scores of our method are higher than other methods, which proves the effectiveness of our method.\",\"PeriodicalId\":412907,\"journal\":{\"name\":\"2022 4th International Conference on Intelligent Information Processing (IIP)\",\"volume\":\"451 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 4th International Conference on Intelligent Information Processing (IIP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/iip57348.2022.00043\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 4th International Conference on Intelligent Information Processing (IIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/iip57348.2022.00043","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Chinese Entity-Relation Extraction Method Via Improved Machine Reading Comprehension
As the downstream task of building a knowledge graph, Chinese entity relationship extraction from unstructured texts plays an important role in the field of natural language processing. There are two main ways for Chinese entity relation extraction: joint extraction method and pipeline extraction method. The joint extraction method outputs the relation triples contained in the texts directly in a row, which causes two problems: the lack of external knowledge and the nesting of entities. This article proposes a method to take the advantage of the similarity between the span extraction task and the information extraction task, and transforms entity relation extraction problem into a task similar to machine reading comprehension. This method first uses the Roberta pre-training model to obtain word representation with relation information, and then identifies entity pairs that may exists under per relation through a global pointer network, which outperforms better than the normal pointer network. By comparing different models’ performance on the same dataset, the results show the accuracy, recall and F1 scores of our method are higher than other methods, which proves the effectiveness of our method.