基于神经网络方法的全文文档语义模式识别

GraphiCon'2019 Proceedings. Volume 2 Pub Date : 2019-11-11 DOI:10.30987/graphicon-2019-2-276-279

Олег Золотарев, O. Zolotarev, Ярослав Соломенцев, Yaroslav K. Solomentsev, Аида Хакимова, Aida Khakimova, Михаил Шарнин, M. Charnine

{"title":"基于神经网络方法的全文文档语义模式识别","authors":"Олег Золотарев, O. Zolotarev, Ярослав Соломенцев, Yaroslav K. Solomentsev, Аида Хакимова, Aida Khakimova, Михаил Шарнин, M. Charnine","doi":"10.30987/graphicon-2019-2-276-279","DOIUrl":null,"url":null,"abstract":"Processing and text mining are becoming increasingly possible thanks to the development of computer technology, as well as the development of artificial intelligence (machine learning). This article describes approaches to the analysis of texts in natural language using methods of morphological, syntactic and semantic analysis. Morphological and syntactic analysis of the text is carried out using the Pullenti system, which allows not only to normalize words, but also to distinguish named entities, their characteristics, and relationships between them. As a result, a semantic network of related named entities is built, such as people, positions, geographical names, business associations, documents, education, dates, etc. The word2vec technology is used to identify semantic patterns in the text based on the joint occurrence of terms. The possibility of joint use of the described technologies is being considered.","PeriodicalId":409819,"journal":{"name":"GraphiCon'2019 Proceedings. Volume 2","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Identification of Semantic Patterns in Full-text Documents Using Neural Network Methods\",\"authors\":\"Олег Золотарев, O. Zolotarev, Ярослав Соломенцев, Yaroslav K. Solomentsev, Аида Хакимова, Aida Khakimova, Михаил Шарнин, M. Charnine\",\"doi\":\"10.30987/graphicon-2019-2-276-279\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Processing and text mining are becoming increasingly possible thanks to the development of computer technology, as well as the development of artificial intelligence (machine learning). This article describes approaches to the analysis of texts in natural language using methods of morphological, syntactic and semantic analysis. Morphological and syntactic analysis of the text is carried out using the Pullenti system, which allows not only to normalize words, but also to distinguish named entities, their characteristics, and relationships between them. As a result, a semantic network of related named entities is built, such as people, positions, geographical names, business associations, documents, education, dates, etc. The word2vec technology is used to identify semantic patterns in the text based on the joint occurrence of terms. The possibility of joint use of the described technologies is being considered.\",\"PeriodicalId\":409819,\"journal\":{\"name\":\"GraphiCon'2019 Proceedings. Volume 2\",\"volume\":\"7 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-11-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"GraphiCon'2019 Proceedings. Volume 2\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.30987/graphicon-2019-2-276-279\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"GraphiCon'2019 Proceedings. Volume 2","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.30987/graphicon-2019-2-276-279","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 6

摘要

由于计算机技术的发展，以及人工智能(机器学习)的发展，处理和文本挖掘变得越来越可能。本文介绍了自然语言文本的形态分析、句法分析和语义分析方法。使用Pullenti系统对文本进行形态和句法分析，该系统不仅可以对单词进行规范化，还可以区分命名实体、它们的特征以及它们之间的关系。因此，建立了一个相关命名实体的语义网络，如人员、职位、地名、商业协会、文档、教育、日期等。word2vec技术用于根据术语的联合出现来识别文本中的语义模式。正在考虑联合使用上述技术的可能性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Identification of Semantic Patterns in Full-text Documents Using Neural Network Methods

Processing and text mining are becoming increasingly possible thanks to the development of computer technology, as well as the development of artificial intelligence (machine learning). This article describes approaches to the analysis of texts in natural language using methods of morphological, syntactic and semantic analysis. Morphological and syntactic analysis of the text is carried out using the Pullenti system, which allows not only to normalize words, but also to distinguish named entities, their characteristics, and relationships between them. As a result, a semantic network of related named entities is built, such as people, positions, geographical names, business associations, documents, education, dates, etc. The word2vec technology is used to identify semantic patterns in the text based on the joint occurrence of terms. The possibility of joint use of the described technologies is being considered.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

GraphiCon'2019 Proceedings. Volume 2

自引率

0.00%

发文量