Олег Золотарев, O. Zolotarev, Ярослав Соломенцев, Yaroslav K. Solomentsev, Аида Хакимова, Aida Khakimova, Михаил Шарнин, M. Charnine
{"title":"基于神经网络方法的全文文档语义模式识别","authors":"Олег Золотарев, O. Zolotarev, Ярослав Соломенцев, Yaroslav K. Solomentsev, Аида Хакимова, Aida Khakimova, Михаил Шарнин, M. Charnine","doi":"10.30987/graphicon-2019-2-276-279","DOIUrl":null,"url":null,"abstract":"Processing and text mining are becoming increasingly possible thanks to the development of computer technology, as well as the development of artificial intelligence (machine learning). This article describes approaches to the analysis of texts in natural language using methods of morphological, syntactic and semantic analysis. Morphological and syntactic analysis of the text is carried out using the Pullenti system, which allows not only to normalize words, but also to distinguish named entities, their characteristics, and relationships between them. As a result, a semantic network of related named entities is built, such as people, positions, geographical names, business associations, documents, education, dates, etc. The word2vec technology is used to identify semantic patterns in the text based on the joint occurrence of terms. The possibility of joint use of the described technologies is being considered.","PeriodicalId":409819,"journal":{"name":"GraphiCon'2019 Proceedings. Volume 2","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Identification of Semantic Patterns in Full-text Documents Using Neural Network Methods\",\"authors\":\"Олег Золотарев, O. Zolotarev, Ярослав Соломенцев, Yaroslav K. Solomentsev, Аида Хакимова, Aida Khakimova, Михаил Шарнин, M. Charnine\",\"doi\":\"10.30987/graphicon-2019-2-276-279\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Processing and text mining are becoming increasingly possible thanks to the development of computer technology, as well as the development of artificial intelligence (machine learning). This article describes approaches to the analysis of texts in natural language using methods of morphological, syntactic and semantic analysis. Morphological and syntactic analysis of the text is carried out using the Pullenti system, which allows not only to normalize words, but also to distinguish named entities, their characteristics, and relationships between them. As a result, a semantic network of related named entities is built, such as people, positions, geographical names, business associations, documents, education, dates, etc. The word2vec technology is used to identify semantic patterns in the text based on the joint occurrence of terms. The possibility of joint use of the described technologies is being considered.\",\"PeriodicalId\":409819,\"journal\":{\"name\":\"GraphiCon'2019 Proceedings. Volume 2\",\"volume\":\"7 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-11-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"GraphiCon'2019 Proceedings. Volume 2\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.30987/graphicon-2019-2-276-279\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"GraphiCon'2019 Proceedings. Volume 2","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.30987/graphicon-2019-2-276-279","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Identification of Semantic Patterns in Full-text Documents Using Neural Network Methods
Processing and text mining are becoming increasingly possible thanks to the development of computer technology, as well as the development of artificial intelligence (machine learning). This article describes approaches to the analysis of texts in natural language using methods of morphological, syntactic and semantic analysis. Morphological and syntactic analysis of the text is carried out using the Pullenti system, which allows not only to normalize words, but also to distinguish named entities, their characteristics, and relationships between them. As a result, a semantic network of related named entities is built, such as people, positions, geographical names, business associations, documents, education, dates, etc. The word2vec technology is used to identify semantic patterns in the text based on the joint occurrence of terms. The possibility of joint use of the described technologies is being considered.