A. Bondarev, A. Bondarenko, V. Galaktionov, L. Shapiro
Proceedings of the 30th International Conference on Computer Graphics and Machine Vision (GraphiCon 2020). Part 2. Published 2020-12-17. DOI: 10.51130/graphicon-2020-2-3-20 (https://doi.org/10.51130/graphicon-2020-2-3-20)
Visual Analysis of Textual Information on the Frequencies of Joint Use of Nouns and Adjectives
This paper presents the results of numerical experiments on data sets consisting of the frequencies with which adjectives and nouns are used together. The data were obtained from samples of Russian-language text collections. The aim of the research is to analyze the cluster structure of the data and the semantic proximity of words within clusters and subclusters. The working hypothesis is that words with similar meanings occur in approximately the same contexts. In the feature space, such words should therefore lie relatively close to each other, while words with different meanings should lie farther apart. The study uses elastic maps, which are effective tools for the visual analysis of multidimensional data. Constructing elastic maps and their extensions in the space of the first three principal components makes it possible to determine the cluster structure of the multidimensional data under study. The cluster structure of the considered data set is analyzed, and the influence of transposing the initial data array is examined.
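The hypothesis and the role of transposition can be illustrated with a minimal sketch. It is not the authors' method: it does not build elastic maps or compute principal components, and only compares Euclidean distances between relative-frequency profiles. The words, the counts, and all function names below are invented for illustration, and the row normalization is an assumption rather than a step described in the abstract.

```python
import math

def relative_frequencies(row):
    """Scale a row of counts to sum to 1 (an assumed normalization step)."""
    total = sum(row)
    return [v / total for v in row] if total else [0.0] * len(row)

def euclidean(a, b):
    """Euclidean distance between two equally long vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def transpose(matrix):
    """Swap rows and columns, so that the columns become the objects."""
    return [list(col) for col in zip(*matrix)]

def pairwise_distances(matrix):
    """Distances between the normalized rows of a co-occurrence matrix."""
    profiles = [relative_frequencies(r) for r in matrix]
    n = len(profiles)
    return [[euclidean(profiles[i], profiles[j]) for j in range(n)]
            for i in range(n)]

# Hypothetical toy data: rows are nouns, columns are adjectives,
# and each cell is an invented joint-use count.
nouns = ["house", "building", "river", "stream"]
adjectives = ["tall", "old", "brick", "deep", "fast", "cold"]
counts = [
    [8, 9, 7, 0, 0, 1],
    [9, 7, 8, 0, 0, 1],
    [0, 2, 0, 9, 7, 8],
    [0, 1, 0, 6, 9, 7],
]

# Nouns as objects, adjectives as features.
noun_dist = pairwise_distances(counts)

# After transposition the adjectives are the objects and the nouns the features.
adj_dist = pairwise_distances(transpose(counts))

print("house-building:", noun_dist[0][1])
print("house-river:   ", noun_dist[0][2])
print("tall-brick:    ", adj_dist[0][2])
print("tall-deep:     ", adj_dist[0][3])
```

In this toy example, nouns that share adjectives end up closer to each other than nouns that do not, and transposing the array gives the same kind of picture for adjectives. On real data, a dimensionality-reduction step such as projection onto the leading principal components would precede any visual inspection.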