Keywords in Terms of Frequency and Thematic Relevance

Natalia P. Galkina
Journal: Vestnik of Kostroma State University
Publication date: 2023-02-28
Publication type: Journal Article
DOI: https://doi.org/10.34216/1998-0817-2022-28-3-180-185
Citations: 0

Abstract

The article examines the role of keywords and their statistical data in determining thematic dominance across large collections of texts. The description draws on a typological linguistic study of military song texts from 1939-1945 in English and Russian. Keywords were selected on the basis of semantic, lexical-syntactic, and morphological analysis, taking their frequency of use into account. Frequency alone is not always the defining feature that marks a word as a keyword. Within a single text, keywords may be words that help the reader grasp the sense, uncover its deeper meaning, and remember the content. When many texts are combined by authorship, chronology, theme, style, or some other relation, keyword frequency matters and can serve as a determining factor, that is, a classification criterion. The paper shows that the thematic distribution of texts obtained by semantic analysis of their content corresponds to the results of statistical keyword analysis and is confirmed by machine-computed frequency counts. The results hold for both Russian-language and English-language materials.
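The idea of using keyword frequency as a classification criterion over a collection of texts can be illustrated with a minimal sketch. This is not the author's procedure: the theme lexicons, the sample sentences, and the function names below are all hypothetical and chosen only for illustration, and the sketch counts surface word forms without the semantic, lexical-syntactic, or morphological analysis the abstract describes.

```python
import re
from collections import Counter

# Hypothetical theme lexicons; the words are illustrative and not taken from the study.
THEME_KEYWORDS = {
    "home": {"home", "mother", "letter"},
    "combat": {"battle", "soldier", "march"},
}


def tokenize(text):
    """Lowercase the text and split it into word tokens (Unicode-aware)."""
    return re.findall(r"\w+", text.lower())


def keyword_frequencies(texts):
    """Count how often each lexicon keyword occurs across all texts."""
    all_keywords = set().union(*THEME_KEYWORDS.values())
    counts = Counter()
    for text in texts:
        counts.update(tok for tok in tokenize(text) if tok in all_keywords)
    return counts


def dominant_theme(texts):
    """Return the theme with the highest summed keyword frequency, plus all totals."""
    counts = keyword_frequencies(texts)
    totals = {
        theme: sum(counts[word] for word in words)
        for theme, words in THEME_KEYWORDS.items()
    }
    return max(totals, key=totals.get), totals


# Toy input invented for this sketch; it is not data from the study.
sample = [
    "The soldier wrote a letter home to his mother.",
    "A letter from home reached the march at dawn.",
]
theme, totals = dominant_theme(sample)
print(theme, totals)
```

Because the tokenizer is Unicode-aware, the same counting would run on Russian text, although an inflected language would likely need lemmatization before the counts become meaningful.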