{"title":"Automatic Extraction of Value-Semantic Components of the Cultural Codes from a Corpus of Readers’ Reviews","authors":"L. A. Mosunova, E. V. Mityagina, P. V. Ananin","doi":"10.3103/S0005105522030037","DOIUrl":null,"url":null,"abstract":"<p>The problem of the automatic analysis of large corpora of humanitarian texts is investigated. Book reviews are considered as a space of cultural code containing information on the value dominants in the consciousness of the modern reader. The database Reviews of Works of Fiction Containing Information about the Cultural Code has been created, which differs from analogues in its formation procedure and subject matter. The process of automatic search is described, and the results of an experiment on extracting the value-semantic components of the cultural code from the corpus of reader reviews (8278 texts) are presented. A conclusion is reached about the effectiveness of the developed methodology based on statistical methods and text analysis programs. The automatic analysis of the texts of reviews confirmed the ideas of the researchers regarding changes in the cultural code under the conditions of globalization: with its certain stability, some traditional values are replaced by values from other cultures.</p>","PeriodicalId":42995,"journal":{"name":"AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS","volume":null,"pages":null},"PeriodicalIF":0.5000,"publicationDate":"2022-08-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS","FirstCategoryId":"1085","ListUrlMain":"https://link.springer.com/article/10.3103/S0005105522030037","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 1
Abstract
The problem of the automatic analysis of large corpora of humanitarian texts is investigated. Book reviews are considered as a space of cultural code containing information on the value dominants in the consciousness of the modern reader. The database Reviews of Works of Fiction Containing Information about the Cultural Code has been created, which differs from analogues in its formation procedure and subject matter. The process of automatic search is described, and the results of an experiment on extracting the value-semantic components of the cultural code from the corpus of reader reviews (8278 texts) are presented. A conclusion is reached about the effectiveness of the developed methodology based on statistical methods and text analysis programs. The automatic analysis of the texts of reviews confirmed the ideas of the researchers regarding changes in the cultural code under the conditions of globalization: with its certain stability, some traditional values are replaced by values from other cultures.
期刊介绍:
Automatic Documentation and Mathematical Linguistics is an international peer reviewed journal that covers all aspects of automation of information processes and systems, as well as algorithms and methods for automatic language analysis. Emphasis is on the practical applications of new technologies and techniques for information analysis and processing.