{"title":"“Joy” and “Fear” in Thomas Bernhard’s autobiographies: Aspects of a Computational Sentiment Analysis","authors":"M. Sellner","doi":"10.1553/SENTIMENT_ANALYSISS1","DOIUrl":null,"url":null,"abstract":"This pilot-study of a computational analysis of literary texts presents the results of aspects of a “sentiment analysis”. The data of analysis are the autobiographies of the Austrian novelist Thomas Bernhard. The primary object of attention are the sentiments “joy” and “fear”. We elaborate on and demonstrate the impact of several preprocessing procedures, describe the characteristics of the dictionary and the annotations of its entries conceived and used for analysis. We specify the general methodology and the steps involved for quantifying of its result by the use of the functions of the R-package “Quanteda”. The descriptive output of the procedures is examined with several statistical measures to compare the counts of “joy” vs “fear” that were found in the texts individually, contrastively and in combination as a corpus. We conclude that there is a proportional and relative difference between the frequencies of the sentiments of the individual texts, but that this observation is insignificant if interpreted on the basis of the non-parametric Wilcoxon rank-sum test. A “goodness of fit” test, on the other hand, shows that the two sentiments show a homogeneous distribution across the corpus","PeriodicalId":210552,"journal":{"name":"Digital Lexis and Beyond","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digital Lexis and Beyond","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1553/SENTIMENT_ANALYSISS1","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This pilot-study of a computational analysis of literary texts presents the results of aspects of a “sentiment analysis”. The data of analysis are the autobiographies of the Austrian novelist Thomas Bernhard. The primary object of attention are the sentiments “joy” and “fear”. We elaborate on and demonstrate the impact of several preprocessing procedures, describe the characteristics of the dictionary and the annotations of its entries conceived and used for analysis. We specify the general methodology and the steps involved for quantifying of its result by the use of the functions of the R-package “Quanteda”. The descriptive output of the procedures is examined with several statistical measures to compare the counts of “joy” vs “fear” that were found in the texts individually, contrastively and in combination as a corpus. We conclude that there is a proportional and relative difference between the frequencies of the sentiments of the individual texts, but that this observation is insignificant if interpreted on the basis of the non-parametric Wilcoxon rank-sum test. A “goodness of fit” test, on the other hand, shows that the two sentiments show a homogeneous distribution across the corpus