Digital Lexis and Beyond最新文献

Digitising a corpus of Austrian dialect recordings from the 20th century 数字化20世纪奥地利方言录音的语料库

Digital Lexis and Beyond Pub Date : 1900-01-01 DOI: 10.1553/oe_phonogrammarchiv

Christian Huber, Benjamin Fischer

引用次数: 0

Komparative Zeitreihenanalyse der lexikalischen Stabilität und Emotion in österreichischen Korpusdaten 比较了奥地利体重数据中的词汇稳定性和情感的历态分析

Digital Lexis and Beyond Pub Date : 1900-01-01 DOI: 10.1553/austrian_corpora

引用次数: 2

“Joy” and “Fear” in Thomas Bernhard’s autobiographies: Aspects of a Computational Sentiment Analysis 托马斯·伯恩哈德自传中的“喜悦”和“恐惧”:计算情感分析的各个方面

Digital Lexis and Beyond Pub Date : 1900-01-01 DOI: 10.1553/SENTIMENT_ANALYSISS1

M. Sellner

{"title":"“Joy” and “Fear” in Thomas Bernhard’s autobiographies: Aspects of a Computational Sentiment Analysis","authors":"M. Sellner","doi":"10.1553/SENTIMENT_ANALYSISS1","DOIUrl":"https://doi.org/10.1553/SENTIMENT_ANALYSISS1","url":null,"abstract":"This pilot-study of a computational analysis of literary texts presents the results of aspects of a “sentiment analysis”. The data of analysis are the autobiographies of the Austrian novelist Thomas Bernhard. The primary object of attention are the sentiments “joy” and “fear”. We elaborate on and demonstrate the impact of several preprocessing procedures, describe the characteristics of the dictionary and the annotations of its entries conceived and used for analysis. We specify the general methodology and the steps involved for quantifying of its result by the use of the functions of the R-package “Quanteda”. The descriptive output of the procedures is examined with several statistical measures to compare the counts of “joy” vs “fear” that were found in the texts individually, contrastively and in combination as a corpus. We conclude that there is a proportional and relative difference between the frequencies of the sentiments of the individual texts, but that this observation is insignificant if interpreted on the basis of the non-parametric Wilcoxon rank-sum test. A “goodness of fit” test, on the other hand, shows that the two sentiments show a homogeneous distribution across the corpus","PeriodicalId":210552,"journal":{"name":"Digital Lexis and Beyond","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128090035","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Investigating the linguistic representativeness of Early Modern Greek Corpora 考察早期现代希腊语料库的语言代表性

Digital Lexis and Beyond Pub Date : 1900-01-01 DOI: 10.1553/EMODERN_GREEKS1

E. Karantzola, Yannis Kostopoulos, K. Sampanis

{"title":"Investigating the linguistic representativeness of Early Modern Greek Corpora","authors":"E. Karantzola, Yannis Kostopoulos, K. Sampanis","doi":"10.1553/EMODERN_GREEKS1","DOIUrl":"https://doi.org/10.1553/EMODERN_GREEKS1","url":null,"abstract":"Following a poorly documented period in the history of vernacular Greek (6th-12th c.), the late 15th century sets the beginning of a linguistic era characterized by a quantitatively and qualitatively incomparable production of prose texts written in “common” language. It is at this point that classicizing Greek stops dominating in writing, and a new linguistic variety – albeit a very diverse and fluid one – Early Modern Greek (EMG) starts growing rapidly as a literacy language. The development of this new variety is manifested in its widespread use as literary language (in texts with aesthetic function), as well as in its use as a simple scripta, namely a written vernacular for legal, administrative, commercial, and other functions. Despite its significance in the history of Greek, this period remains to a large extent unexplored and underrepresented in Greek language corpora. On this view, our understanding of EMG depends crucially on the representativeness of the few available corpora. The aim of this paper is to investigate the linguistic representativeness of EMG corpora, and to explore possible associations between observed linguistic patterns and corpora design. Focusing on the distribution of contrastive and reformulation markers, our study reveals that the linguistic data illustrated in the available EMG corpora are divergent and largely dependent on the representation of variables, such as text form (poetry/prose), period, geographical region, and genre","PeriodicalId":210552,"journal":{"name":"Digital Lexis and Beyond","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128881510","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Exploring Etymology and Language Contact Through Digital Lexicographical Encoding: The Dictionary of Loanwords in the Midrash Genesis Rabbah (DLGenR) 通过数字词典编码探索词源和语言接触:米德拉什《创世纪·拉巴》外来词词典(DLGenR)

Digital Lexis and Beyond Pub Date : 1900-01-01 DOI: 10.1553/DLGENR_LOANWORDSS1

Christina Katsikadeli, V. Slepoy, Thomas Klampfl

引用次数: 0

(Dis)continuities in the diachrony of the Greek lexicon: The learned component in the light of a corpus analysis (二)希腊词汇历时上的连续性:语料库分析下的学习成分

Digital Lexis and Beyond Pub Date : 1900-01-01 DOI: 10.1553/greek_lexicon

引用次数: 0

VICAV 3.0: Zooming in on Lexical Resources VICAV 3.0:放大词汇资源

Digital Lexis and Beyond Pub Date : 1900-01-01 DOI: 10.1553/vicav

Karlheinz Moerth, Daniel Schopper

引用次数: 0