The concept Muslim: discourses in Bosnian, Croatian, German and Serbian webcorpora

IF 0.2 0 LANGUAGE & LINGUISTICS
Phillip Wasserscheidt
{"title":"The concept Muslim: discourses in Bosnian, Croatian, German and Serbian webcorpora","authors":"Phillip Wasserscheidt","doi":"10.22210/suvlin.2022.094.06","DOIUrl":null,"url":null,"abstract":"The aim of this contribution is to examine the meaning of the word musliman or German Muslim and to answer the question whether the meaning of this word in a dictionary can be mapped by means of classical distributional analysis, and whether further meanings can be captured by this method. In addition, a simple method for a distributional semantic analysis is presented. The basis of the investigation is the assumption common in corpus–based semantics and lexicography that the meaning of words can be interpreted from the context and thus polysemy and synonymy can be recognized. The article examines four web corpora for Bosnian, Croatian, Serbian and German. First, the context is analysed, assuming that the syntactic structures in the context are themselves inherently meaningful. Specifically, adjective attributes are used to analyse properties of the keyword, coordination partners as indicators for relevant categories, verbs with the keyword as subject and accusative object as indicators for typical actions the keyword performs or is target of. In a second step, the lexemes that occur in these four contexts are clustered and the semantic dimensions of the keyword are determined. The results suggest that the distributional method can in principle map dictionary definitions well. However, this is mainly true for contexts which involve attributes and coordination but not for action–related syntactic configurations. Overall, the most salient semantic dimensions of the lexeme are those pertaining to members of a religious community, members of a regional ethnic group, and experienced or induced aggression. Here, the four corpora differ considerably: while German usage almost exclusively focuses on religion–related aspects, the other corpora also contain ethnic/regional components, as well as the discourse of aggression. This aggression is mainly verbal in German, but physical in the other corpora. Neutral meaning components for the word musliman are salient only in the Bosnian corpus.","PeriodicalId":40950,"journal":{"name":"Suvremena Lingvistika","volume":" ","pages":""},"PeriodicalIF":0.2000,"publicationDate":"2022-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Suvremena Lingvistika","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.22210/suvlin.2022.094.06","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"LANGUAGE & LINGUISTICS","Score":null,"Total":0}
引用次数: 0

Abstract

The aim of this contribution is to examine the meaning of the word musliman or German Muslim and to answer the question whether the meaning of this word in a dictionary can be mapped by means of classical distributional analysis, and whether further meanings can be captured by this method. In addition, a simple method for a distributional semantic analysis is presented. The basis of the investigation is the assumption common in corpus–based semantics and lexicography that the meaning of words can be interpreted from the context and thus polysemy and synonymy can be recognized. The article examines four web corpora for Bosnian, Croatian, Serbian and German. First, the context is analysed, assuming that the syntactic structures in the context are themselves inherently meaningful. Specifically, adjective attributes are used to analyse properties of the keyword, coordination partners as indicators for relevant categories, verbs with the keyword as subject and accusative object as indicators for typical actions the keyword performs or is target of. In a second step, the lexemes that occur in these four contexts are clustered and the semantic dimensions of the keyword are determined. The results suggest that the distributional method can in principle map dictionary definitions well. However, this is mainly true for contexts which involve attributes and coordination but not for action–related syntactic configurations. Overall, the most salient semantic dimensions of the lexeme are those pertaining to members of a religious community, members of a regional ethnic group, and experienced or induced aggression. Here, the four corpora differ considerably: while German usage almost exclusively focuses on religion–related aspects, the other corpora also contain ethnic/regional components, as well as the discourse of aggression. This aggression is mainly verbal in German, but physical in the other corpora. Neutral meaning components for the word musliman are salient only in the Bosnian corpus.
穆斯林概念:波斯尼亚语、克罗地亚语、德语和塞尔维亚语网络语料库中的话语
这篇文章的目的是研究“穆斯林”或“德国穆斯林”这个词的含义,并回答这个词在字典中的含义是否可以通过经典分布分析来映射,以及这种方法是否可以捕获进一步的含义。此外,还提出了一种简单的分布式语义分析方法。研究的基础是基于语料库的语义学和词典学中常见的假设,即可以根据上下文解释单词的意思,从而识别多义词和同义词。本文考察了波斯尼亚语、克罗地亚语、塞尔维亚语和德语的四种网络语料库。首先,分析语境,假设语境中的句法结构本身具有内在意义。具体而言,形容词属性用于分析关键词的属性,配合伙伴作为相关类别的指示,以关键词为主语的动词和宾格宾语作为关键字执行或目标的典型动作的指示。在第二步中,对这四个上下文中出现的词汇进行聚类,并确定关键字的语义维度。结果表明,分布式方法原则上可以很好地映射字典定义。然而,这主要适用于涉及属性和协调的上下文,而不适用于与动作相关的语法配置。总的来说,词位最显著的语义维度是那些与宗教团体成员、区域种族群体成员和经历或诱发侵略有关的语义维度。在这里,这四个语料库有很大的不同:德语的用法几乎完全集中在与宗教有关的方面,而其他语料库也包含种族/地区成分,以及侵略话语。这种攻击在德语中主要是口头的,但在其他语料库中是肢体的。“穆斯林”一词的中性词成分只在波斯尼亚语料库中突出。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Suvremena Lingvistika
Suvremena Lingvistika LANGUAGE & LINGUISTICS-
CiteScore
0.30
自引率
0.00%
发文量
8
审稿时长
17 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信