A Review and Cluster Analysis of German Polarity Resources for Sentiment Analysis

B. Kern, Andreas Baumann, T. Kolb, Katharina Sekanina, Klaus Hofmann, Tanja Wissik, J. Neidhardt
{"title":"A Review and Cluster Analysis of German Polarity Resources for Sentiment Analysis","authors":"B. Kern, Andreas Baumann, T. Kolb, Katharina Sekanina, Klaus Hofmann, Tanja Wissik, J. Neidhardt","doi":"10.4230/OASIcs.LDK.2021.37","DOIUrl":null,"url":null,"abstract":"The domain of German polarity dictionaries is heterogeneous with many small dictionaries created for different purposes and using different methods. This paper aims to map out the landscape of freely available German polarity dictionaries by clustering them to uncover similarities and shared features. We find that, although most dictionaries seem to agree in their assessment of a word’s sentiment, subsets of them form groups of interrelated dictionaries. These dependencies are in most cases an immediate reflex of how these dictionaries were designed and compiled. As a consequence, we argue that sentiment evaluation should be based on multiple and diverse sentiment resources in order to avoid error propagation and amplification of potential biases. 2012 ACM Subject Classification Computing methodologies → Cluster analysis","PeriodicalId":377119,"journal":{"name":"International Conference on Language, Data, and Knowledge","volume":"290 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Language, Data, and Knowledge","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4230/OASIcs.LDK.2021.37","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

The domain of German polarity dictionaries is heterogeneous with many small dictionaries created for different purposes and using different methods. This paper aims to map out the landscape of freely available German polarity dictionaries by clustering them to uncover similarities and shared features. We find that, although most dictionaries seem to agree in their assessment of a word’s sentiment, subsets of them form groups of interrelated dictionaries. These dependencies are in most cases an immediate reflex of how these dictionaries were designed and compiled. As a consequence, we argue that sentiment evaluation should be based on multiple and diverse sentiment resources in order to avoid error propagation and amplification of potential biases. 2012 ACM Subject Classification Computing methodologies → Cluster analysis
面向情感分析的德语极性资源综述与聚类分析
德语极性词典的领域是异构的,有许多为不同目的和使用不同方法创建的小词典。本文旨在通过聚类来揭示德语极性词典的相似之处和共同特征,从而绘制出免费提供的德语极性词典的版图。我们发现,虽然大多数词典对一个词的情感评价似乎是一致的,但它们的子集形成了相互关联的词典组。在大多数情况下,这些依赖关系直接反映了这些字典的设计和编译方式。因此,我们认为情感评估应该基于多个不同的情感资源,以避免错误传播和潜在偏见的放大。2012 ACM学科分类计算方法→聚类分析
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信