MULTILINGUAL SENTIMENT NORMALIZATION FOR SCANDINAVIAN LANGUAGES

Scandinavian Studies in Language Pub Date : 2021-12-31 DOI:10.7146/sss.v12i1.130068

R. Baglini, Lasse Hansen, K. Enevoldsen, K. Nielbo

引用次数: 1

Abstract

In this paper, we address the challenge of multilingual sentiment analysis using a traditional lexicon and rule-based sentiment instrument that is tailored to capture sentiment patterns in a particular language. Focusing on a case study of three closely related Scandinavian languages (Danish, Norwegian, and Swedish) and using three tailored versions of VADER, we measure the relative degree of variation in valence using the OPUS corpus. We found that scores for Swedish are systematically skewed lower than Danish for translational pairs, and that scores for Norwegian are skewed higher for both other languages. We use a neural network to optimize the fit between Norwegian and Swedish respectively and Danish as the reference (target) language.

查看原文本刊更多论文

斯堪的纳维亚语言的多语言情感规范化

在本文中，我们使用传统的词典和基于规则的情感工具来解决多语言情感分析的挑战，该工具专门用于捕获特定语言中的情感模式。通过对三种密切相关的斯堪的纳维亚语言(丹麦语、挪威语和瑞典语)的案例研究，并使用三种量身定制的VADER版本，我们使用OPUS语料库测量了语价变化的相对程度。我们发现，在翻译对中，瑞典语的分数系统性地比丹麦语低，而挪威语的分数在其他两种语言中都更高。我们使用神经网络分别优化挪威语和瑞典语与丹麦语作为参考(目标)语言之间的拟合。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Scandinavian Studies in Language

自引率

0.00%

发文量