Four best practices for measuring news sentiment using ‘off-the-shelf’ dictionaries: a large-scale p-hacking experiment

Computational Communication Research Pub Date : 2020-10-07 DOI:10.31235/osf.io/np5wa

Chung-hong Chan, Joseph W. Bajjalieh, L. Auvil, Hartmut Wessler, Scott L. Althaus, Kasper Welbers, Wouter van Atteveldt, Marc Jungblut

引用次数: 18

Abstract

We examined the validity of 37 sentiment scores based on dictionary-based methods using a large news corpus and demonstrated the risk of generating a spectrum of results with different levels of statistical significance by presenting an analysis of relationships between news sentiment and U.S. presidential approval. We summarize our findings into four best practices: 1) use a suitable sentiment dictionary; 2) do not assume that the validity and reliability of the dictionary is ‘built-in’; 3) check for the influence of content length and 4) do not use multiple dictionaries to test the same statistical hypothesis.

查看原文本刊更多论文

使用“现成”词典衡量新闻情绪的四个最佳实践:大规模p-hacking实验

我们使用一个大型新闻语料库，基于基于词典的方法检验了37种情绪得分的有效性，并通过分析新闻情绪与美国总统支持率之间的关系，展示了产生具有不同统计显著性水平的结果谱的风险。我们将研究结果总结为四个最佳实践:1)使用合适的情感词典;2)不要认为字典的有效性和可靠性是“内置的”;3)检查内容长度的影响，4)不要使用多个字典来检验相同的统计假设。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Computational Communication Research

自引率

0.00%

发文量