Chung-hong Chan, Joseph W. Bajjalieh, L. Auvil, Hartmut Wessler, Scott L. Althaus, Kasper Welbers, Wouter van Atteveldt, Marc Jungblut
{"title":"Four best practices for measuring news sentiment using ‘off-the-shelf’ dictionaries: a large-scale p-hacking experiment","authors":"Chung-hong Chan, Joseph W. Bajjalieh, L. Auvil, Hartmut Wessler, Scott L. Althaus, Kasper Welbers, Wouter van Atteveldt, Marc Jungblut","doi":"10.31235/osf.io/np5wa","DOIUrl":null,"url":null,"abstract":"We examined the validity of 37 sentiment scores based on dictionary-based methods using a large news corpus and demonstrated the risk of generating a spectrum of results with different levels of statistical significance by presenting an analysis of relationships between news sentiment and U.S. presidential approval. We summarize our findings into four best practices: 1) use a suitable sentiment dictionary; 2) do not assume that the validity and reliability of the dictionary is ‘built-in’; 3) check for the influence of content length and 4) do not use multiple dictionaries to test the same statistical hypothesis.","PeriodicalId":275035,"journal":{"name":"Computational Communication Research","volume":"14 36 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"18","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computational Communication Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.31235/osf.io/np5wa","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 18
Abstract
We examined the validity of 37 sentiment scores based on dictionary-based methods using a large news corpus and demonstrated the risk of generating a spectrum of results with different levels of statistical significance by presenting an analysis of relationships between news sentiment and U.S. presidential approval. We summarize our findings into four best practices: 1) use a suitable sentiment dictionary; 2) do not assume that the validity and reliability of the dictionary is ‘built-in’; 3) check for the influence of content length and 4) do not use multiple dictionaries to test the same statistical hypothesis.