{"title":"Induction of a sentiment dictionary for financial analyst communication: a data-driven approach balancing machine learning and human intuition","authors":"Matthias Palmer, J. Roeder, Jan Muntermann","doi":"10.1080/2573234X.2021.1955022","DOIUrl":null,"url":null,"abstract":"ABSTRACT While sentiment dictionaries are easy to apply and provide reproducible results, they often exhibit inferior classification performance compared to machine learning approaches trained for specific application domains. Nevertheless, both approaches typically require manual data analysis. This paper develops a domain-specific dictionary using regularised linear models drawing from textual reports of financial analysts. The first evaluation step demonstrates that the developed financial analyst dictionary can explain cumulative abnormal stock returns related to earnings events more accurately compared to other finance-related dictionaries and sentiment classifiers. In a second step, the approaches are compared using manually annotated sentiment. The financial analyst dictionary is more accurate than other dictionary-based approaches, although it cannot compete with a pre-trained deep learning sentiment classifier. While we show that the proposed approach is suited for texts of financial analysts, it can be applied to other use cases. The approach realises context specificity while reducing extensive manual data analysis.","PeriodicalId":36417,"journal":{"name":"Journal of Business Analytics","volume":null,"pages":null},"PeriodicalIF":1.7000,"publicationDate":"2021-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Business Analytics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/2573234X.2021.1955022","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 5
Abstract
ABSTRACT While sentiment dictionaries are easy to apply and provide reproducible results, they often exhibit inferior classification performance compared to machine learning approaches trained for specific application domains. Nevertheless, both approaches typically require manual data analysis. This paper develops a domain-specific dictionary using regularised linear models drawing from textual reports of financial analysts. The first evaluation step demonstrates that the developed financial analyst dictionary can explain cumulative abnormal stock returns related to earnings events more accurately compared to other finance-related dictionaries and sentiment classifiers. In a second step, the approaches are compared using manually annotated sentiment. The financial analyst dictionary is more accurate than other dictionary-based approaches, although it cannot compete with a pre-trained deep learning sentiment classifier. While we show that the proposed approach is suited for texts of financial analysts, it can be applied to other use cases. The approach realises context specificity while reducing extensive manual data analysis.