{"title":"语言与领域专用性:汉语金融情绪词典","authors":"Zijia Du, A. Huang, Russ Wermers, Wenfeng Wu","doi":"10.2139/ssrn.3759258","DOIUrl":null,"url":null,"abstract":"We use supervised machine learning to develop a financial sentiment dictionary from 3.1 million Chinese-language financial news articles. Our dictionary maps semantically similar words to a subset of human-expert generated financial sentiment words. In validation tests, our dictionary scores the sentiment of articles consistently with human reading of full articles. In return association tests, our dictionary outperforms and subsumes previous Chinese financial sentiment dictionaries, such as direct translations of Loughran and McDonald’s (2011) English-language financial dictionary. We also generate a list of politically-related positive words that is unique to China; we find that this list has a weaker association with returns than does the list of other positive words. We demonstrate that state media uses more politically-related positive and fewer negative words, and exhibits a sentiment bias. This bias renders the state media’s sentiment as less return-informative. Our findings demonstrate that dictionary-based sentiment analysis exhibits strong language and domain specificity.","PeriodicalId":260048,"journal":{"name":"Capital Markets: Market Efficiency eJournal","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Language and Domain Specificity: A Chinese Financial Sentiment Dictionary\",\"authors\":\"Zijia Du, A. Huang, Russ Wermers, Wenfeng Wu\",\"doi\":\"10.2139/ssrn.3759258\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We use supervised machine learning to develop a financial sentiment dictionary from 3.1 million Chinese-language financial news articles. Our dictionary maps semantically similar words to a subset of human-expert generated financial sentiment words. In validation tests, our dictionary scores the sentiment of articles consistently with human reading of full articles. In return association tests, our dictionary outperforms and subsumes previous Chinese financial sentiment dictionaries, such as direct translations of Loughran and McDonald’s (2011) English-language financial dictionary. We also generate a list of politically-related positive words that is unique to China; we find that this list has a weaker association with returns than does the list of other positive words. We demonstrate that state media uses more politically-related positive and fewer negative words, and exhibits a sentiment bias. This bias renders the state media’s sentiment as less return-informative. Our findings demonstrate that dictionary-based sentiment analysis exhibits strong language and domain specificity.\",\"PeriodicalId\":260048,\"journal\":{\"name\":\"Capital Markets: Market Efficiency eJournal\",\"volume\":\"5 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-09-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Capital Markets: Market Efficiency eJournal\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2139/ssrn.3759258\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Capital Markets: Market Efficiency eJournal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2139/ssrn.3759258","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Language and Domain Specificity: A Chinese Financial Sentiment Dictionary
We use supervised machine learning to develop a financial sentiment dictionary from 3.1 million Chinese-language financial news articles. Our dictionary maps semantically similar words to a subset of human-expert generated financial sentiment words. In validation tests, our dictionary scores the sentiment of articles consistently with human reading of full articles. In return association tests, our dictionary outperforms and subsumes previous Chinese financial sentiment dictionaries, such as direct translations of Loughran and McDonald’s (2011) English-language financial dictionary. We also generate a list of politically-related positive words that is unique to China; we find that this list has a weaker association with returns than does the list of other positive words. We demonstrate that state media uses more politically-related positive and fewer negative words, and exhibits a sentiment bias. This bias renders the state media’s sentiment as less return-informative. Our findings demonstrate that dictionary-based sentiment analysis exhibits strong language and domain specificity.