Sanjiv R. Das , Michele Donini , Muhammad Bilal Zafar , John He , Krishnaram Kenthapadi
{"title":"FinLex:有效地使用词嵌入来生成金融词汇","authors":"Sanjiv R. Das , Michele Donini , Muhammad Bilal Zafar , John He , Krishnaram Kenthapadi","doi":"10.1016/j.jfds.2021.10.001","DOIUrl":null,"url":null,"abstract":"<div><p>We present a simple and effective methodology for the generation of lexicons (word lists) that may be used in natural language scoring applications. In particular, in the finance industry, word lists have become ubiquitous for sentiment scoring. These have been derived from dictionaries such as the Harvard Inquirer and require manual curation. Here, we present an automated approach to the curation of lexicons, which makes automatic preparation of any word list immediate. We show that our automated word lists deliver comparable performance to traditional lexicons on machine learning classification tasks. This new approach will enable finance academics and practitioners to create and deploy new word lists in addition to the few traditional ones in a facile manner.</p></div>","PeriodicalId":36340,"journal":{"name":"Journal of Finance and Data Science","volume":"8 ","pages":"Pages 1-11"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2405918821000131/pdfft?md5=f870155829ce1c2b61a45c753663ba75&pid=1-s2.0-S2405918821000131-main.pdf","citationCount":"6","resultStr":"{\"title\":\"FinLex: An effective use of word embeddings for financial lexicon generation\",\"authors\":\"Sanjiv R. Das , Michele Donini , Muhammad Bilal Zafar , John He , Krishnaram Kenthapadi\",\"doi\":\"10.1016/j.jfds.2021.10.001\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>We present a simple and effective methodology for the generation of lexicons (word lists) that may be used in natural language scoring applications. In particular, in the finance industry, word lists have become ubiquitous for sentiment scoring. These have been derived from dictionaries such as the Harvard Inquirer and require manual curation. Here, we present an automated approach to the curation of lexicons, which makes automatic preparation of any word list immediate. We show that our automated word lists deliver comparable performance to traditional lexicons on machine learning classification tasks. This new approach will enable finance academics and practitioners to create and deploy new word lists in addition to the few traditional ones in a facile manner.</p></div>\",\"PeriodicalId\":36340,\"journal\":{\"name\":\"Journal of Finance and Data Science\",\"volume\":\"8 \",\"pages\":\"Pages 1-11\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S2405918821000131/pdfft?md5=f870155829ce1c2b61a45c753663ba75&pid=1-s2.0-S2405918821000131-main.pdf\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Finance and Data Science\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2405918821000131\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"Mathematics\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Finance and Data Science","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2405918821000131","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Mathematics","Score":null,"Total":0}
FinLex: An effective use of word embeddings for financial lexicon generation
We present a simple and effective methodology for the generation of lexicons (word lists) that may be used in natural language scoring applications. In particular, in the finance industry, word lists have become ubiquitous for sentiment scoring. These have been derived from dictionaries such as the Harvard Inquirer and require manual curation. Here, we present an automated approach to the curation of lexicons, which makes automatic preparation of any word list immediate. We show that our automated word lists deliver comparable performance to traditional lexicons on machine learning classification tasks. This new approach will enable finance academics and practitioners to create and deploy new word lists in addition to the few traditional ones in a facile manner.