Shuwei Wang, Ruifeng Xu, Bin Liu, Lin Gui, Yu Zhou
{"title":"Financial named entity recognition based on conditional random fields and information entropy","authors":"Shuwei Wang, Ruifeng Xu, Bin Liu, Lin Gui, Yu Zhou","doi":"10.1109/ICMLC.2014.7009718","DOIUrl":null,"url":null,"abstract":"Named entity recognition plays an important role in many natural language processing tasks, such as relation detection and information extraction. This paper presents a novel method to recognize named entities in financial news texts in three steps. First, the domain dictionary is applied to recognize stock names. Second, the full form FNEs are identified by incorporating internal features in a classifier based on Conditional Random Fields. Third, the mutual information, boundary entropy and context features are employed to recognize the abbreviation FNE candidates. The experiments completed on a Chinese financial dataset show that the proposed approach achieves 91.02% precision and 92.77% recall.","PeriodicalId":335296,"journal":{"name":"2014 International Conference on Machine Learning and Cybernetics","volume":"38 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-07-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"17","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 International Conference on Machine Learning and Cybernetics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMLC.2014.7009718","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 17
Abstract
Named entity recognition plays an important role in many natural language processing tasks, such as relation detection and information extraction. This paper presents a novel method to recognize named entities in financial news texts in three steps. First, the domain dictionary is applied to recognize stock names. Second, the full form FNEs are identified by incorporating internal features in a classifier based on Conditional Random Fields. Third, the mutual information, boundary entropy and context features are employed to recognize the abbreviation FNE candidates. The experiments completed on a Chinese financial dataset show that the proposed approach achieves 91.02% precision and 92.77% recall.