{"title":"Attention-Based Bidirectional Long Short-Term Memory Networks for Chinese Named Entity Recognition","authors":"Chaoyi Huang, Youguang Chen, Qi Liang","doi":"10.1145/3340997.3341002","DOIUrl":null,"url":null,"abstract":"Named entity recognition is an important task in natural language processing and has been carefully studied in recent decades. In this paper, we investigate the problem of Chinese named entity recognition. Using attention mechanisms based on BiLSTM-CRF model, a model is proposed in this paper, which makes better use of word-based and character-based information. All the potential words that match the input characters and sentences with the dictionary are encoded, and one attention layer to control the dynamic acquisition of multiple potential characters in different paths from sequence information. A series of input characters and all potential words matched with dictionaries in sentences are encoded to measure the correlation scores between candidate characters and potential words. Another attention layer is to produce a weight vector and merge word-level features from each time step into a sentence-level feature vector by multiplying the weight vector. Then, CRF model is introduced to get the final tagging to obtain the desired result. The experimental data shows that the F1-score of our model has increased from 73.88% to 75.10% on the OntoNote 4 dataset, and from 93.18% to 94.17% on the MSRA dataset. The results show that our method has a better performance than the previous model.","PeriodicalId":409906,"journal":{"name":"Proceedings of the 2019 4th International Conference on Machine Learning Technologies","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2019 4th International Conference on Machine Learning Technologies","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3340997.3341002","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
Named entity recognition is an important task in natural language processing and has been carefully studied in recent decades. In this paper, we investigate the problem of Chinese named entity recognition. Using attention mechanisms based on BiLSTM-CRF model, a model is proposed in this paper, which makes better use of word-based and character-based information. All the potential words that match the input characters and sentences with the dictionary are encoded, and one attention layer to control the dynamic acquisition of multiple potential characters in different paths from sequence information. A series of input characters and all potential words matched with dictionaries in sentences are encoded to measure the correlation scores between candidate characters and potential words. Another attention layer is to produce a weight vector and merge word-level features from each time step into a sentence-level feature vector by multiplying the weight vector. Then, CRF model is introduced to get the final tagging to obtain the desired result. The experimental data shows that the F1-score of our model has increased from 73.88% to 75.10% on the OntoNote 4 dataset, and from 93.18% to 94.17% on the MSRA dataset. The results show that our method has a better performance than the previous model.