{"title":"基于神经网络的中文命名实体识别分块信息","authors":"Chen Lyu, Junchi Zhang, Jiangping Huang","doi":"10.1117/12.2685645","DOIUrl":null,"url":null,"abstract":"Most named entity recognition (NER) systems use supervised machine learning methods, including linear models and neural networks. However, both methods require large amounts of annotated data. In this paper, we utilize chunking resources to improve the performance of NER without annotating more data and investigate two approaches to incorporate chunking information in the character-level neural network framework for Chinese NER. The first approach is multi-task learning, which falls into the framework of sharing common feature representation of Chinese chunking and NER task. The second approach is based on stacking, which uses the features provided by a trained chunking model to guide the NER model. Experimental results on OntoNotes 4.0 corpus show that compared with the models only using the NER corpus, the NER models which utilize the chunking resources can improve the performance of NER.","PeriodicalId":305812,"journal":{"name":"International Conference on Electronic Information Technology","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-08-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Incorporating chunking information for Chinese named entity recognition using neural networks\",\"authors\":\"Chen Lyu, Junchi Zhang, Jiangping Huang\",\"doi\":\"10.1117/12.2685645\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Most named entity recognition (NER) systems use supervised machine learning methods, including linear models and neural networks. However, both methods require large amounts of annotated data. In this paper, we utilize chunking resources to improve the performance of NER without annotating more data and investigate two approaches to incorporate chunking information in the character-level neural network framework for Chinese NER. The first approach is multi-task learning, which falls into the framework of sharing common feature representation of Chinese chunking and NER task. The second approach is based on stacking, which uses the features provided by a trained chunking model to guide the NER model. Experimental results on OntoNotes 4.0 corpus show that compared with the models only using the NER corpus, the NER models which utilize the chunking resources can improve the performance of NER.\",\"PeriodicalId\":305812,\"journal\":{\"name\":\"International Conference on Electronic Information Technology\",\"volume\":\"24 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-08-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Conference on Electronic Information Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1117/12.2685645\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Electronic Information Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.2685645","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Incorporating chunking information for Chinese named entity recognition using neural networks
Most named entity recognition (NER) systems use supervised machine learning methods, including linear models and neural networks. However, both methods require large amounts of annotated data. In this paper, we utilize chunking resources to improve the performance of NER without annotating more data and investigate two approaches to incorporate chunking information in the character-level neural network framework for Chinese NER. The first approach is multi-task learning, which falls into the framework of sharing common feature representation of Chinese chunking and NER task. The second approach is based on stacking, which uses the features provided by a trained chunking model to guide the NER model. Experimental results on OntoNotes 4.0 corpus show that compared with the models only using the NER corpus, the NER models which utilize the chunking resources can improve the performance of NER.