Li Wensong, Hu Zhuqing, Zhang Jinhui, Liu Xuejing, Lin Feng, Yu Jun
{"title":"基于词结构BiGRU的电场命名实体识别优化","authors":"Li Wensong, Hu Zhuqing, Zhang Jinhui, Liu Xuejing, Lin Feng, Yu Jun","doi":"10.1109/iSPEC53008.2021.9735481","DOIUrl":null,"url":null,"abstract":"In view of the characteristics of small corpus size, nested entities and abbreviated entities in electric field, this paper studied the methods of entity recognition in professional field, and proposed a method of entity recognition in electric field based on text vector feature enhancement. Firstly, by using low-intensity and preset word segmentation, the semantic information contained in Chinese words can be used reasonably to avoid the influence of semantic separation between single Chinese character and word on text vectors; Then WS-BiGRU is designed to learn the structural features of words, and a new feature enhancement vector is formed by fusing word vectors, parts of speech and word lengths. Furthermore, the training of entity recognition model is completed by BiGRU-Attention-CRF. For electric corpus, the P value is 86.74% and the R value is 87.3%. The research results can be used in electric dispatching knowledge graph, metadata model, intelligent report query and other business scenarios.","PeriodicalId":417862,"journal":{"name":"2021 IEEE Sustainable Power and Energy Conference (iSPEC)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Optimized Named Entity Recognition of Electric Power Field Based on Word-Struct BiGRU\",\"authors\":\"Li Wensong, Hu Zhuqing, Zhang Jinhui, Liu Xuejing, Lin Feng, Yu Jun\",\"doi\":\"10.1109/iSPEC53008.2021.9735481\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In view of the characteristics of small corpus size, nested entities and abbreviated entities in electric field, this paper studied the methods of entity recognition in professional field, and proposed a method of entity recognition in electric field based on text vector feature enhancement. Firstly, by using low-intensity and preset word segmentation, the semantic information contained in Chinese words can be used reasonably to avoid the influence of semantic separation between single Chinese character and word on text vectors; Then WS-BiGRU is designed to learn the structural features of words, and a new feature enhancement vector is formed by fusing word vectors, parts of speech and word lengths. Furthermore, the training of entity recognition model is completed by BiGRU-Attention-CRF. For electric corpus, the P value is 86.74% and the R value is 87.3%. The research results can be used in electric dispatching knowledge graph, metadata model, intelligent report query and other business scenarios.\",\"PeriodicalId\":417862,\"journal\":{\"name\":\"2021 IEEE Sustainable Power and Energy Conference (iSPEC)\",\"volume\":\"34 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-12-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE Sustainable Power and Energy Conference (iSPEC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/iSPEC53008.2021.9735481\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE Sustainable Power and Energy Conference (iSPEC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/iSPEC53008.2021.9735481","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Optimized Named Entity Recognition of Electric Power Field Based on Word-Struct BiGRU
In view of the characteristics of small corpus size, nested entities and abbreviated entities in electric field, this paper studied the methods of entity recognition in professional field, and proposed a method of entity recognition in electric field based on text vector feature enhancement. Firstly, by using low-intensity and preset word segmentation, the semantic information contained in Chinese words can be used reasonably to avoid the influence of semantic separation between single Chinese character and word on text vectors; Then WS-BiGRU is designed to learn the structural features of words, and a new feature enhancement vector is formed by fusing word vectors, parts of speech and word lengths. Furthermore, the training of entity recognition model is completed by BiGRU-Attention-CRF. For electric corpus, the P value is 86.74% and the R value is 87.3%. The research results can be used in electric dispatching knowledge graph, metadata model, intelligent report query and other business scenarios.