{"title":"基于规则和字典的权力域命名实体识别","authors":"Jue Jiang, Rongheng Lin, Hua Zou","doi":"10.1109/ICCT56141.2022.10073142","DOIUrl":null,"url":null,"abstract":"There are massive electricity data in the daily management, normalization operation, troubleshooting and other aspects of the power domain, but these professional and accurate data have not been fully mined and used. Constructing a power domain knowledge map can not only help power grid companies tap the value of these massive data and realize the integration of power knowledge, but also greatly facilitate the staff's query and acquisition of power information, and improve the work efficiency of the power industry. NER (name entity recognition) is the basis for constructing knowledge graph. This paper studies name entity recognition based on dictionaries and rules. It can standardize and accurately extract electricity from unstructured text through three methods: power entity dictionary, feature character rule matching, and part-of-speech combination rule matching. Related entities provide high-quality and high-precision entities for the construction of power domain knowledge graph.","PeriodicalId":294057,"journal":{"name":"2022 IEEE 22nd International Conference on Communication Technology (ICCT)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Power Domain Named Entity Recognition Based on Rules and Dictionaries\",\"authors\":\"Jue Jiang, Rongheng Lin, Hua Zou\",\"doi\":\"10.1109/ICCT56141.2022.10073142\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"There are massive electricity data in the daily management, normalization operation, troubleshooting and other aspects of the power domain, but these professional and accurate data have not been fully mined and used. Constructing a power domain knowledge map can not only help power grid companies tap the value of these massive data and realize the integration of power knowledge, but also greatly facilitate the staff's query and acquisition of power information, and improve the work efficiency of the power industry. NER (name entity recognition) is the basis for constructing knowledge graph. This paper studies name entity recognition based on dictionaries and rules. It can standardize and accurately extract electricity from unstructured text through three methods: power entity dictionary, feature character rule matching, and part-of-speech combination rule matching. Related entities provide high-quality and high-precision entities for the construction of power domain knowledge graph.\",\"PeriodicalId\":294057,\"journal\":{\"name\":\"2022 IEEE 22nd International Conference on Communication Technology (ICCT)\",\"volume\":\"26 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE 22nd International Conference on Communication Technology (ICCT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCT56141.2022.10073142\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 22nd International Conference on Communication Technology (ICCT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCT56141.2022.10073142","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Power Domain Named Entity Recognition Based on Rules and Dictionaries
There are massive electricity data in the daily management, normalization operation, troubleshooting and other aspects of the power domain, but these professional and accurate data have not been fully mined and used. Constructing a power domain knowledge map can not only help power grid companies tap the value of these massive data and realize the integration of power knowledge, but also greatly facilitate the staff's query and acquisition of power information, and improve the work efficiency of the power industry. NER (name entity recognition) is the basis for constructing knowledge graph. This paper studies name entity recognition based on dictionaries and rules. It can standardize and accurately extract electricity from unstructured text through three methods: power entity dictionary, feature character rule matching, and part-of-speech combination rule matching. Related entities provide high-quality and high-precision entities for the construction of power domain knowledge graph.