{"title":"基于朴素贝叶斯的中国人姓名识别","authors":"Hui Zeng, J. Wang, Tao Wan","doi":"10.1109/3PGCIC.2014.60","DOIUrl":null,"url":null,"abstract":"On the basis of the traditional Naive Bayesian classification algorithm that just considered character of Chinese person name, we brought person name's up and down boundary words in it. In order to overcome the difficulty of boundary defining, we counted Chinese name's character frequency and boundary templates' frequency from tagged corpus. Then these recognized person names are used to match the missed occurrence in the text. The method is easy and the final result is good. Experimental results show that the F-value for recognition of Chinese person name was increased.","PeriodicalId":395610,"journal":{"name":"2014 Ninth International Conference on P2P, Parallel, Grid, Cloud and Internet Computing","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Chinese Person Name Recognition Based on Naive Bayes\",\"authors\":\"Hui Zeng, J. Wang, Tao Wan\",\"doi\":\"10.1109/3PGCIC.2014.60\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"On the basis of the traditional Naive Bayesian classification algorithm that just considered character of Chinese person name, we brought person name's up and down boundary words in it. In order to overcome the difficulty of boundary defining, we counted Chinese name's character frequency and boundary templates' frequency from tagged corpus. Then these recognized person names are used to match the missed occurrence in the text. The method is easy and the final result is good. Experimental results show that the F-value for recognition of Chinese person name was increased.\",\"PeriodicalId\":395610,\"journal\":{\"name\":\"2014 Ninth International Conference on P2P, Parallel, Grid, Cloud and Internet Computing\",\"volume\":\"26 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-11-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2014 Ninth International Conference on P2P, Parallel, Grid, Cloud and Internet Computing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/3PGCIC.2014.60\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 Ninth International Conference on P2P, Parallel, Grid, Cloud and Internet Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/3PGCIC.2014.60","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Chinese Person Name Recognition Based on Naive Bayes
On the basis of the traditional Naive Bayesian classification algorithm that just considered character of Chinese person name, we brought person name's up and down boundary words in it. In order to overcome the difficulty of boundary defining, we counted Chinese name's character frequency and boundary templates' frequency from tagged corpus. Then these recognized person names are used to match the missed occurrence in the text. The method is easy and the final result is good. Experimental results show that the F-value for recognition of Chinese person name was increased.