{"title":"基于集成学习分类器的藏文文本分类研究","authors":"Ling Ailin, Yu Hongzhi, Yuan Bin","doi":"10.1109/IAEAC.2015.7428520","DOIUrl":null,"url":null,"abstract":"Based on the text feature and syntatic structure of Tibetan, this theory mainly focuses on categorization of Tibetan through ensemble learning classified method. Combine KNN and Naive Bayesian to build a basic classifier on account of three character subsets over category of word character. And then take weighted calculation of basic classifier. For the last step, calculate the weight of basic classifier via gradient descend and reach final result of categorization. During experiment, choose recall ratio, precision ratio as well as other evaluation function to analyze and evaluate KNN, Naive Bayesian and text categorization. Conclusion from experiment shows that precision of calculation of Tibetan text based on ensemble learning method has been greatly improved.","PeriodicalId":398100,"journal":{"name":"2015 IEEE Advanced Information Technology, Electronic and Automation Control Conference (IAEAC)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Study of Tibetan text categorization based on ensemble learning classifier\",\"authors\":\"Ling Ailin, Yu Hongzhi, Yuan Bin\",\"doi\":\"10.1109/IAEAC.2015.7428520\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Based on the text feature and syntatic structure of Tibetan, this theory mainly focuses on categorization of Tibetan through ensemble learning classified method. Combine KNN and Naive Bayesian to build a basic classifier on account of three character subsets over category of word character. And then take weighted calculation of basic classifier. For the last step, calculate the weight of basic classifier via gradient descend and reach final result of categorization. During experiment, choose recall ratio, precision ratio as well as other evaluation function to analyze and evaluate KNN, Naive Bayesian and text categorization. Conclusion from experiment shows that precision of calculation of Tibetan text based on ensemble learning method has been greatly improved.\",\"PeriodicalId\":398100,\"journal\":{\"name\":\"2015 IEEE Advanced Information Technology, Electronic and Automation Control Conference (IAEAC)\",\"volume\":\"4 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 IEEE Advanced Information Technology, Electronic and Automation Control Conference (IAEAC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IAEAC.2015.7428520\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 IEEE Advanced Information Technology, Electronic and Automation Control Conference (IAEAC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IAEAC.2015.7428520","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Study of Tibetan text categorization based on ensemble learning classifier
Based on the text feature and syntatic structure of Tibetan, this theory mainly focuses on categorization of Tibetan through ensemble learning classified method. Combine KNN and Naive Bayesian to build a basic classifier on account of three character subsets over category of word character. And then take weighted calculation of basic classifier. For the last step, calculate the weight of basic classifier via gradient descend and reach final result of categorization. During experiment, choose recall ratio, precision ratio as well as other evaluation function to analyze and evaluate KNN, Naive Bayesian and text categorization. Conclusion from experiment shows that precision of calculation of Tibetan text based on ensemble learning method has been greatly improved.