{"title":"基于音节的藏文文本分类算法","authors":"Xianghe Meng, Hongzhi Yu, Hui Cao","doi":"10.1109/ICISCAE51034.2020.9236833","DOIUrl":null,"url":null,"abstract":"Tibetan text classification is one of the core technologies in the field of Tibetan information processing. With the rapid development of the Internet, a large amount of Tibetan Internet text data will be generated every day. Text classification technology can quickly and accurately obtain the required information to solve the problem of out-of-order in text. Tibetan syllables are the basic components of Tibetan text, and each syllable in Tibetan is divided by syllable nodes. This paper proposes a Tibetan syllable as a text representation feature, and uses deep neural network models such as CNN, BiL STM and RCNN to classify Tibetan text. Experiments show that this method has achieved prefect results in different depth neural network classification models.","PeriodicalId":355473,"journal":{"name":"2020 IEEE 3rd International Conference on Information Systems and Computer Aided Education (ICISCAE)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Tibetan Text Classification Algorithm Based on Syllables\",\"authors\":\"Xianghe Meng, Hongzhi Yu, Hui Cao\",\"doi\":\"10.1109/ICISCAE51034.2020.9236833\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Tibetan text classification is one of the core technologies in the field of Tibetan information processing. With the rapid development of the Internet, a large amount of Tibetan Internet text data will be generated every day. Text classification technology can quickly and accurately obtain the required information to solve the problem of out-of-order in text. Tibetan syllables are the basic components of Tibetan text, and each syllable in Tibetan is divided by syllable nodes. This paper proposes a Tibetan syllable as a text representation feature, and uses deep neural network models such as CNN, BiL STM and RCNN to classify Tibetan text. Experiments show that this method has achieved prefect results in different depth neural network classification models.\",\"PeriodicalId\":355473,\"journal\":{\"name\":\"2020 IEEE 3rd International Conference on Information Systems and Computer Aided Education (ICISCAE)\",\"volume\":\"16 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-09-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE 3rd International Conference on Information Systems and Computer Aided Education (ICISCAE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICISCAE51034.2020.9236833\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE 3rd International Conference on Information Systems and Computer Aided Education (ICISCAE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICISCAE51034.2020.9236833","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Tibetan Text Classification Algorithm Based on Syllables
Tibetan text classification is one of the core technologies in the field of Tibetan information processing. With the rapid development of the Internet, a large amount of Tibetan Internet text data will be generated every day. Text classification technology can quickly and accurately obtain the required information to solve the problem of out-of-order in text. Tibetan syllables are the basic components of Tibetan text, and each syllable in Tibetan is divided by syllable nodes. This paper proposes a Tibetan syllable as a text representation feature, and uses deep neural network models such as CNN, BiL STM and RCNN to classify Tibetan text. Experiments show that this method has achieved prefect results in different depth neural network classification models.