{"title":"Research on Tibetan Text Classification Method Based on Neural Network","authors":"Zhensong Li, Jie Zhu, Zhixiang Luo, Saihu Liu","doi":"10.1109/IALP48816.2019.9037706","DOIUrl":null,"url":null,"abstract":"Text categorization is an important task in natural language processing, and it has a wide range of applications in real life. In this paper, two N-Gram feature models (MLP, FastText) and two sequential models (sepCNN, Bi-LSTM) are used to study the automatic classification for Tibetan text based on syllables and vocabulary. The experiment on Tibetan language data collected by China Tibet News Network shows that the classification accuracy is about 85%.","PeriodicalId":208066,"journal":{"name":"2019 International Conference on Asian Language Processing (IALP)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 International Conference on Asian Language Processing (IALP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IALP48816.2019.9037706","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Text categorization is an important task in natural language processing, and it has a wide range of applications in real life. In this paper, two N-Gram feature models (MLP, FastText) and two sequential models (sepCNN, Bi-LSTM) are used to study the automatic classification for Tibetan text based on syllables and vocabulary. The experiment on Tibetan language data collected by China Tibet News Network shows that the classification accuracy is about 85%.