{"title":"Research on Construction of Tibetan Sentiment Corpus","authors":"Tao Huang, Xiaodong Yan","doi":"10.1109/BWCCA.2015.31","DOIUrl":null,"url":null,"abstract":"Sentiment classification is one of the research hot spots of Natural Language Processing. Compared with English and Chinese, it is hard for Tibetan to do some research of sentiment analysis because of the situation that we lack of related sentiment corpus. In this paper, we construct a Tibetan sentiment corpus by crawling from Tibetan website and artificial Chinese-Tibetan translation. The final corpus we build is basically reaching a experimental requirement. The corpus contains 10,134 Emotion sentences, including 2,025 artificial translation corpus, and 8109 corpus crawl through the network.","PeriodicalId":193597,"journal":{"name":"2015 10th International Conference on Broadband and Wireless Computing, Communication and Applications (BWCCA)","volume":"676 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 10th International Conference on Broadband and Wireless Computing, Communication and Applications (BWCCA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BWCCA.2015.31","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Sentiment classification is one of the research hot spots of Natural Language Processing. Compared with English and Chinese, it is hard for Tibetan to do some research of sentiment analysis because of the situation that we lack of related sentiment corpus. In this paper, we construct a Tibetan sentiment corpus by crawling from Tibetan website and artificial Chinese-Tibetan translation. The final corpus we build is basically reaching a experimental requirement. The corpus contains 10,134 Emotion sentences, including 2,025 artificial translation corpus, and 8109 corpus crawl through the network.