Yujie Lu, Kotaro Sakamoto, Hideyuki Shibuki, Tatsunori Mori
{"title":"Construction of a Multilingual Annotated Corpus for Deeper Sentiment Understanding in Social Media","authors":"Yujie Lu, Kotaro Sakamoto, Hideyuki Shibuki, Tatsunori Mori","doi":"10.5715/JNLP.24.205","DOIUrl":null,"url":null,"abstract":"The surge of social media makes it possible to understand people’s emotion in different cultures. In this paper, we construct an annotated corpus for multilingual sentiment understanding. The annotation is developed in a multilingual setting including English/Japanese/Chinese, and on a representative dataset including 4 topics (spanning 3 genres, which are product, people, and event).To deep understand expression mechanism of feeling entailed in the text, we labelled sentimental signal words and rhetoric phenomenon in addition to overall polarity. This innovative corpus can be a helpful resource for the improvement of sentiment classification, cross-cultural comparison etc.","PeriodicalId":16243,"journal":{"name":"Journal of Information Processing","volume":"24 1","pages":"205-265"},"PeriodicalIF":0.0000,"publicationDate":"2017-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.5715/JNLP.24.205","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Information Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5715/JNLP.24.205","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Computer Science","Score":null,"Total":0}
引用次数: 7
Abstract
The surge of social media makes it possible to understand people’s emotion in different cultures. In this paper, we construct an annotated corpus for multilingual sentiment understanding. The annotation is developed in a multilingual setting including English/Japanese/Chinese, and on a representative dataset including 4 topics (spanning 3 genres, which are product, people, and event).To deep understand expression mechanism of feeling entailed in the text, we labelled sentimental signal words and rhetoric phenomenon in addition to overall polarity. This innovative corpus can be a helpful resource for the improvement of sentiment classification, cross-cultural comparison etc.