{"title":"基于方面的越南语情感分析语料库","authors":"Minh-Hao Nguyen, T. Nguyen, D. Thin, N. Nguyen","doi":"10.1109/KSE.2019.8919448","DOIUrl":null,"url":null,"abstract":"Recently, researchers have shown an increased interest in the aspect-based sentiment analysis problem. The goal is to extract valuable information concerning the aspects mentioned in users comments. This problem can be divided into three sub-tasks: term extraction, aspect detection, and polarity detection. In this paper, we present a new annotated corpus for studies on the two sub-tasks: aspect detection and polarity detection. Our corpus includes 7,828 restaurant reviews at document-level. We also performed a supervised learning method with rich features, achieving the F1-score of 87.13% for the aspect detection and the F1-score of 59.20% for polarity detection. Our corpus is published for research purpose1.","PeriodicalId":439841,"journal":{"name":"2019 11th International Conference on Knowledge and Systems Engineering (KSE)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":"{\"title\":\"A corpus for aspect-based sentiment analysis in Vietnamese\",\"authors\":\"Minh-Hao Nguyen, T. Nguyen, D. Thin, N. Nguyen\",\"doi\":\"10.1109/KSE.2019.8919448\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Recently, researchers have shown an increased interest in the aspect-based sentiment analysis problem. The goal is to extract valuable information concerning the aspects mentioned in users comments. This problem can be divided into three sub-tasks: term extraction, aspect detection, and polarity detection. In this paper, we present a new annotated corpus for studies on the two sub-tasks: aspect detection and polarity detection. Our corpus includes 7,828 restaurant reviews at document-level. We also performed a supervised learning method with rich features, achieving the F1-score of 87.13% for the aspect detection and the F1-score of 59.20% for polarity detection. Our corpus is published for research purpose1.\",\"PeriodicalId\":439841,\"journal\":{\"name\":\"2019 11th International Conference on Knowledge and Systems Engineering (KSE)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"9\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 11th International Conference on Knowledge and Systems Engineering (KSE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/KSE.2019.8919448\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 11th International Conference on Knowledge and Systems Engineering (KSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/KSE.2019.8919448","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A corpus for aspect-based sentiment analysis in Vietnamese
Recently, researchers have shown an increased interest in the aspect-based sentiment analysis problem. The goal is to extract valuable information concerning the aspects mentioned in users comments. This problem can be divided into three sub-tasks: term extraction, aspect detection, and polarity detection. In this paper, we present a new annotated corpus for studies on the two sub-tasks: aspect detection and polarity detection. Our corpus includes 7,828 restaurant reviews at document-level. We also performed a supervised learning method with rich features, achieving the F1-score of 87.13% for the aspect detection and the F1-score of 59.20% for polarity detection. Our corpus is published for research purpose1.