{"title":"开发一个数据集,用于挖掘名人方面的推文评论","authors":"Kotoe Sugawara, T. Utsuro","doi":"10.1109/wi-iat55865.2022.00075","DOIUrl":null,"url":null,"abstract":"The purpose of this paper is to make it easier for celebrity fans to search for information on critiques and interest trends about celebrities and matters related to them. As part of this process, we collect tweets on Twitter that mention aspects of celebrities and train and evaluate a token classification framework to judge whether or not tweets represent reviews on aspects of celebrities. Specifically, we develop a dataset by annotating the collected tweets whether or not aspect’s names of celebrities and adjectives/adjectival verbs have review relationships or not. We train and evaluate the token classification framework using BERT to determine whether the tweets represent reviews on aspects of celebrities. In the evaluation, we evaluate whether or not the TARGET and REVIEW labels annotated by the token classification framework using BERT are correct or not. The token classification framework using BERT achieved high enough performance, indicating that it is appropriate to apply the token classification framework using BERT to this task.","PeriodicalId":345445,"journal":{"name":"2022 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT)","volume":"7 4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Developing a Dataset for Mining Reviews in Tweets focusing on Celebrities’ Aspects\",\"authors\":\"Kotoe Sugawara, T. Utsuro\",\"doi\":\"10.1109/wi-iat55865.2022.00075\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The purpose of this paper is to make it easier for celebrity fans to search for information on critiques and interest trends about celebrities and matters related to them. As part of this process, we collect tweets on Twitter that mention aspects of celebrities and train and evaluate a token classification framework to judge whether or not tweets represent reviews on aspects of celebrities. Specifically, we develop a dataset by annotating the collected tweets whether or not aspect’s names of celebrities and adjectives/adjectival verbs have review relationships or not. We train and evaluate the token classification framework using BERT to determine whether the tweets represent reviews on aspects of celebrities. In the evaluation, we evaluate whether or not the TARGET and REVIEW labels annotated by the token classification framework using BERT are correct or not. The token classification framework using BERT achieved high enough performance, indicating that it is appropriate to apply the token classification framework using BERT to this task.\",\"PeriodicalId\":345445,\"journal\":{\"name\":\"2022 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT)\",\"volume\":\"7 4 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/wi-iat55865.2022.00075\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/wi-iat55865.2022.00075","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Developing a Dataset for Mining Reviews in Tweets focusing on Celebrities’ Aspects
The purpose of this paper is to make it easier for celebrity fans to search for information on critiques and interest trends about celebrities and matters related to them. As part of this process, we collect tweets on Twitter that mention aspects of celebrities and train and evaluate a token classification framework to judge whether or not tweets represent reviews on aspects of celebrities. Specifically, we develop a dataset by annotating the collected tweets whether or not aspect’s names of celebrities and adjectives/adjectival verbs have review relationships or not. We train and evaluate the token classification framework using BERT to determine whether the tweets represent reviews on aspects of celebrities. In the evaluation, we evaluate whether or not the TARGET and REVIEW labels annotated by the token classification framework using BERT are correct or not. The token classification framework using BERT achieved high enough performance, indicating that it is appropriate to apply the token classification framework using BERT to this task.