开发一个数据集，用于挖掘名人方面的推文评论

2022 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT) Pub Date : 2022-11-01 DOI:10.1109/wi-iat55865.2022.00075

Kotoe Sugawara, T. Utsuro

{"title":"开发一个数据集，用于挖掘名人方面的推文评论","authors":"Kotoe Sugawara, T. Utsuro","doi":"10.1109/wi-iat55865.2022.00075","DOIUrl":null,"url":null,"abstract":"The purpose of this paper is to make it easier for celebrity fans to search for information on critiques and interest trends about celebrities and matters related to them. As part of this process, we collect tweets on Twitter that mention aspects of celebrities and train and evaluate a token classification framework to judge whether or not tweets represent reviews on aspects of celebrities. Specifically, we develop a dataset by annotating the collected tweets whether or not aspect’s names of celebrities and adjectives/adjectival verbs have review relationships or not. We train and evaluate the token classification framework using BERT to determine whether the tweets represent reviews on aspects of celebrities. In the evaluation, we evaluate whether or not the TARGET and REVIEW labels annotated by the token classification framework using BERT are correct or not. The token classification framework using BERT achieved high enough performance, indicating that it is appropriate to apply the token classification framework using BERT to this task.","PeriodicalId":345445,"journal":{"name":"2022 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT)","volume":"7 4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Developing a Dataset for Mining Reviews in Tweets focusing on Celebrities’ Aspects\",\"authors\":\"Kotoe Sugawara, T. Utsuro\",\"doi\":\"10.1109/wi-iat55865.2022.00075\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The purpose of this paper is to make it easier for celebrity fans to search for information on critiques and interest trends about celebrities and matters related to them. As part of this process, we collect tweets on Twitter that mention aspects of celebrities and train and evaluate a token classification framework to judge whether or not tweets represent reviews on aspects of celebrities. Specifically, we develop a dataset by annotating the collected tweets whether or not aspect’s names of celebrities and adjectives/adjectival verbs have review relationships or not. We train and evaluate the token classification framework using BERT to determine whether the tweets represent reviews on aspects of celebrities. In the evaluation, we evaluate whether or not the TARGET and REVIEW labels annotated by the token classification framework using BERT are correct or not. The token classification framework using BERT achieved high enough performance, indicating that it is appropriate to apply the token classification framework using BERT to this task.\",\"PeriodicalId\":345445,\"journal\":{\"name\":\"2022 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT)\",\"volume\":\"7 4 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/wi-iat55865.2022.00075\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/wi-iat55865.2022.00075","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

本文的目的是为了让明星粉丝更容易搜索到关于名人及其相关事项的评论信息和兴趣趋势。作为这个过程的一部分，我们在Twitter上收集提到名人方面的推文，并训练和评估一个令牌分类框架，以判断推文是否代表对名人方面的评论。具体来说，我们通过注释收集到的推文来开发一个数据集，无论方面的名人名字和形容词/形容词动词是否有评论关系。我们使用BERT训练和评估令牌分类框架，以确定推文是否代表对名人方面的评论。在评估中，我们评估使用BERT的令牌分类框架注释的TARGET和REVIEW标签是否正确。使用BERT的令牌分类框架取得了足够高的性能，表明将使用BERT的令牌分类框架应用于该任务是合适的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Developing a Dataset for Mining Reviews in Tweets focusing on Celebrities’ Aspects

The purpose of this paper is to make it easier for celebrity fans to search for information on critiques and interest trends about celebrities and matters related to them. As part of this process, we collect tweets on Twitter that mention aspects of celebrities and train and evaluate a token classification framework to judge whether or not tweets represent reviews on aspects of celebrities. Specifically, we develop a dataset by annotating the collected tweets whether or not aspect’s names of celebrities and adjectives/adjectival verbs have review relationships or not. We train and evaluate the token classification framework using BERT to determine whether the tweets represent reviews on aspects of celebrities. In the evaluation, we evaluate whether or not the TARGET and REVIEW labels annotated by the token classification framework using BERT are correct or not. The token classification framework using BERT achieved high enough performance, indicating that it is appropriate to apply the token classification framework using BERT to this task.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT)

自引率

0.00%

发文量