Developing a Dataset for Mining Reviews in Tweets focusing on Celebrities’ Aspects

Kotoe Sugawara, T. Utsuro
{"title":"Developing a Dataset for Mining Reviews in Tweets focusing on Celebrities’ Aspects","authors":"Kotoe Sugawara, T. Utsuro","doi":"10.1109/wi-iat55865.2022.00075","DOIUrl":null,"url":null,"abstract":"The purpose of this paper is to make it easier for celebrity fans to search for information on critiques and interest trends about celebrities and matters related to them. As part of this process, we collect tweets on Twitter that mention aspects of celebrities and train and evaluate a token classification framework to judge whether or not tweets represent reviews on aspects of celebrities. Specifically, we develop a dataset by annotating the collected tweets whether or not aspect’s names of celebrities and adjectives/adjectival verbs have review relationships or not. We train and evaluate the token classification framework using BERT to determine whether the tweets represent reviews on aspects of celebrities. In the evaluation, we evaluate whether or not the TARGET and REVIEW labels annotated by the token classification framework using BERT are correct or not. The token classification framework using BERT achieved high enough performance, indicating that it is appropriate to apply the token classification framework using BERT to this task.","PeriodicalId":345445,"journal":{"name":"2022 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT)","volume":"7 4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/wi-iat55865.2022.00075","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The purpose of this paper is to make it easier for celebrity fans to search for information on critiques and interest trends about celebrities and matters related to them. As part of this process, we collect tweets on Twitter that mention aspects of celebrities and train and evaluate a token classification framework to judge whether or not tweets represent reviews on aspects of celebrities. Specifically, we develop a dataset by annotating the collected tweets whether or not aspect’s names of celebrities and adjectives/adjectival verbs have review relationships or not. We train and evaluate the token classification framework using BERT to determine whether the tweets represent reviews on aspects of celebrities. In the evaluation, we evaluate whether or not the TARGET and REVIEW labels annotated by the token classification framework using BERT are correct or not. The token classification framework using BERT achieved high enough performance, indicating that it is appropriate to apply the token classification framework using BERT to this task.
开发一个数据集,用于挖掘名人方面的推文评论
本文的目的是为了让明星粉丝更容易搜索到关于名人及其相关事项的评论信息和兴趣趋势。作为这个过程的一部分,我们在Twitter上收集提到名人方面的推文,并训练和评估一个令牌分类框架,以判断推文是否代表对名人方面的评论。具体来说,我们通过注释收集到的推文来开发一个数据集,无论方面的名人名字和形容词/形容词动词是否有评论关系。我们使用BERT训练和评估令牌分类框架,以确定推文是否代表对名人方面的评论。在评估中,我们评估使用BERT的令牌分类框架注释的TARGET和REVIEW标签是否正确。使用BERT的令牌分类框架取得了足够高的性能,表明将使用BERT的令牌分类框架应用于该任务是合适的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信