基于 BERT 微调和 SBD-K 形状的丹木视频情感时间序列聚类

Ruoxi Zhang, Chenhan Ren
{"title":"基于 BERT 微调和 SBD-K 形状的丹木视频情感时间序列聚类","authors":"Ruoxi Zhang, Chenhan Ren","doi":"10.1108/el-10-2023-0243","DOIUrl":null,"url":null,"abstract":"\nPurpose\nThis study aims to construct a sentiment series generation method for danmu comments based on deep learning, and explore the features of sentiment series after clustering.\n\n\nDesign/methodology/approach\nThis study consisted of two main parts: danmu comment sentiment series generation and clustering. In the first part, the authors proposed a sentiment classification model based on BERT fine-tuning to quantify danmu comment sentiment polarity. To smooth the sentiment series, they used methods, such as comprehensive weights. In the second part, the shaped-based distance (SBD)-K-shape method was used to cluster the actual collected data.\n\n\nFindings\nThe filtered sentiment series or curves of the microfilms on the Bilibili website could be divided into four major categories. There is an apparently stable time interval for the first three types of sentiment curves, while the fourth type of sentiment curve shows a clear trend of fluctuation in general. In addition, it was found that “disputed points” or “highlights” are likely to appear at the beginning and the climax of films, resulting in significant changes in the sentiment curves. The clustering results show a significant difference in user participation, with the second type prevailing over others.\n\n\nOriginality/value\nTheir sentiment classification model based on BERT fine-tuning outperformed the traditional sentiment lexicon method, which provides a reference for using deep learning as well as transfer learning for danmu comment sentiment analysis. The BERT fine-tuning–SBD-K-shape algorithm can weaken the effect of non-regular noise and temporal phase shift of danmu text.\n","PeriodicalId":360625,"journal":{"name":"The Electronic Library","volume":"13 2","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Sentiment time series clustering of Danmu videos based on BERT fine-tuning and SBD-K-shape\",\"authors\":\"Ruoxi Zhang, Chenhan Ren\",\"doi\":\"10.1108/el-10-2023-0243\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\nPurpose\\nThis study aims to construct a sentiment series generation method for danmu comments based on deep learning, and explore the features of sentiment series after clustering.\\n\\n\\nDesign/methodology/approach\\nThis study consisted of two main parts: danmu comment sentiment series generation and clustering. In the first part, the authors proposed a sentiment classification model based on BERT fine-tuning to quantify danmu comment sentiment polarity. To smooth the sentiment series, they used methods, such as comprehensive weights. In the second part, the shaped-based distance (SBD)-K-shape method was used to cluster the actual collected data.\\n\\n\\nFindings\\nThe filtered sentiment series or curves of the microfilms on the Bilibili website could be divided into four major categories. There is an apparently stable time interval for the first three types of sentiment curves, while the fourth type of sentiment curve shows a clear trend of fluctuation in general. In addition, it was found that “disputed points” or “highlights” are likely to appear at the beginning and the climax of films, resulting in significant changes in the sentiment curves. The clustering results show a significant difference in user participation, with the second type prevailing over others.\\n\\n\\nOriginality/value\\nTheir sentiment classification model based on BERT fine-tuning outperformed the traditional sentiment lexicon method, which provides a reference for using deep learning as well as transfer learning for danmu comment sentiment analysis. The BERT fine-tuning–SBD-K-shape algorithm can weaken the effect of non-regular noise and temporal phase shift of danmu text.\\n\",\"PeriodicalId\":360625,\"journal\":{\"name\":\"The Electronic Library\",\"volume\":\"13 2\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-04-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"The Electronic Library\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1108/el-10-2023-0243\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"The Electronic Library","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1108/el-10-2023-0243","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

目的 本研究旨在构建一种基于深度学习的当沐评论情感系列生成方法,并探索聚类后的情感系列特征。在第一部分,作者提出了一种基于 BERT 微调的情感分类模型,用于量化丹木评论的情感极性。为了平滑情感序列,他们使用了综合权重等方法。研究结果Bilibili 网站微电影中过滤后的情感序列或曲线可分为四大类。前三类情感曲线的时间间隔明显稳定,而第四类情感曲线总体波动趋势明显。此外,研究还发现,"争议点 "或 "亮点 "容易出现在影片的开头和高潮部分,从而导致情感曲线发生显著变化。基于 BERT 微调的情感分类模型优于传统的情感词典方法,这为使用深度学习和迁移学习进行丹木评论情感分析提供了参考。BERT微调-SBD-K-shape算法可以弱化非规则噪声和时间相移对丹木文本的影响。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Sentiment time series clustering of Danmu videos based on BERT fine-tuning and SBD-K-shape
Purpose This study aims to construct a sentiment series generation method for danmu comments based on deep learning, and explore the features of sentiment series after clustering. Design/methodology/approach This study consisted of two main parts: danmu comment sentiment series generation and clustering. In the first part, the authors proposed a sentiment classification model based on BERT fine-tuning to quantify danmu comment sentiment polarity. To smooth the sentiment series, they used methods, such as comprehensive weights. In the second part, the shaped-based distance (SBD)-K-shape method was used to cluster the actual collected data. Findings The filtered sentiment series or curves of the microfilms on the Bilibili website could be divided into four major categories. There is an apparently stable time interval for the first three types of sentiment curves, while the fourth type of sentiment curve shows a clear trend of fluctuation in general. In addition, it was found that “disputed points” or “highlights” are likely to appear at the beginning and the climax of films, resulting in significant changes in the sentiment curves. The clustering results show a significant difference in user participation, with the second type prevailing over others. Originality/value Their sentiment classification model based on BERT fine-tuning outperformed the traditional sentiment lexicon method, which provides a reference for using deep learning as well as transfer learning for danmu comment sentiment analysis. The BERT fine-tuning–SBD-K-shape algorithm can weaken the effect of non-regular noise and temporal phase shift of danmu text.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信