基于 BERT 微调和 SBD-K 形状的丹木视频情感时间序列聚类

The Electronic Library Pub Date : 2024-04-22 DOI:10.1108/el-10-2023-0243

Ruoxi Zhang, Chenhan Ren

{"title":"基于 BERT 微调和 SBD-K 形状的丹木视频情感时间序列聚类","authors":"Ruoxi Zhang, Chenhan Ren","doi":"10.1108/el-10-2023-0243","DOIUrl":null,"url":null,"abstract":"\nPurpose\nThis study aims to construct a sentiment series generation method for danmu comments based on deep learning, and explore the features of sentiment series after clustering.\n\n\nDesign/methodology/approach\nThis study consisted of two main parts: danmu comment sentiment series generation and clustering. In the first part, the authors proposed a sentiment classification model based on BERT fine-tuning to quantify danmu comment sentiment polarity. To smooth the sentiment series, they used methods, such as comprehensive weights. In the second part, the shaped-based distance (SBD)-K-shape method was used to cluster the actual collected data.\n\n\nFindings\nThe filtered sentiment series or curves of the microfilms on the Bilibili website could be divided into four major categories. There is an apparently stable time interval for the first three types of sentiment curves, while the fourth type of sentiment curve shows a clear trend of fluctuation in general. In addition, it was found that “disputed points” or “highlights” are likely to appear at the beginning and the climax of films, resulting in significant changes in the sentiment curves. The clustering results show a significant difference in user participation, with the second type prevailing over others.\n\n\nOriginality/value\nTheir sentiment classification model based on BERT fine-tuning outperformed the traditional sentiment lexicon method, which provides a reference for using deep learning as well as transfer learning for danmu comment sentiment analysis. The BERT fine-tuning–SBD-K-shape algorithm can weaken the effect of non-regular noise and temporal phase shift of danmu text.\n","PeriodicalId":360625,"journal":{"name":"The Electronic Library","volume":"13 2","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Sentiment time series clustering of Danmu videos based on BERT fine-tuning and SBD-K-shape\",\"authors\":\"Ruoxi Zhang, Chenhan Ren\",\"doi\":\"10.1108/el-10-2023-0243\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\nPurpose\\nThis study aims to construct a sentiment series generation method for danmu comments based on deep learning, and explore the features of sentiment series after clustering.\\n\\n\\nDesign/methodology/approach\\nThis study consisted of two main parts: danmu comment sentiment series generation and clustering. In the first part, the authors proposed a sentiment classification model based on BERT fine-tuning to quantify danmu comment sentiment polarity. To smooth the sentiment series, they used methods, such as comprehensive weights. In the second part, the shaped-based distance (SBD)-K-shape method was used to cluster the actual collected data.\\n\\n\\nFindings\\nThe filtered sentiment series or curves of the microfilms on the Bilibili website could be divided into four major categories. There is an apparently stable time interval for the first three types of sentiment curves, while the fourth type of sentiment curve shows a clear trend of fluctuation in general. In addition, it was found that “disputed points” or “highlights” are likely to appear at the beginning and the climax of films, resulting in significant changes in the sentiment curves. The clustering results show a significant difference in user participation, with the second type prevailing over others.\\n\\n\\nOriginality/value\\nTheir sentiment classification model based on BERT fine-tuning outperformed the traditional sentiment lexicon method, which provides a reference for using deep learning as well as transfer learning for danmu comment sentiment analysis. The BERT fine-tuning–SBD-K-shape algorithm can weaken the effect of non-regular noise and temporal phase shift of danmu text.\\n\",\"PeriodicalId\":360625,\"journal\":{\"name\":\"The Electronic Library\",\"volume\":\"13 2\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-04-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"The Electronic Library\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1108/el-10-2023-0243\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"The Electronic Library","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1108/el-10-2023-0243","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

目的本研究旨在构建一种基于深度学习的当沐评论情感系列生成方法，并探索聚类后的情感系列特征。在第一部分，作者提出了一种基于 BERT 微调的情感分类模型，用于量化丹木评论的情感极性。为了平滑情感序列，他们使用了综合权重等方法。研究结果Bilibili 网站微电影中过滤后的情感序列或曲线可分为四大类。前三类情感曲线的时间间隔明显稳定，而第四类情感曲线总体波动趋势明显。此外，研究还发现，"争议点 "或 "亮点 "容易出现在影片的开头和高潮部分，从而导致情感曲线发生显著变化。基于 BERT 微调的情感分类模型优于传统的情感词典方法，这为使用深度学习和迁移学习进行丹木评论情感分析提供了参考。BERT微调-SBD-K-shape算法可以弱化非规则噪声和时间相移对丹木文本的影响。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Sentiment time series clustering of Danmu videos based on BERT fine-tuning and SBD-K-shape

Purpose This study aims to construct a sentiment series generation method for danmu comments based on deep learning, and explore the features of sentiment series after clustering. Design/methodology/approach This study consisted of two main parts: danmu comment sentiment series generation and clustering. In the first part, the authors proposed a sentiment classification model based on BERT fine-tuning to quantify danmu comment sentiment polarity. To smooth the sentiment series, they used methods, such as comprehensive weights. In the second part, the shaped-based distance (SBD)-K-shape method was used to cluster the actual collected data. Findings The filtered sentiment series or curves of the microfilms on the Bilibili website could be divided into four major categories. There is an apparently stable time interval for the first three types of sentiment curves, while the fourth type of sentiment curve shows a clear trend of fluctuation in general. In addition, it was found that “disputed points” or “highlights” are likely to appear at the beginning and the climax of films, resulting in significant changes in the sentiment curves. The clustering results show a significant difference in user participation, with the second type prevailing over others. Originality/value Their sentiment classification model based on BERT fine-tuning outperformed the traditional sentiment lexicon method, which provides a reference for using deep learning as well as transfer learning for danmu comment sentiment analysis. The BERT fine-tuning–SBD-K-shape algorithm can weaken the effect of non-regular noise and temporal phase shift of danmu text.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

The Electronic Library

自引率

0.00%

发文量