利用深度神经网络检测短文本中的负面情绪。

IF 4.2 2区 心理学 Q1 PSYCHOLOGY, SOCIAL
Luis A Camacho-Vázquez,Vanessa A Camacho-Vázquez,Sandra D Orantes-Jiménez,Grigori Sidorov
{"title":"利用深度神经网络检测短文本中的负面情绪。","authors":"Luis A Camacho-Vázquez,Vanessa A Camacho-Vázquez,Sandra D Orantes-Jiménez,Grigori Sidorov","doi":"10.1089/cyber.2024.0457","DOIUrl":null,"url":null,"abstract":"Emotion detection is crucial in various domains, including psychology, health, social sciences, and marketing. Specifically, in psychology, identifying negative emotions in short Spanish texts, such as tweets, is vital for understanding individuals' emotional states. However, this process is challenging because of factors such as lack of context, cultural nuances, and ambiguous expressions. Although much research on emotion classification in tweets has focused on applications such as crisis analysis, mental health monitoring, and affective computing, most of it has been conducted in English, leaving a significant gap in addressing the emotional needs of Spanish-speaking communities. To address this gap, we used a corpus of 12,000 Spanish tweets tagged with Ekman's negative emotions (sadness, anger, fear, and disgust). Traditional features (n-grams of different types and sizes), syntactic n-grams, and combined features were evaluated. Different deep neural networks, including convolutional neural networks, Bidirectional Encoder Representations of Transformers (BERT), and the robust optimized BERT approach called RoBERTa, were implemented and compared with traditional machine learning methods to identify the most effective method. Extensive testing revealed that BERT achieved the best result, with a macro F1 score of 0.9973. Furthermore, we reported the carbon emissions generated during the training of each implemented method. This study makes a unique contribution by focusing on negative emotions in Spanish, leveraging one of the largest and highest-quality corpora available. It stands out for implementing advanced transformers such as RoBERTa and integrating combined and syntactic n-grams in traditional methods. Furthermore, it highlights how parameters, features, and preprocessing significantly influence performance.","PeriodicalId":10872,"journal":{"name":"Cyberpsychology, behavior and social networking","volume":"47 1","pages":""},"PeriodicalIF":4.2000,"publicationDate":"2025-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Detection of Negative Emotions in Short Texts Using Deep Neural Networks.\",\"authors\":\"Luis A Camacho-Vázquez,Vanessa A Camacho-Vázquez,Sandra D Orantes-Jiménez,Grigori Sidorov\",\"doi\":\"10.1089/cyber.2024.0457\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Emotion detection is crucial in various domains, including psychology, health, social sciences, and marketing. Specifically, in psychology, identifying negative emotions in short Spanish texts, such as tweets, is vital for understanding individuals' emotional states. However, this process is challenging because of factors such as lack of context, cultural nuances, and ambiguous expressions. Although much research on emotion classification in tweets has focused on applications such as crisis analysis, mental health monitoring, and affective computing, most of it has been conducted in English, leaving a significant gap in addressing the emotional needs of Spanish-speaking communities. To address this gap, we used a corpus of 12,000 Spanish tweets tagged with Ekman's negative emotions (sadness, anger, fear, and disgust). Traditional features (n-grams of different types and sizes), syntactic n-grams, and combined features were evaluated. Different deep neural networks, including convolutional neural networks, Bidirectional Encoder Representations of Transformers (BERT), and the robust optimized BERT approach called RoBERTa, were implemented and compared with traditional machine learning methods to identify the most effective method. Extensive testing revealed that BERT achieved the best result, with a macro F1 score of 0.9973. Furthermore, we reported the carbon emissions generated during the training of each implemented method. This study makes a unique contribution by focusing on negative emotions in Spanish, leveraging one of the largest and highest-quality corpora available. It stands out for implementing advanced transformers such as RoBERTa and integrating combined and syntactic n-grams in traditional methods. Furthermore, it highlights how parameters, features, and preprocessing significantly influence performance.\",\"PeriodicalId\":10872,\"journal\":{\"name\":\"Cyberpsychology, behavior and social networking\",\"volume\":\"47 1\",\"pages\":\"\"},\"PeriodicalIF\":4.2000,\"publicationDate\":\"2025-05-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Cyberpsychology, behavior and social networking\",\"FirstCategoryId\":\"102\",\"ListUrlMain\":\"https://doi.org/10.1089/cyber.2024.0457\",\"RegionNum\":2,\"RegionCategory\":\"心理学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"PSYCHOLOGY, SOCIAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cyberpsychology, behavior and social networking","FirstCategoryId":"102","ListUrlMain":"https://doi.org/10.1089/cyber.2024.0457","RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PSYCHOLOGY, SOCIAL","Score":null,"Total":0}
引用次数: 0

摘要

情绪检测在许多领域都是至关重要的,包括心理学、健康、社会科学和市场营销。具体来说,在心理学中,识别西班牙语短文本中的负面情绪,比如推特,对于理解个人的情绪状态至关重要。然而,由于缺乏语境、文化差异和歧义表达等因素,这一过程具有挑战性。尽管很多关于推特情绪分类的研究都集中在危机分析、心理健康监测和情感计算等应用上,但大多数研究都是用英语进行的,这在解决西班牙语社区的情感需求方面留下了很大的空白。为了解决这一差距,我们使用了一个由12000条西班牙语推文组成的语料库,这些推文标记了埃克曼的负面情绪(悲伤、愤怒、恐惧和厌恶)。对传统特征(不同类型和大小的n-gram)、句法n-gram和组合特征进行了评估。实现了不同的深度神经网络,包括卷积神经网络、双向编码器表示的变压器(BERT)和鲁棒优化的BERT方法RoBERTa,并与传统的机器学习方法进行了比较,以确定最有效的方法。大量的测试表明BERT取得了最好的结果,其宏观F1得分为0.9973。此外,我们还报告了每种实施方法在培训过程中产生的碳排放量。本研究利用最大和最高质量的语料库之一,专注于西班牙语中的负面情绪,做出了独特的贡献。它在实现高级转换器(如RoBERTa)以及在传统方法中集成组合和语法n-gram方面表现突出。此外,它还强调了参数、特征和预处理如何显著影响性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Detection of Negative Emotions in Short Texts Using Deep Neural Networks.
Emotion detection is crucial in various domains, including psychology, health, social sciences, and marketing. Specifically, in psychology, identifying negative emotions in short Spanish texts, such as tweets, is vital for understanding individuals' emotional states. However, this process is challenging because of factors such as lack of context, cultural nuances, and ambiguous expressions. Although much research on emotion classification in tweets has focused on applications such as crisis analysis, mental health monitoring, and affective computing, most of it has been conducted in English, leaving a significant gap in addressing the emotional needs of Spanish-speaking communities. To address this gap, we used a corpus of 12,000 Spanish tweets tagged with Ekman's negative emotions (sadness, anger, fear, and disgust). Traditional features (n-grams of different types and sizes), syntactic n-grams, and combined features were evaluated. Different deep neural networks, including convolutional neural networks, Bidirectional Encoder Representations of Transformers (BERT), and the robust optimized BERT approach called RoBERTa, were implemented and compared with traditional machine learning methods to identify the most effective method. Extensive testing revealed that BERT achieved the best result, with a macro F1 score of 0.9973. Furthermore, we reported the carbon emissions generated during the training of each implemented method. This study makes a unique contribution by focusing on negative emotions in Spanish, leveraging one of the largest and highest-quality corpora available. It stands out for implementing advanced transformers such as RoBERTa and integrating combined and syntactic n-grams in traditional methods. Furthermore, it highlights how parameters, features, and preprocessing significantly influence performance.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
9.60
自引率
3.00%
发文量
123
期刊介绍: Cyberpsychology, Behavior, and Social Networking is a leading peer-reviewed journal that is recognized for its authoritative research on the social, behavioral, and psychological impacts of contemporary social networking practices. The journal covers a wide range of platforms, including Twitter, Facebook, internet gaming, and e-commerce, and examines how these digital environments shape human interaction and societal norms. For over two decades, this journal has been a pioneering voice in the exploration of social networking and virtual reality, establishing itself as an indispensable resource for professionals and academics in the field. It is particularly celebrated for its swift dissemination of findings through rapid communication articles, alongside comprehensive, in-depth studies that delve into the multifaceted effects of interactive technologies on both individual behavior and broader societal trends. The journal's scope encompasses the full spectrum of impacts—highlighting not only the potential benefits but also the challenges that arise as a result of these technologies. By providing a platform for rigorous research and critical discussions, it fosters a deeper understanding of the complex interplay between technology and human behavior.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信