利用残差多嵌入融合网络增强软件工程中的情感检测

IF 4.1 2区 计算机科学 Q1 COMPUTER SCIENCE, SOFTWARE ENGINEERING
Rim Mahouachi
{"title":"利用残差多嵌入融合网络增强软件工程中的情感检测","authors":"Rim Mahouachi","doi":"10.1016/j.jss.2025.112651","DOIUrl":null,"url":null,"abstract":"<div><div>Emotions play a crucial role in the development of software, particularly in team dynamics, productivity, and decision making. Developer communications — such as bug reports, code reviews, and online discussions — often include emotional signals. But part of the difficulty in identifying these feelings lies in the technicality and informality of the words, and in the utter scarcity of even critical but rare emotions like fear and surprise. This study aims to improve the detection of both common and minority emotions in software engineering texts, with a focus on better identifying underrepresented classes. We introduce R-MEFN, Residual Multi-Embedding Fusion Network, a network model that employs multiple types of contextual word embeddings to represent the text. Residual connections serve to keep signals of subtle emotionality, especially ones associated with emotions that are infrequent. Cross-validation is performed to choose the best combination of embeddings to be fused. We evaluate R-MEFN on two real-world datasets (StackOverflow and Jira), comparing it to other prior approaches on these benchmarks, as well as to single-embedding and combined-embedding baselines. R-MEFN outperforms other methods that have been evaluated in the same benchmarks for multilabel emotion detection, showing particular improvements on rare classes while keeping a good performance on frequent emotions. Also, it outperforms all single-embedding baselines, as well as all combined-embedding baselines, where embeddings from multiple sources are simply concatenated, showing the strength of the fusion approach. The cross-validated integration of contextual embeddings allows R-MEFN to produce more balanced and expressive representations across all emotion categories. These findings show the effectiveness of using multiple contextual embeddings and residual learning for addressing class imbalance in emotion detection. We see R-MEFN as a useful starting point towards creating emotion aware tools that can allow software teams to track emotional dynamics in a project, and identify hidden risks.</div></div>","PeriodicalId":51099,"journal":{"name":"Journal of Systems and Software","volume":"231 ","pages":"Article 112651"},"PeriodicalIF":4.1000,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Enhancing emotion detection in software engineering using a residual multi-embedding fusion network\",\"authors\":\"Rim Mahouachi\",\"doi\":\"10.1016/j.jss.2025.112651\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Emotions play a crucial role in the development of software, particularly in team dynamics, productivity, and decision making. Developer communications — such as bug reports, code reviews, and online discussions — often include emotional signals. But part of the difficulty in identifying these feelings lies in the technicality and informality of the words, and in the utter scarcity of even critical but rare emotions like fear and surprise. This study aims to improve the detection of both common and minority emotions in software engineering texts, with a focus on better identifying underrepresented classes. We introduce R-MEFN, Residual Multi-Embedding Fusion Network, a network model that employs multiple types of contextual word embeddings to represent the text. Residual connections serve to keep signals of subtle emotionality, especially ones associated with emotions that are infrequent. Cross-validation is performed to choose the best combination of embeddings to be fused. We evaluate R-MEFN on two real-world datasets (StackOverflow and Jira), comparing it to other prior approaches on these benchmarks, as well as to single-embedding and combined-embedding baselines. R-MEFN outperforms other methods that have been evaluated in the same benchmarks for multilabel emotion detection, showing particular improvements on rare classes while keeping a good performance on frequent emotions. Also, it outperforms all single-embedding baselines, as well as all combined-embedding baselines, where embeddings from multiple sources are simply concatenated, showing the strength of the fusion approach. The cross-validated integration of contextual embeddings allows R-MEFN to produce more balanced and expressive representations across all emotion categories. These findings show the effectiveness of using multiple contextual embeddings and residual learning for addressing class imbalance in emotion detection. We see R-MEFN as a useful starting point towards creating emotion aware tools that can allow software teams to track emotional dynamics in a project, and identify hidden risks.</div></div>\",\"PeriodicalId\":51099,\"journal\":{\"name\":\"Journal of Systems and Software\",\"volume\":\"231 \",\"pages\":\"Article 112651\"},\"PeriodicalIF\":4.1000,\"publicationDate\":\"2025-10-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Systems and Software\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0164121225003206\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, SOFTWARE ENGINEERING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Systems and Software","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0164121225003206","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
引用次数: 0

摘要

情绪在软件开发中起着至关重要的作用,特别是在团队动态、生产力和决策制定方面。开发人员的交流——比如bug报告、代码审查和在线讨论——经常包含情感信号。但识别这些情感的部分困难在于这些词语的技术性和非正式性,以及即使是批评性的但罕见的情感,如恐惧和惊讶,也非常稀缺。本研究旨在提高对软件工程文本中常见和少数情感的检测,重点是更好地识别未被充分代表的类别。我们引入R-MEFN,残差多嵌入融合网络,这是一种使用多种类型的上下文词嵌入来表示文本的网络模型。残留的连接用来保存微妙的情绪信号,尤其是那些与不常见的情绪有关的信号。通过交叉验证,选择待融合的最佳嵌入组合。我们在两个真实世界的数据集(StackOverflow和Jira)上评估了R-MEFN,并将其与这些基准测试中的其他先前方法以及单嵌入和组合嵌入基线进行了比较。R-MEFN在多标签情绪检测的相同基准测试中表现优于其他方法,在罕见类别上表现出特别的改进,同时在频繁情绪上保持良好的表现。此外,它优于所有单一嵌入基线,以及所有组合嵌入基线,其中来自多个源的嵌入只是简单地连接起来,显示了融合方法的强度。上下文嵌入的交叉验证整合允许R-MEFN在所有情感类别中产生更平衡和更具表现力的表征。这些研究结果表明,使用多重上下文嵌入和残差学习来解决情感检测中的阶级不平衡问题是有效的。我们认为R-MEFN是创建情感感知工具的一个有用的起点,它可以让软件团队跟踪项目中的情感动态,并识别隐藏的风险。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Enhancing emotion detection in software engineering using a residual multi-embedding fusion network
Emotions play a crucial role in the development of software, particularly in team dynamics, productivity, and decision making. Developer communications — such as bug reports, code reviews, and online discussions — often include emotional signals. But part of the difficulty in identifying these feelings lies in the technicality and informality of the words, and in the utter scarcity of even critical but rare emotions like fear and surprise. This study aims to improve the detection of both common and minority emotions in software engineering texts, with a focus on better identifying underrepresented classes. We introduce R-MEFN, Residual Multi-Embedding Fusion Network, a network model that employs multiple types of contextual word embeddings to represent the text. Residual connections serve to keep signals of subtle emotionality, especially ones associated with emotions that are infrequent. Cross-validation is performed to choose the best combination of embeddings to be fused. We evaluate R-MEFN on two real-world datasets (StackOverflow and Jira), comparing it to other prior approaches on these benchmarks, as well as to single-embedding and combined-embedding baselines. R-MEFN outperforms other methods that have been evaluated in the same benchmarks for multilabel emotion detection, showing particular improvements on rare classes while keeping a good performance on frequent emotions. Also, it outperforms all single-embedding baselines, as well as all combined-embedding baselines, where embeddings from multiple sources are simply concatenated, showing the strength of the fusion approach. The cross-validated integration of contextual embeddings allows R-MEFN to produce more balanced and expressive representations across all emotion categories. These findings show the effectiveness of using multiple contextual embeddings and residual learning for addressing class imbalance in emotion detection. We see R-MEFN as a useful starting point towards creating emotion aware tools that can allow software teams to track emotional dynamics in a project, and identify hidden risks.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Systems and Software
Journal of Systems and Software 工程技术-计算机:理论方法
CiteScore
8.60
自引率
5.70%
发文量
193
审稿时长
16 weeks
期刊介绍: The Journal of Systems and Software publishes papers covering all aspects of software engineering and related hardware-software-systems issues. All articles should include a validation of the idea presented, e.g. through case studies, experiments, or systematic comparisons with other approaches already in practice. Topics of interest include, but are not limited to: •Methods and tools for, and empirical studies on, software requirements, design, architecture, verification and validation, maintenance and evolution •Agile, model-driven, service-oriented, open source and global software development •Approaches for mobile, multiprocessing, real-time, distributed, cloud-based, dependable and virtualized systems •Human factors and management concerns of software development •Data management and big data issues of software systems •Metrics and evaluation, data mining of software development resources •Business and economic aspects of software development processes The journal welcomes state-of-the-art surveys and reports of practical experience for all of these topics.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信