Multimodal stock market emotion recognition model trained with a large language model
Chao Liu, Yuxia Miao, Qi Zhao, Chao Wang, Xiangyu Zhu
Engineering Applications of Artificial Intelligence, Vol. 154, Article 111035
Published: 2025-05-04 · DOI: 10.1016/j.engappai.2025.111035
URL: https://www.sciencedirect.com/science/article/pii/S0952197625010358
JCR: Q1 (Automation & Control Systems) · Impact Factor: 8.0
Citations: 0
Abstract
Stock market emotion recognition models are often trained on public textual datasets. These datasets are well labelled but may not share the distribution of real-world data and consequently fail to reflect it. Furthermore, stock market emotion is expressed not only in textual comments but also in images; recognizing emotion from textual comments alone is therefore one-sided. To address these issues, this paper proposes a multimodal emotion recognition framework trained via imitation learning from a large language model. The framework contains two main innovations. First, by leveraging a large language model's compositional capability, the framework generates pseudo labels from textual comments; by imitating the patterns of these pseudo labels, it can be trained directly on unlabelled real-world data, which addresses current models' distribution drift between public datasets and the real world. Second, the framework is equipped with multimodal fusion, enabling emotion recognition from textual stock market comments and images simultaneously. Compared with existing methods, the proposed method significantly improves the performance of stock market emotion recognition by leveraging large language models and multimodal stock market data. The experimental results demonstrate that the proposed framework outperforms existing single-modal and multimodal models, achieving an accuracy, precision, recall, and F1 score of 82.97%, 83.03%, 83.05%, and 82.88%, respectively. This framework provides a new pathway for multimodal and unsupervised/semi-supervised emotion recognition in the stock market.
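The pipeline the abstract describes can be sketched roughly as follows: an LLM supplies pseudo labels for unlabelled comments, and a classifier over fused text-plus-image features is trained to imitate those labels. This is an illustrative stand-in, not the authors' implementation: `label_with_llm` replaces a real LLM call with a keyword rule, and the feature extractors and `PerceptronClassifier` are toy placeholders for the paper's actual encoders and fusion model.

```python
# Illustrative sketch of the abstract's pipeline (NOT the authors' code):
# an LLM stand-in produces pseudo labels for unlabelled comments, and a
# tiny classifier on fused text+image features is trained to imitate them.

BEARISH_TERMS = ("sell", "crash", "drop", "loss")
BULLISH_TERMS = ("buy", "rally", "up", "gain")

def label_with_llm(comment: str) -> int:
    """Stand-in for querying a large language model for a pseudo label
    (1 = bearish, 0 = bullish). A real system would send the comment to
    an LLM; here a keyword rule plays that role."""
    return 1 if any(t in comment.lower() for t in BEARISH_TERMS) else 0

def text_features(comment: str) -> list:
    """Toy text features: counts of bearish and bullish keywords."""
    words = comment.lower().split()
    return [float(sum(w in BEARISH_TERMS for w in words)),
            float(sum(w in BULLISH_TERMS for w in words))]

def fuse(text_vec: list, image_vec: list) -> list:
    """Early multimodal fusion by concatenating feature vectors."""
    return text_vec + image_vec

class PerceptronClassifier:
    """Minimal linear model trained to imitate the LLM's pseudo labels."""
    def __init__(self, dim: int, lr: float = 0.1):
        self.w, self.b, self.lr = [0.0] * dim, 0.0, lr

    def predict(self, x: list) -> int:
        return 1 if sum(wi * xi for wi, xi in zip(self.w, x)) + self.b > 0 else 0

    def fit(self, X: list, y: list, epochs: int = 20) -> None:
        for _ in range(epochs):
            for xi, yi in zip(X, y):
                err = yi - self.predict(xi)
                if err:
                    self.w = [w + self.lr * err * v for w, v in zip(self.w, xi)]
                    self.b += self.lr * err

# Unlabelled real-world comments paired with toy image features
# (imagined as [red-candle ratio, green-candle ratio] of a chart image).
comments = [
    "markets crash as investors sell today",
    "big rally buy the gain",
    "loss deepens drop ahead",
    "up up buy more",
]
image_feats = [[0.8, 0.1], [0.1, 0.9], [0.9, 0.2], [0.2, 0.8]]

# Step 1: the LLM stand-in generates pseudo labels (no human labelling).
y = [label_with_llm(c) for c in comments]
# Step 2: train the fusion model to imitate those pseudo labels.
X = [fuse(text_features(c), img) for c, img in zip(comments, image_feats)]
clf = PerceptronClassifier(dim=len(X[0]))
clf.fit(X, y)
```

Concatenation-based early fusion is only one design choice; the paper's fusion mechanism may differ, but the training signal is the same in spirit: the multimodal model never sees human labels, only the LLM's pseudo labels.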
Journal Introduction
Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.