{"title":"A multifactor model using large language models and multimodal investor sentiment","authors":"Junhuan Zhang , Ziyan Zhang , Jiaqi Wen","doi":"10.1016/j.iref.2025.104281","DOIUrl":null,"url":null,"abstract":"<div><div>This study constructs multimodal investor sentiment indices using news and image data from the China News Service, covering the period from January 1, 2017, to December 31, 2024. We employ the RoBERTa model for text-based sentiment measurement and the Google Inception(v3) model for image-based sentiment measurement. We use a multimodal semantic correlation fusion model to integrate textual and visual sentiment features. These sentiment indices are categorised as industry-specific and market-wide investor sentiment, enabling separate analyses of their effects on stock markets. Furthermore, we develop a multifactor stock selection model that incorporates these sentiment indices with other microeconomic factors. Our findings demonstrate that multimodal sentiment analysis yields superior predictive accuracy. Industry-specific investor sentiment influences stock market returns, which in turn exacerbates changes in market-wide investor sentiment. Incorporating industry-specific sentiment into the multifactor stock selection model enhances portfolio returns, and combining market-wide sentiment with timing strategies further improves performance.</div></div>","PeriodicalId":14444,"journal":{"name":"International Review of Economics & Finance","volume":"102 ","pages":"Article 104281"},"PeriodicalIF":4.8000,"publicationDate":"2025-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Review of Economics & Finance","FirstCategoryId":"96","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1059056025004447","RegionNum":2,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BUSINESS, FINANCE","Score":null,"Total":0}
引用次数: 0
Abstract
This study constructs multimodal investor sentiment indices using news and image data from the China News Service, covering the period from January 1, 2017, to December 31, 2024. We employ the RoBERTa model for text-based sentiment measurement and the Google Inception(v3) model for image-based sentiment measurement. We use a multimodal semantic correlation fusion model to integrate textual and visual sentiment features. These sentiment indices are categorised as industry-specific and market-wide investor sentiment, enabling separate analyses of their effects on stock markets. Furthermore, we develop a multifactor stock selection model that incorporates these sentiment indices with other microeconomic factors. Our findings demonstrate that multimodal sentiment analysis yields superior predictive accuracy. Industry-specific investor sentiment influences stock market returns, which in turn exacerbates changes in market-wide investor sentiment. Incorporating industry-specific sentiment into the multifactor stock selection model enhances portfolio returns, and combining market-wide sentiment with timing strategies further improves performance.
期刊介绍:
The International Review of Economics & Finance (IREF) is a scholarly journal devoted to the publication of high quality theoretical and empirical articles in all areas of international economics, macroeconomics and financial economics. Contributions that facilitate the communications between the real and the financial sectors of the economy are of particular interest.