Michele Jackeline Andressa Rosa;Marcos Roberto Souza;Carlos Leandro Silva Machado;Sandro José Rigo;Jorge Luis Victória Barbosa
{"title":"一组股票异质数据整合的巴西股市预测","authors":"Michele Jackeline Andressa Rosa;Marcos Roberto Souza;Carlos Leandro Silva Machado;Sandro José Rigo;Jorge Luis Victória Barbosa","doi":"10.1109/TLA.2025.11045642","DOIUrl":null,"url":null,"abstract":"The significant growth of the Brazilian stock market, coupled with the increase in investors in riskier assets, has generated a demand for automated forecasting tools. This research investigated the behavior and movement of stocks in the Brazilian market by integrating historical price series and textual data extracted from sentiments in X old Twitter messages and news collected from Google News. The analysis used natural language processing techniques for sentiment analysis, enabling an efficient fusion between numerical and textual information. Experiments were carried out with the assets PETR4, VALE3, BBDC4, and ITUB4, applying the Long Short Term Memory, Deep Neural Network, and Linear Regression models to predict the behavior of these assets. The results indicated that the LSTM models, especially Model 2, presented the best performance in terms of predictive capacity, with the lowest values of RMSE 0.0171 and high values of coefficint of determination ranging from 0.9707 to 0.9873. The study concludes that integrating numerical and textual data, combined with deep learning techniques, offers a promising approach to stock market forecasting, increasing forecasting gains.","PeriodicalId":55024,"journal":{"name":"IEEE Latin America Transactions","volume":"23 7","pages":"572-583"},"PeriodicalIF":1.3000,"publicationDate":"2025-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11045642","citationCount":"0","resultStr":"{\"title\":\"Brazilian Stock Market Forecast with Heterogeneous Data Integration for a Set of Stocks\",\"authors\":\"Michele Jackeline Andressa Rosa;Marcos Roberto Souza;Carlos Leandro Silva Machado;Sandro José Rigo;Jorge Luis Victória Barbosa\",\"doi\":\"10.1109/TLA.2025.11045642\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The significant growth of the Brazilian stock market, coupled with the increase in investors in riskier assets, has generated a demand for automated forecasting tools. This research investigated the behavior and movement of stocks in the Brazilian market by integrating historical price series and textual data extracted from sentiments in X old Twitter messages and news collected from Google News. The analysis used natural language processing techniques for sentiment analysis, enabling an efficient fusion between numerical and textual information. Experiments were carried out with the assets PETR4, VALE3, BBDC4, and ITUB4, applying the Long Short Term Memory, Deep Neural Network, and Linear Regression models to predict the behavior of these assets. The results indicated that the LSTM models, especially Model 2, presented the best performance in terms of predictive capacity, with the lowest values of RMSE 0.0171 and high values of coefficint of determination ranging from 0.9707 to 0.9873. The study concludes that integrating numerical and textual data, combined with deep learning techniques, offers a promising approach to stock market forecasting, increasing forecasting gains.\",\"PeriodicalId\":55024,\"journal\":{\"name\":\"IEEE Latin America Transactions\",\"volume\":\"23 7\",\"pages\":\"572-583\"},\"PeriodicalIF\":1.3000,\"publicationDate\":\"2025-06-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11045642\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Latin America Transactions\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/11045642/\",\"RegionNum\":4,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Latin America Transactions","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/11045642/","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
Brazilian Stock Market Forecast with Heterogeneous Data Integration for a Set of Stocks
The significant growth of the Brazilian stock market, coupled with the increase in investors in riskier assets, has generated a demand for automated forecasting tools. This research investigated the behavior and movement of stocks in the Brazilian market by integrating historical price series and textual data extracted from sentiments in X old Twitter messages and news collected from Google News. The analysis used natural language processing techniques for sentiment analysis, enabling an efficient fusion between numerical and textual information. Experiments were carried out with the assets PETR4, VALE3, BBDC4, and ITUB4, applying the Long Short Term Memory, Deep Neural Network, and Linear Regression models to predict the behavior of these assets. The results indicated that the LSTM models, especially Model 2, presented the best performance in terms of predictive capacity, with the lowest values of RMSE 0.0171 and high values of coefficint of determination ranging from 0.9707 to 0.9873. The study concludes that integrating numerical and textual data, combined with deep learning techniques, offers a promising approach to stock market forecasting, increasing forecasting gains.
期刊介绍:
IEEE Latin America Transactions (IEEE LATAM) is an interdisciplinary journal focused on the dissemination of original and quality research papers / review articles in Spanish and Portuguese of emerging topics in three main areas: Computing, Electric Energy and Electronics. Some of the sub-areas of the journal are, but not limited to: Automatic control, communications, instrumentation, artificial intelligence, power and industrial electronics, fault diagnosis and detection, transportation electrification, internet of things, electrical machines, circuits and systems, biomedicine and biomedical / haptic applications, secure communications, robotics, sensors and actuators, computer networks, smart grids, among others.