Shengkun Wang, Taoran Ji, Linhan Wang, Yanshen Sun, Shang-Ching Liu, Amit Kumar, Chang-Tien Lu
{"title":"StockTime: A Time Series Specialized Large Language Model Architecture for Stock Price Prediction","authors":"Shengkun Wang, Taoran Ji, Linhan Wang, Yanshen Sun, Shang-Ching Liu, Amit Kumar, Chang-Tien Lu","doi":"arxiv-2409.08281","DOIUrl":null,"url":null,"abstract":"The stock price prediction task holds a significant role in the financial\ndomain and has been studied for a long time. Recently, large language models\n(LLMs) have brought new ways to improve these predictions. While recent\nfinancial large language models (FinLLMs) have shown considerable progress in\nfinancial NLP tasks compared to smaller pre-trained language models (PLMs),\nchallenges persist in stock price forecasting. Firstly, effectively integrating\nthe modalities of time series data and natural language to fully leverage these\ncapabilities remains complex. Secondly, FinLLMs focus more on analysis and\ninterpretability, which can overlook the essential features of time series\ndata. Moreover, due to the abundance of false and redundant information in\nfinancial markets, models often produce less accurate predictions when faced\nwith such input data. In this paper, we introduce StockTime, a novel LLM-based\narchitecture designed specifically for stock price data. Unlike recent FinLLMs,\nStockTime is specifically designed for stock price time series data. It\nleverages the natural ability of LLMs to predict the next token by treating\nstock prices as consecutive tokens, extracting textual information such as\nstock correlations, statistical trends and timestamps directly from these stock\nprices. StockTime then integrates both textual and time series data into the\nembedding space. By fusing this multimodal data, StockTime effectively predicts\nstock prices across arbitrary look-back periods. Our experiments demonstrate\nthat StockTime outperforms recent LLMs, as it gives more accurate predictions\nwhile reducing memory usage and runtime costs.","PeriodicalId":501139,"journal":{"name":"arXiv - QuantFin - Statistical Finance","volume":"17 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuantFin - Statistical Finance","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.08281","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The stock price prediction task holds a significant role in the financial
domain and has been studied for a long time. Recently, large language models
(LLMs) have brought new ways to improve these predictions. While recent
financial large language models (FinLLMs) have shown considerable progress in
financial NLP tasks compared to smaller pre-trained language models (PLMs),
challenges persist in stock price forecasting. Firstly, effectively integrating
the modalities of time series data and natural language to fully leverage these
capabilities remains complex. Secondly, FinLLMs focus more on analysis and
interpretability, which can overlook the essential features of time series
data. Moreover, due to the abundance of false and redundant information in
financial markets, models often produce less accurate predictions when faced
with such input data. In this paper, we introduce StockTime, a novel LLM-based
architecture designed specifically for stock price data. Unlike recent FinLLMs,
StockTime is specifically designed for stock price time series data. It
leverages the natural ability of LLMs to predict the next token by treating
stock prices as consecutive tokens, extracting textual information such as
stock correlations, statistical trends and timestamps directly from these stock
prices. StockTime then integrates both textual and time series data into the
embedding space. By fusing this multimodal data, StockTime effectively predicts
stock prices across arbitrary look-back periods. Our experiments demonstrate
that StockTime outperforms recent LLMs, as it gives more accurate predictions
while reducing memory usage and runtime costs.