Kernel multi-granularity double-quantitative rough set based on ensemble empirical mode decomposition: Application to stock price trends prediction

IF 3.2 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

International Journal of Approximate Reasoning Pub Date : 2024-05-24 DOI:10.1016/j.ijar.2024.109217

Lin Zhang , Juncheng Bai , Bingzhen Sun , Yuqi Guo , Xiangtang Chen

{"title":"Kernel multi-granularity double-quantitative rough set based on ensemble empirical mode decomposition: Application to stock price trends prediction","authors":"Lin Zhang , Juncheng Bai , Bingzhen Sun , Yuqi Guo , Xiangtang Chen","doi":"10.1016/j.ijar.2024.109217","DOIUrl":null,"url":null,"abstract":"<div><p>As financial markets grow increasingly complex and dynamic, accurately predicting stock price trends becomes crucial for investors and financial analysts. Effectively identifying and selecting the most predictive attributes has become a challenge in stock trends prediction. To address this problem, this study proposes a new attribute reduction model. A rough set theory model is built by simplifying the prediction process and combining it with the long short-term memory network (LSTM) to enhance the accuracy of stock trends prediction. Firstly, the Ensemble Empirical Mode Decomposition (EEMD) is utilized to decompose the stock price data into a multi-granularity information system. Secondly, due to the numerical characteristics of stock data, a kernel function is applied to construct binary relationships. Thirdly, recognizing the noise inherent in stock data, the double-quantitative rough set theory is utilized to improve fault tolerance during the construction of decision attributes' lower and upper approximations. Moreover, calculate the correlation between conditional and decision attributes, and retain highly correlated conditional attributes for prediction. The kernel multi-granularity double-quantitative rough set based on the EEMD (EEMD-KMGDQRS) model proposed identifies the key factors behind stock data. Finally, the efficacy of the proposed model is validated by selecting 356 stocks from diverse industries in the Shanghai and Shenzhen stock markets as experimental samples. The results show that the proposed model improves the generalization of attribute reduction results through a fault tolerance mechanism by combining kernel function with multi-granularity double-quantitative rough set, thereby enhancing the accuracy of stock trends prediction in subsequent LSTM prediction processes.</p></div>","PeriodicalId":13842,"journal":{"name":"International Journal of Approximate Reasoning","volume":"172 ","pages":"Article 109217"},"PeriodicalIF":3.2000,"publicationDate":"2024-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Approximate Reasoning","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0888613X2400104X","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

As financial markets grow increasingly complex and dynamic, accurately predicting stock price trends becomes crucial for investors and financial analysts. Effectively identifying and selecting the most predictive attributes has become a challenge in stock trends prediction. To address this problem, this study proposes a new attribute reduction model. A rough set theory model is built by simplifying the prediction process and combining it with the long short-term memory network (LSTM) to enhance the accuracy of stock trends prediction. Firstly, the Ensemble Empirical Mode Decomposition (EEMD) is utilized to decompose the stock price data into a multi-granularity information system. Secondly, due to the numerical characteristics of stock data, a kernel function is applied to construct binary relationships. Thirdly, recognizing the noise inherent in stock data, the double-quantitative rough set theory is utilized to improve fault tolerance during the construction of decision attributes' lower and upper approximations. Moreover, calculate the correlation between conditional and decision attributes, and retain highly correlated conditional attributes for prediction. The kernel multi-granularity double-quantitative rough set based on the EEMD (EEMD-KMGDQRS) model proposed identifies the key factors behind stock data. Finally, the efficacy of the proposed model is validated by selecting 356 stocks from diverse industries in the Shanghai and Shenzhen stock markets as experimental samples. The results show that the proposed model improves the generalization of attribute reduction results through a fault tolerance mechanism by combining kernel function with multi-granularity double-quantitative rough set, thereby enhancing the accuracy of stock trends prediction in subsequent LSTM prediction processes.

查看原文本刊更多论文

基于集合经验模式分解的核多粒度双定量粗糙集：应用于股票价格趋势预测

随着金融市场日益复杂多变，准确预测股价走势对投资者和金融分析师来说变得至关重要。有效识别和选择最具预测性的属性已成为股票趋势预测中的一项挑战。为解决这一问题，本研究提出了一种新的属性还原模型。通过简化预测过程并将其与长短期记忆网络（LSTM）相结合，建立了一个粗糙集理论模型，以提高股票走势预测的准确性。首先，利用集合经验模式分解法（EEMD）将股价数据分解为多粒度信息系统。其次，根据股票数据的数值特征，运用核函数构建二元关系。第三，考虑到股票数据固有的噪声，利用双量化粗糙集理论提高决策属性下近似和上近似构建过程中的容错性。此外，计算条件属性和决策属性之间的相关性，保留高相关性的条件属性进行预测。基于 EEMD 的核多粒度双定量粗糙集（EEMD-KMGDQRS）模型可识别股票数据背后的关键因素。最后，通过选取沪深股市不同行业的 356 只股票作为实验样本，验证了所提模型的有效性。结果表明，所提模型通过将核函数与多粒度双定量粗糙集相结合的容错机制，提高了属性还原结果的泛化程度，从而提高了后续 LSTM 预测过程中股票走势预测的准确性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

International Journal of Approximate Reasoning 工程技术-计算机：人工智能

CiteScore

6.90

自引率

12.80%

发文量

170

审稿时长

67 days

期刊介绍： The International Journal of Approximate Reasoning is intended to serve as a forum for the treatment of imprecision and uncertainty in Artificial and Computational Intelligence, covering both the foundations of uncertainty theories, and the design of intelligent systems for scientific and engineering applications. It publishes high-quality research papers describing theoretical developments or innovative applications, as well as review articles on topics of general interest. Relevant topics include, but are not limited to, probabilistic reasoning and Bayesian networks, imprecise probabilities, random sets, belief functions (Dempster-Shafer theory), possibility theory, fuzzy sets, rough sets, decision theory, non-additive measures and integrals, qualitative reasoning about uncertainty, comparative probability orderings, game-theoretic probability, default reasoning, nonstandard logics, argumentation systems, inconsistency tolerant reasoning, elicitation techniques, philosophical foundations and psychological models of uncertain reasoning. Domains of application for uncertain reasoning systems include risk analysis and assessment, information retrieval and database design, information fusion, machine learning, data and web mining, computer vision, image and signal processing, intelligent data analysis, statistics, multi-agent systems, etc.