{"title":"MLP, XGBoost, KAN, TDNN, and LSTM-GRU Hybrid RNN with Attention for SPX and NDX European Call Option Pricing","authors":"Boris Ter-Avanesov, Homayoon Beigi","doi":"arxiv-2409.06724","DOIUrl":null,"url":null,"abstract":"We explore the performance of various artificial neural network\narchitectures, including a multilayer perceptron (MLP), Kolmogorov-Arnold\nnetwork (KAN), LSTM-GRU hybrid recursive neural network (RNN) models, and a\ntime-delay neural network (TDNN) for pricing European call options. In this\nstudy, we attempt to leverage the ability of supervised learning methods, such\nas ANNs, KANs, and gradient-boosted decision trees, to approximate complex\nmultivariate functions in order to calibrate option prices based on past market\ndata. The motivation for using ANNs and KANs is the Universal Approximation\nTheorem and Kolmogorov-Arnold Representation Theorem, respectively.\nSpecifically, we use S\\&P 500 (SPX) and NASDAQ 100 (NDX) index options traded\nduring 2015-2023 with times to maturity ranging from 15 days to over 4 years\n(OptionMetrics IvyDB US dataset). Black \\& Scholes's (BS) PDE \\cite{Black1973}\nmodel's performance in pricing the same options compared to real data is used\nas a benchmark. This model relies on strong assumptions, and it has been\nobserved and discussed in the literature that real data does not match its\npredictions. Supervised learning methods are widely used as an alternative for\ncalibrating option prices due to some of the limitations of this model. In our\nexperiments, the BS model underperforms compared to all of the others. Also,\nthe best TDNN model outperforms the best MLP model on all error metrics. We\nimplement a simple self-attention mechanism to enhance the RNN models,\nsignificantly improving their performance. The best-performing model overall is\nthe LSTM-GRU hybrid RNN model with attention. Also, the KAN model outperforms\nthe TDNN and MLP models. We analyze the performance of all models by ticker,\nmoneyness category, and over/under/correctly-priced percentage.","PeriodicalId":501294,"journal":{"name":"arXiv - QuantFin - Computational Finance","volume":"23 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuantFin - Computational Finance","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.06724","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
We explore the performance of various artificial neural network
architectures, including a multilayer perceptron (MLP), Kolmogorov-Arnold
network (KAN), LSTM-GRU hybrid recursive neural network (RNN) models, and a
time-delay neural network (TDNN) for pricing European call options. In this
study, we attempt to leverage the ability of supervised learning methods, such
as ANNs, KANs, and gradient-boosted decision trees, to approximate complex
multivariate functions in order to calibrate option prices based on past market
data. The motivation for using ANNs and KANs is the Universal Approximation
Theorem and Kolmogorov-Arnold Representation Theorem, respectively.
Specifically, we use S\&P 500 (SPX) and NASDAQ 100 (NDX) index options traded
during 2015-2023 with times to maturity ranging from 15 days to over 4 years
(OptionMetrics IvyDB US dataset). Black \& Scholes's (BS) PDE \cite{Black1973}
model's performance in pricing the same options compared to real data is used
as a benchmark. This model relies on strong assumptions, and it has been
observed and discussed in the literature that real data does not match its
predictions. Supervised learning methods are widely used as an alternative for
calibrating option prices due to some of the limitations of this model. In our
experiments, the BS model underperforms compared to all of the others. Also,
the best TDNN model outperforms the best MLP model on all error metrics. We
implement a simple self-attention mechanism to enhance the RNN models,
significantly improving their performance. The best-performing model overall is
the LSTM-GRU hybrid RNN model with attention. Also, the KAN model outperforms
the TDNN and MLP models. We analyze the performance of all models by ticker,
moneyness category, and over/under/correctly-priced percentage.