{"title":"Treatment of Missing Market Data: Case of bond Yield Curve Estimation","authors":"M. Makushkin, V. Lapshin","doi":"10.26794/2587-5671-2023-27-6-44-53","DOIUrl":null,"url":null,"abstract":"Missing observations in market data is a frequent problem in financial studies. The problem of missing data is often overlooked in practice. Missing data is mostly treated using ad hoc methods or just ignored. Our goal is to develop practical recommendations for treatment of missing observations in financial data. We illustrate the issue with an example of yield curve estimation on Russian bond market. We compare three methods of missing data imputation — last observation carried forward, Kalman filtering and EM–algorithm — with a simple strategy of ignoring missing observations. We conclude that the impact of data imputation on the quality of yield curve estimation depends on model sensitivity to the market data. For non-sensitive models, such as Nelson-Siegel yield curve model, final effect is insignificant. For more sensitive models, such as bootstrapping, missing data imputation allows to increase the quality of yield curve estimation. However, the result does not depend on the chosen data imputation method. Both simple last observation carried forward method and more advanced EM–algorithm lead to similar final results. Therefore, when estimating yield curves on the illiquid markets with missing market data, we recommend to use either simple non-sensitive to the data parametric models of yield curve or to impute missing data before using more advanced and sensitive yield curve models.","PeriodicalId":36110,"journal":{"name":"Finance: Theory and Practice","volume":"24 6","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Finance: Theory and Practice","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.26794/2587-5671-2023-27-6-44-53","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Economics, Econometrics and Finance","Score":null,"Total":0}
引用次数: 0
Abstract
Missing observations in market data is a frequent problem in financial studies. The problem of missing data is often overlooked in practice. Missing data is mostly treated using ad hoc methods or just ignored. Our goal is to develop practical recommendations for treatment of missing observations in financial data. We illustrate the issue with an example of yield curve estimation on Russian bond market. We compare three methods of missing data imputation — last observation carried forward, Kalman filtering and EM–algorithm — with a simple strategy of ignoring missing observations. We conclude that the impact of data imputation on the quality of yield curve estimation depends on model sensitivity to the market data. For non-sensitive models, such as Nelson-Siegel yield curve model, final effect is insignificant. For more sensitive models, such as bootstrapping, missing data imputation allows to increase the quality of yield curve estimation. However, the result does not depend on the chosen data imputation method. Both simple last observation carried forward method and more advanced EM–algorithm lead to similar final results. Therefore, when estimating yield curves on the illiquid markets with missing market data, we recommend to use either simple non-sensitive to the data parametric models of yield curve or to impute missing data before using more advanced and sensitive yield curve models.
市场数据缺失是金融研究中经常出现的问题。在实践中,缺失数据的问题常常被忽视。缺失数据大多采用临时方法处理或直接忽略。我们的目标是为处理金融数据中的缺失观测值提出实用建议。我们以俄罗斯债券市场的收益率曲线估算为例说明这一问题。我们比较了三种缺失数据估算方法--结转最后观测值、卡尔曼滤波和 EM 算法--以及忽略缺失观测值的简单策略。我们得出的结论是,数据估算对收益率曲线估算质量的影响取决于模型对市场数据的敏感度。对于不敏感的模型,如 Nelson-Siegel 收益曲线模型,最终影响并不显著。对于更敏感的模型,如 bootstrapping 模型,缺失数据估算可以提高收益率曲线估算的质量。然而,结果并不取决于所选择的数据估算方法。简单的最后观察结转法和更先进的 EM 算法都能得出相似的最终结果。因此,在对市场数据缺失的非流动性市场进行收益率曲线估算时,我们建议使用简单的对数据不敏感的收益率曲线参数模型,或者在使用更先进、更敏感的收益率曲线模型之前对缺失数据进行估算。