Tim Kutta , Agnieszka Jach , Michel Ferreira Cardia Haddad , Piotr Kokoszka , Haonan Wang
{"title":"Detection and localization of changes in a panel of densities","authors":"Tim Kutta , Agnieszka Jach , Michel Ferreira Cardia Haddad , Piotr Kokoszka , Haonan Wang","doi":"10.1016/j.jmva.2024.105374","DOIUrl":null,"url":null,"abstract":"<div><div>We propose a new methodology for identifying and localizing changes in the Fréchet mean of a multivariate time series of probability densities. The functional data objects we study are random densities <span><math><msub><mrow><mi>f</mi></mrow><mrow><mi>s</mi><mo>,</mo><mi>t</mi></mrow></msub></math></span> indexed by discrete time <span><math><mi>t</mi></math></span> and a vector component <span><math><mi>s</mi></math></span>, which can be treated as a broadly understood spatial location. Our main objective is to identify the set of components <span><math><mi>s</mi></math></span>, where a change occurs with statistical certainty. A challenge of this analysis is that the densities <span><math><msub><mrow><mi>f</mi></mrow><mrow><mi>s</mi><mo>,</mo><mi>t</mi></mrow></msub></math></span> are not directly observable and must be estimated from sparse and potentially imbalanced data. Such setups are motivated by the analysis of two data sets that we investigate in this work. First, a hitherto unpublished large data set of Brazilian Covid infections and a second, a financial data set derived from intraday prices of U.S. Exchange Traded Funds. Chief statistical advances are the development of change point tests and estimators of components of change for multivariate time series of densities. We prove the theoretical validity of our methodology and investigate its finite sample performance in a simulation study.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":null,"pages":null},"PeriodicalIF":1.4000,"publicationDate":"2024-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Multivariate Analysis","FirstCategoryId":"100","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0047259X24000812","RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
引用次数: 0
Abstract
We propose a new methodology for identifying and localizing changes in the Fréchet mean of a multivariate time series of probability densities. The functional data objects we study are random densities indexed by discrete time and a vector component , which can be treated as a broadly understood spatial location. Our main objective is to identify the set of components , where a change occurs with statistical certainty. A challenge of this analysis is that the densities are not directly observable and must be estimated from sparse and potentially imbalanced data. Such setups are motivated by the analysis of two data sets that we investigate in this work. First, a hitherto unpublished large data set of Brazilian Covid infections and a second, a financial data set derived from intraday prices of U.S. Exchange Traded Funds. Chief statistical advances are the development of change point tests and estimators of components of change for multivariate time series of densities. We prove the theoretical validity of our methodology and investigate its finite sample performance in a simulation study.
我们提出了一种新方法,用于识别和定位概率密度多元时间序列的弗雷谢特均值变化。我们研究的功能数据对象是以离散时间 t 和矢量分量 s 为索引的随机密度 fs,t,后者可视为广义上的空间位置。我们的主要目标是找出在统计上确定发生变化的分量 s 的集合。这项分析的挑战在于,密度 fs,t 无法直接观测,必须从稀疏且可能不平衡的数据中估算出来。我们在本研究中对两组数据进行了分析,从而激发了这种设置。第一组是迄今为止尚未发表的巴西 Covid 感染的大型数据集,第二组是源自美国交易所交易基金盘中价格的金融数据集。统计方面的主要进展是开发了变化点检验和多元时间序列密度变化成分估计器。我们证明了我们方法的理论有效性,并在模拟研究中调查了其有限样本性能。
期刊介绍:
Founded in 1971, the Journal of Multivariate Analysis (JMVA) is the central venue for the publication of new, relevant methodology and particularly innovative applications pertaining to the analysis and interpretation of multidimensional data.
The journal welcomes contributions to all aspects of multivariate data analysis and modeling, including cluster analysis, discriminant analysis, factor analysis, and multidimensional continuous or discrete distribution theory. Topics of current interest include, but are not limited to, inferential aspects of
Copula modeling
Functional data analysis
Graphical modeling
High-dimensional data analysis
Image analysis
Multivariate extreme-value theory
Sparse modeling
Spatial statistics.