{"title":"标量时间序列聚类的一种测试方法","authors":"Daniel Peña, Ruey S. Tsay","doi":"10.1111/jtsa.12706","DOIUrl":null,"url":null,"abstract":"<p>This article considers clustering stationary scalar time series using their marginal properties and a hierarchical method. Two major issues involved are to detect the existence of clusters and to determine their number. We propose a new test statistic for detecting whether a data set consists of multiple clusters and a new procedure to determine the number of clusters. The proposed method is based on the jumps, that is, the increments, in the heights of the dendrogram when a hierarchical clustering is applied to the data. We use autoregressive sieve bootstrap to obtain a reference distribution of the test statistics and propose an iterative procedure to find the number of clusters. The clusters found are internally homogeneous according to the test statistics used in the analysis. The performance of the proposed procedure in finite samples is investigated by Monte Carlo simulations and illustrated by some empirical examples. Comparisons with some existing methods for selecting the number of clusters are also investigated.</p>","PeriodicalId":1,"journal":{"name":"Accounts of Chemical Research","volume":null,"pages":null},"PeriodicalIF":16.4000,"publicationDate":"2023-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/jtsa.12706","citationCount":"1","resultStr":"{\"title\":\"A testing approach to clustering scalar time series\",\"authors\":\"Daniel Peña, Ruey S. Tsay\",\"doi\":\"10.1111/jtsa.12706\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>This article considers clustering stationary scalar time series using their marginal properties and a hierarchical method. Two major issues involved are to detect the existence of clusters and to determine their number. We propose a new test statistic for detecting whether a data set consists of multiple clusters and a new procedure to determine the number of clusters. The proposed method is based on the jumps, that is, the increments, in the heights of the dendrogram when a hierarchical clustering is applied to the data. We use autoregressive sieve bootstrap to obtain a reference distribution of the test statistics and propose an iterative procedure to find the number of clusters. The clusters found are internally homogeneous according to the test statistics used in the analysis. The performance of the proposed procedure in finite samples is investigated by Monte Carlo simulations and illustrated by some empirical examples. Comparisons with some existing methods for selecting the number of clusters are also investigated.</p>\",\"PeriodicalId\":1,\"journal\":{\"name\":\"Accounts of Chemical Research\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":16.4000,\"publicationDate\":\"2023-06-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1111/jtsa.12706\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Accounts of Chemical Research\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1111/jtsa.12706\",\"RegionNum\":1,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CHEMISTRY, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Accounts of Chemical Research","FirstCategoryId":"100","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/jtsa.12706","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
A testing approach to clustering scalar time series
This article considers clustering stationary scalar time series using their marginal properties and a hierarchical method. Two major issues involved are to detect the existence of clusters and to determine their number. We propose a new test statistic for detecting whether a data set consists of multiple clusters and a new procedure to determine the number of clusters. The proposed method is based on the jumps, that is, the increments, in the heights of the dendrogram when a hierarchical clustering is applied to the data. We use autoregressive sieve bootstrap to obtain a reference distribution of the test statistics and propose an iterative procedure to find the number of clusters. The clusters found are internally homogeneous according to the test statistics used in the analysis. The performance of the proposed procedure in finite samples is investigated by Monte Carlo simulations and illustrated by some empirical examples. Comparisons with some existing methods for selecting the number of clusters are also investigated.
期刊介绍:
Accounts of Chemical Research presents short, concise and critical articles offering easy-to-read overviews of basic research and applications in all areas of chemistry and biochemistry. These short reviews focus on research from the author’s own laboratory and are designed to teach the reader about a research project. In addition, Accounts of Chemical Research publishes commentaries that give an informed opinion on a current research problem. Special Issues online are devoted to a single topic of unusual activity and significance.
Accounts of Chemical Research replaces the traditional article abstract with an article "Conspectus." These entries synopsize the research affording the reader a closer look at the content and significance of an article. Through this provision of a more detailed description of the article contents, the Conspectus enhances the article's discoverability by search engines and the exposure for the research.