{"title":"基于线性有序分区的多变量数据排序聚类","authors":"Mariaelena Bottazzi Schenone, Maurizio Vichi","doi":"10.1007/s10182-025-00534-5","DOIUrl":null,"url":null,"abstract":"<div><p>This paper explores the use of clustering to rank multivariate observations by linking ranking to clustering through the Linear Ordered Partition (LOP) concept. A LOP allows optimal clustering into ordered “<i>equivalence classes</i>”. In fact, unlike simple units’ ordering, cluster ranking identifies classes where units are “<i>incomparable</i>”. The aim is to partition units into clusters with statistically distinct centroids, leading to an optimally ranked total order of clusters, where units within each one are considered “<i>ties</i>”. The proposed model finds the best least-squares (LS) LOP, alongside with a univariate transformation of the observed variables. This is because it identifies the LS LOP by orthogonally projecting multivariate units onto a line, thus creating a composite indicator that summarizes the observed variables. Model’s theoretical properties are discussed, and a large simulation study demonstrates its performance across different scenarios. Three real data applications highlight the method’s potential across different fields.\n</p></div>","PeriodicalId":55446,"journal":{"name":"Asta-Advances in Statistical Analysis","volume":"110 1","pages":"117 - 148"},"PeriodicalIF":1.4000,"publicationDate":"2025-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10182-025-00534-5.pdf","citationCount":"0","resultStr":"{\"title\":\"Clustering for ranking multivariate data by Linear Ordered Partitions\",\"authors\":\"Mariaelena Bottazzi Schenone, Maurizio Vichi\",\"doi\":\"10.1007/s10182-025-00534-5\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>This paper explores the use of clustering to rank multivariate observations by linking ranking to clustering through the Linear Ordered Partition (LOP) concept. A LOP allows optimal clustering into ordered “<i>equivalence classes</i>”. In fact, unlike simple units’ ordering, cluster ranking identifies classes where units are “<i>incomparable</i>”. The aim is to partition units into clusters with statistically distinct centroids, leading to an optimally ranked total order of clusters, where units within each one are considered “<i>ties</i>”. The proposed model finds the best least-squares (LS) LOP, alongside with a univariate transformation of the observed variables. This is because it identifies the LS LOP by orthogonally projecting multivariate units onto a line, thus creating a composite indicator that summarizes the observed variables. Model’s theoretical properties are discussed, and a large simulation study demonstrates its performance across different scenarios. Three real data applications highlight the method’s potential across different fields.\\n</p></div>\",\"PeriodicalId\":55446,\"journal\":{\"name\":\"Asta-Advances in Statistical Analysis\",\"volume\":\"110 1\",\"pages\":\"117 - 148\"},\"PeriodicalIF\":1.4000,\"publicationDate\":\"2025-07-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://link.springer.com/content/pdf/10.1007/s10182-025-00534-5.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Asta-Advances in Statistical Analysis\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s10182-025-00534-5\",\"RegionNum\":4,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"STATISTICS & PROBABILITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Asta-Advances in Statistical Analysis","FirstCategoryId":"100","ListUrlMain":"https://link.springer.com/article/10.1007/s10182-025-00534-5","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
Clustering for ranking multivariate data by Linear Ordered Partitions
This paper explores the use of clustering to rank multivariate observations by linking ranking to clustering through the Linear Ordered Partition (LOP) concept. A LOP allows optimal clustering into ordered “equivalence classes”. In fact, unlike simple units’ ordering, cluster ranking identifies classes where units are “incomparable”. The aim is to partition units into clusters with statistically distinct centroids, leading to an optimally ranked total order of clusters, where units within each one are considered “ties”. The proposed model finds the best least-squares (LS) LOP, alongside with a univariate transformation of the observed variables. This is because it identifies the LS LOP by orthogonally projecting multivariate units onto a line, thus creating a composite indicator that summarizes the observed variables. Model’s theoretical properties are discussed, and a large simulation study demonstrates its performance across different scenarios. Three real data applications highlight the method’s potential across different fields.
期刊介绍:
AStA - Advances in Statistical Analysis, a journal of the German Statistical Society, is published quarterly and presents original contributions on statistical methods and applications and review articles.