{"title":"基于多元碎片分布有限混合的稳健聚类","authors":"M. Maleki, G. McLachlan, Sharon X. Lee","doi":"10.1177/1471082X211048660","DOIUrl":null,"url":null,"abstract":"A flexible class of multivariate distributions called scale mixtures of fragmental normal (SMFN) distributions, is introduced. Its extension to the case of a finite mixture of SMFN (FM-SMFN) distributions is also proposed. The SMFN family of distributions is convenient and effective for modelling data with skewness, discrepant observations and population heterogeneity. It also possesses some other desirable properties, including an analytically tractable density and ease of computation for simulation and estimation of parameters. A stochastic representation of the SMFN distribution is given and then a hierarchical representation is described, the latter aids in parameter estimation, derivation of statistical properties and simulations. Maximum likelihood estimation of the FM-SMFN distribution via the expectation–maximization (EM) algorithm is outlined before the clustering performance of the proposed mixture model is illustrated using simulated and real datasets. In particular, the ability of FM-SMFN distributions to model data generated from various well-known families is demonstrated.","PeriodicalId":49476,"journal":{"name":"Statistical Modelling","volume":"23 1","pages":"247 - 272"},"PeriodicalIF":1.2000,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Robust clustering based on finite mixture of multivariate fragmental distributions\",\"authors\":\"M. Maleki, G. McLachlan, Sharon X. Lee\",\"doi\":\"10.1177/1471082X211048660\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A flexible class of multivariate distributions called scale mixtures of fragmental normal (SMFN) distributions, is introduced. Its extension to the case of a finite mixture of SMFN (FM-SMFN) distributions is also proposed. The SMFN family of distributions is convenient and effective for modelling data with skewness, discrepant observations and population heterogeneity. It also possesses some other desirable properties, including an analytically tractable density and ease of computation for simulation and estimation of parameters. A stochastic representation of the SMFN distribution is given and then a hierarchical representation is described, the latter aids in parameter estimation, derivation of statistical properties and simulations. Maximum likelihood estimation of the FM-SMFN distribution via the expectation–maximization (EM) algorithm is outlined before the clustering performance of the proposed mixture model is illustrated using simulated and real datasets. In particular, the ability of FM-SMFN distributions to model data generated from various well-known families is demonstrated.\",\"PeriodicalId\":49476,\"journal\":{\"name\":\"Statistical Modelling\",\"volume\":\"23 1\",\"pages\":\"247 - 272\"},\"PeriodicalIF\":1.2000,\"publicationDate\":\"2023-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Statistical Modelling\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://doi.org/10.1177/1471082X211048660\",\"RegionNum\":4,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"STATISTICS & PROBABILITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Statistical Modelling","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1177/1471082X211048660","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
Robust clustering based on finite mixture of multivariate fragmental distributions
A flexible class of multivariate distributions called scale mixtures of fragmental normal (SMFN) distributions, is introduced. Its extension to the case of a finite mixture of SMFN (FM-SMFN) distributions is also proposed. The SMFN family of distributions is convenient and effective for modelling data with skewness, discrepant observations and population heterogeneity. It also possesses some other desirable properties, including an analytically tractable density and ease of computation for simulation and estimation of parameters. A stochastic representation of the SMFN distribution is given and then a hierarchical representation is described, the latter aids in parameter estimation, derivation of statistical properties and simulations. Maximum likelihood estimation of the FM-SMFN distribution via the expectation–maximization (EM) algorithm is outlined before the clustering performance of the proposed mixture model is illustrated using simulated and real datasets. In particular, the ability of FM-SMFN distributions to model data generated from various well-known families is demonstrated.
期刊介绍:
The primary aim of the journal is to publish original and high-quality articles that recognize statistical modelling as the general framework for the application of statistical ideas. Submissions must reflect important developments, extensions, and applications in statistical modelling. The journal also encourages submissions that describe scientifically interesting, complex or novel statistical modelling aspects from a wide diversity of disciplines, and submissions that embrace the diversity of applied statistical modelling.