Pierpaolo D’Urso , Livia De Giovanni , Lorenzo Federico , Vincenzina Vitale
{"title":"Fuzzy clustering of mixed data with spatial regularization","authors":"Pierpaolo D’Urso , Livia De Giovanni , Lorenzo Federico , Vincenzina Vitale","doi":"10.1016/j.spasta.2024.100874","DOIUrl":null,"url":null,"abstract":"<div><div>A fuzzy clustering model for data with mixed features and spatial constraints is proposed. The clustering model allows different types of variables, or attributes, to be taken into account. This result is achieved by combining the dissimilarity measures for each attribute employing a weighting scheme, to obtain a distance measure for multiple attributes. The weights are objectively computed during the optimization process. The weights reflect the relevance of each attribute type in the clustering results. A spatial term is taken into account, considering a wide definition of contiguity, either physical contiguity or the adjacency matrix in a network. Simulation studies and two empirical applications, including both physical and abstract definitions of contiguity are presented that show the effectiveness of the proposed clustering model.</div></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"65 ","pages":"Article 100874"},"PeriodicalIF":2.1000,"publicationDate":"2024-11-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Spatial Statistics","FirstCategoryId":"100","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2211675324000654","RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"GEOSCIENCES, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
A fuzzy clustering model for data with mixed features and spatial constraints is proposed. The clustering model allows different types of variables, or attributes, to be taken into account. This result is achieved by combining the dissimilarity measures for each attribute employing a weighting scheme, to obtain a distance measure for multiple attributes. The weights are objectively computed during the optimization process. The weights reflect the relevance of each attribute type in the clustering results. A spatial term is taken into account, considering a wide definition of contiguity, either physical contiguity or the adjacency matrix in a network. Simulation studies and two empirical applications, including both physical and abstract definitions of contiguity are presented that show the effectiveness of the proposed clustering model.
期刊介绍:
Spatial Statistics publishes articles on the theory and application of spatial and spatio-temporal statistics. It favours manuscripts that present theory generated by new applications, or in which new theory is applied to an important practical case. A purely theoretical study will only rarely be accepted. Pure case studies without methodological development are not acceptable for publication.
Spatial statistics concerns the quantitative analysis of spatial and spatio-temporal data, including their statistical dependencies, accuracy and uncertainties. Methodology for spatial statistics is typically found in probability theory, stochastic modelling and mathematical statistics as well as in information science. Spatial statistics is used in mapping, assessing spatial data quality, sampling design optimisation, modelling of dependence structures, and drawing of valid inference from a limited set of spatio-temporal data.