Greedy Optimization for Contiguity-Constrained Hierarchical Clustering
Diansheng Guo
2009 IEEE International Conference on Data Mining Workshops, published 2009-12-06
DOI: 10.1109/ICDMW.2009.75 (https://doi.org/10.1109/ICDMW.2009.75)
Citations: 14
Abstract
Discovering and constructing inherent regions in large spatial datasets is an important task in many research domains, such as climate zoning, eco-region analysis, public health mapping, and political redistricting. From the perspective of cluster analysis, this task requires each cluster to be geographically contiguous. This paper presents a contiguity-constrained hierarchical clustering and optimization method that partitions a set of spatial objects into a hierarchy of contiguous regions while optimizing an objective function. The method consists of two steps: contiguity-constrained hierarchical clustering and two-way fine-tuning. These two steps are repeated to create a hierarchy of regions. Evaluations and comparisons show that the proposed method consistently outperforms existing methods by a large margin in optimizing the objective function. Moreover, the method is flexible enough to accommodate different objective functions and additional constraints (such as a minimum size for each region), which are useful for various application domains.
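To make the first step concrete, the following is a minimal sketch of contiguity-constrained agglomerative clustering. It is not the paper's implementation: the objective (within-region sum of squared deviations), the exhaustive scan over region pairs, and the names `ssd`, `contiguous_regions`, `values`, and `neighbors` are all assumptions chosen for illustration, and the two-way fine-tuning step is not shown.

```python
from itertools import combinations


def ssd(values):
    """Sum of squared deviations from the mean (an assumed objective)."""
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values)


def contiguous_regions(values, neighbors, k):
    """Greedily merge adjacent regions until only k remain.

    values:    dict mapping object id -> numeric attribute
    neighbors: dict mapping object id -> set of adjacent object ids
    k:         number of regions to stop at
    """
    # Start with every object as its own region.
    regions = {i: {i} for i in values}
    while len(regions) > k:
        best = None
        for a, b in combinations(regions, 2):
            # Contiguity constraint: only regions sharing a border may merge.
            if not any(neighbors[i] & regions[b] for i in regions[a]):
                continue
            merged = [values[i] for i in regions[a] | regions[b]]
            # Increase in the objective caused by merging a and b.
            cost = (ssd(merged)
                    - ssd([values[i] for i in regions[a]])
                    - ssd([values[i] for i in regions[b]]))
            if best is None or cost < best[0]:
                best = (cost, a, b)
        if best is None:  # no adjacent pair left (disconnected graph)
            break
        _, a, b = best
        regions[a] |= regions.pop(b)
    return sorted(sorted(r) for r in regions.values())


# Five objects on a line: 0 - 1 - 2 - 3 - 4
values = {0: 1.0, 1: 1.1, 2: 5.0, 3: 5.2, 4: 1.0}
neighbors = {0: {1}, 1: {0, 2}, 2: {1, 3}, 3: {2, 4}, 4: {3}}
result = contiguous_regions(values, neighbors, 3)
print(result)  # [[0, 1], [2, 3], [4]]
```

In this toy example, object 4 has nearly the same value as objects 0 and 1 but is not adjacent to them, so it stays in its own region. That is the effect of the contiguity constraint: an unconstrained clustering would group it with 0 and 1.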