{"title":"An Adaptive Clustering Algorithm For Image Segmentation","authors":"T. Pappas","doi":"10.1109/CCV.1988.590006","DOIUrl":null,"url":null,"abstract":"The problem of segmenting images of objects with smooth surfaces is considered. The algorithm we present is a generalization of the ,K-means clustering algorithm to include spatial constraints and to account for local intensity variations in the image. Spatial constraints are included by the use of a Gibbs random field model. Local intensity variations are accounted for in a n iterative procedure involving averaging over a sliding window whose size decreases as the algorithm progresses. Results with an eight-neighbor Gibbs random field model applied to pictures of industrial objects, buildings, aerial photographs, optical characters, and faces, show that the algorithm performs better than the K-means algorithm and its nonadaptive extensions that incorporate spatial constraints by the use of Gibbs random fields. A hierarchical implementation is also presented and results in better performance and faster speed of execution. The segmented images are caricatures of the originals which preserve the most significant features, while removing unimportant details. They can be used in image recognition and as crude representations of the image. The caricatures are easy to display or print using a few grey levels and can be coded very efficiently. In particular, segmentation of faces results in binary sketches which preserve the main characteristics of the face, so that it is easily recognizable. I. INTRODUCTION E present a technique for segmenting a grey-scale image (typically 2.56 levels) into regions of uniform or slowly varying intensity. The segmented image consists of very few levels (typically 2-4). each denoting a different region, as shown in Fig. 1. It is a sketch, or caricature, of the original image which preserves its most significant features, while removing unimportant details. It can thus be the first stage of an image recognition system. However, assuming that the segmented image retains the intelligibility of the original, it can also be used as a crude representation of the image. The caricature has the advantage that it is easy to display or print with very few grey levels. The number of levels is crucial for special display media like paper, cloth, and binary computer screens. Also, the caricature can be coded very efficiently , since we only have to code the transitions between a few grey levels. We develop an algorithm that separates the pixels in the image into clusters based on both their intensity and their","PeriodicalId":229545,"journal":{"name":"[1988 Proceedings] Second International Conference on Computer Vision","volume":"95 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1988-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"537","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"[1988 Proceedings] Second International Conference on Computer Vision","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCV.1988.590006","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 537
Abstract
The problem of segmenting images of objects with smooth surfaces is considered. The algorithm we present is a generalization of the ,K-means clustering algorithm to include spatial constraints and to account for local intensity variations in the image. Spatial constraints are included by the use of a Gibbs random field model. Local intensity variations are accounted for in a n iterative procedure involving averaging over a sliding window whose size decreases as the algorithm progresses. Results with an eight-neighbor Gibbs random field model applied to pictures of industrial objects, buildings, aerial photographs, optical characters, and faces, show that the algorithm performs better than the K-means algorithm and its nonadaptive extensions that incorporate spatial constraints by the use of Gibbs random fields. A hierarchical implementation is also presented and results in better performance and faster speed of execution. The segmented images are caricatures of the originals which preserve the most significant features, while removing unimportant details. They can be used in image recognition and as crude representations of the image. The caricatures are easy to display or print using a few grey levels and can be coded very efficiently. In particular, segmentation of faces results in binary sketches which preserve the main characteristics of the face, so that it is easily recognizable. I. INTRODUCTION E present a technique for segmenting a grey-scale image (typically 2.56 levels) into regions of uniform or slowly varying intensity. The segmented image consists of very few levels (typically 2-4). each denoting a different region, as shown in Fig. 1. It is a sketch, or caricature, of the original image which preserves its most significant features, while removing unimportant details. It can thus be the first stage of an image recognition system. However, assuming that the segmented image retains the intelligibility of the original, it can also be used as a crude representation of the image. The caricature has the advantage that it is easy to display or print with very few grey levels. The number of levels is crucial for special display media like paper, cloth, and binary computer screens. Also, the caricature can be coded very efficiently , since we only have to code the transitions between a few grey levels. We develop an algorithm that separates the pixels in the image into clusters based on both their intensity and their