An Adaptive Clustering Algorithm For Image Segmentation

[1988 Proceedings] Second International Conference on Computer Vision Pub Date : 1988-12-05 DOI:10.1109/CCV.1988.590006

T. Pappas

{"title":"An Adaptive Clustering Algorithm For Image Segmentation","authors":"T. Pappas","doi":"10.1109/CCV.1988.590006","DOIUrl":null,"url":null,"abstract":"The problem of segmenting images of objects with smooth surfaces is considered. The algorithm we present is a generalization of the ,K-means clustering algorithm to include spatial constraints and to account for local intensity variations in the image. Spatial constraints are included by the use of a Gibbs random field model. Local intensity variations are accounted for in a n iterative procedure involving averaging over a sliding window whose size decreases as the algorithm progresses. Results with an eight-neighbor Gibbs random field model applied to pictures of industrial objects, buildings, aerial photographs, optical characters, and faces, show that the algorithm performs better than the K-means algorithm and its nonadaptive extensions that incorporate spatial constraints by the use of Gibbs random fields. A hierarchical implementation is also presented and results in better performance and faster speed of execution. The segmented images are caricatures of the originals which preserve the most significant features, while removing unimportant details. They can be used in image recognition and as crude representations of the image. The caricatures are easy to display or print using a few grey levels and can be coded very efficiently. In particular, segmentation of faces results in binary sketches which preserve the main characteristics of the face, so that it is easily recognizable. I. INTRODUCTION E present a technique for segmenting a grey-scale image (typically 2.56 levels) into regions of uniform or slowly varying intensity. The segmented image consists of very few levels (typically 2-4). each denoting a different region, as shown in Fig. 1. It is a sketch, or caricature, of the original image which preserves its most significant features, while removing unimportant details. It can thus be the first stage of an image recognition system. However, assuming that the segmented image retains the intelligibility of the original, it can also be used as a crude representation of the image. The caricature has the advantage that it is easy to display or print with very few grey levels. The number of levels is crucial for special display media like paper, cloth, and binary computer screens. Also, the caricature can be coded very efficiently , since we only have to code the transitions between a few grey levels. We develop an algorithm that separates the pixels in the image into clusters based on both their intensity and their","PeriodicalId":229545,"journal":{"name":"[1988 Proceedings] Second International Conference on Computer Vision","volume":"95 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1988-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"537","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"[1988 Proceedings] Second International Conference on Computer Vision","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCV.1988.590006","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 537

Abstract

The problem of segmenting images of objects with smooth surfaces is considered. The algorithm we present is a generalization of the ,K-means clustering algorithm to include spatial constraints and to account for local intensity variations in the image. Spatial constraints are included by the use of a Gibbs random field model. Local intensity variations are accounted for in a n iterative procedure involving averaging over a sliding window whose size decreases as the algorithm progresses. Results with an eight-neighbor Gibbs random field model applied to pictures of industrial objects, buildings, aerial photographs, optical characters, and faces, show that the algorithm performs better than the K-means algorithm and its nonadaptive extensions that incorporate spatial constraints by the use of Gibbs random fields. A hierarchical implementation is also presented and results in better performance and faster speed of execution. The segmented images are caricatures of the originals which preserve the most significant features, while removing unimportant details. They can be used in image recognition and as crude representations of the image. The caricatures are easy to display or print using a few grey levels and can be coded very efficiently. In particular, segmentation of faces results in binary sketches which preserve the main characteristics of the face, so that it is easily recognizable. I. INTRODUCTION E present a technique for segmenting a grey-scale image (typically 2.56 levels) into regions of uniform or slowly varying intensity. The segmented image consists of very few levels (typically 2-4). each denoting a different region, as shown in Fig. 1. It is a sketch, or caricature, of the original image which preserves its most significant features, while removing unimportant details. It can thus be the first stage of an image recognition system. However, assuming that the segmented image retains the intelligibility of the original, it can also be used as a crude representation of the image. The caricature has the advantage that it is easy to display or print with very few grey levels. The number of levels is crucial for special display media like paper, cloth, and binary computer screens. Also, the caricature can be coded very efficiently , since we only have to code the transitions between a few grey levels. We develop an algorithm that separates the pixels in the image into clusters based on both their intensity and their

查看原文本刊更多论文

一种图像分割的自适应聚类算法

研究了光滑表面物体图像的分割问题。我们提出的算法是k均值聚类算法的推广，包括空间约束并考虑图像中的局部强度变化。空间约束通过使用吉布斯随机场模型来包含。局部强度变化在一个n次迭代过程中被解释为涉及滑动窗口的平均，其大小随着算法的进展而减小。将八邻Gibbs随机场模型应用于工业对象、建筑物、航空照片、光学字符和人脸图像，结果表明该算法优于K-means算法及其使用Gibbs随机场合并空间约束的非自适应扩展。提出了一种分层实现方法，提高了系统的性能和执行速度。分割图像是原始图像的漫画，保留了最重要的特征，同时删除了不重要的细节。它们可以用于图像识别和作为图像的粗略表示。漫画很容易显示或打印使用几个灰色水平，可以非常有效地编码。特别是，人脸分割得到的二值草图保留了人脸的主要特征，使其易于识别。我们提出了一种将灰度图像(通常为2.56级)分割成均匀或缓慢变化强度区域的技术。分割后的图像由非常少的级别(通常为2-4)组成。每个表示一个不同的区域，如图1所示。它是原始图像的素描或漫画，保留了其最重要的特征，同时删除了不重要的细节。因此，它可以是图像识别系统的第一阶段。但是，假设分割后的图像保留了原始图像的可理解性，它也可以作为图像的粗略表示。漫画的优点是它很容易显示或印刷很少的灰度级。对于特殊的显示介质，如纸、布和二进制计算机屏幕，层数是至关重要的。此外，漫画可以非常有效地编码，因为我们只需要编码几个灰色级别之间的过渡。我们开发了一种算法，该算法将图像中的像素根据其强度和亮度分成簇

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

[1988 Proceedings] Second International Conference on Computer Vision

自引率

0.00%

发文量