具有实例级约束的半监督聚类和特征识别

H. Frigui, R. Mahdi
{"title":"具有实例级约束的半监督聚类和特征识别","authors":"H. Frigui, R. Mahdi","doi":"10.1109/FUZZY.2007.4295625","DOIUrl":null,"url":null,"abstract":"We propose a Semi-Supervised Clustering and Attribute Discrimination (S-SCAD) algorithm that performs fuzzy clustering and coarse feature weighting simultaneously. The supervision information in S-SCAD consists of a small set of constraints on which instances should or should not reside in the same cluster. The feature set is divided into logical subsets of features, and a degree of relevance is dynamically assigned to each subset based on its partial degree of dissimilarity. These weights have two advantages. First, they help in partitioning the data set into more meaningful clusters. Second, they can be used as part of a more complex learning system to enhance its learning behavior. We show that the partial supervision can guide the algorithm in learning the prototype parameters and the feature relevance weights, and thus, improve the final partition. The performance of the proposed algorithm is illustrated by using it to categorize a collection of color images. We use four feature subsets that encode color, structure, and texture information. The results are compared to other similar algorithms.","PeriodicalId":236515,"journal":{"name":"2007 IEEE International Fuzzy Systems Conference","volume":"54 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Semi-Supervised Clustering and Feature Discrimination with Instance-Level Constraints\",\"authors\":\"H. Frigui, R. Mahdi\",\"doi\":\"10.1109/FUZZY.2007.4295625\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We propose a Semi-Supervised Clustering and Attribute Discrimination (S-SCAD) algorithm that performs fuzzy clustering and coarse feature weighting simultaneously. The supervision information in S-SCAD consists of a small set of constraints on which instances should or should not reside in the same cluster. The feature set is divided into logical subsets of features, and a degree of relevance is dynamically assigned to each subset based on its partial degree of dissimilarity. These weights have two advantages. First, they help in partitioning the data set into more meaningful clusters. Second, they can be used as part of a more complex learning system to enhance its learning behavior. We show that the partial supervision can guide the algorithm in learning the prototype parameters and the feature relevance weights, and thus, improve the final partition. The performance of the proposed algorithm is illustrated by using it to categorize a collection of color images. We use four feature subsets that encode color, structure, and texture information. The results are compared to other similar algorithms.\",\"PeriodicalId\":236515,\"journal\":{\"name\":\"2007 IEEE International Fuzzy Systems Conference\",\"volume\":\"54 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2007-07-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2007 IEEE International Fuzzy Systems Conference\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/FUZZY.2007.4295625\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2007 IEEE International Fuzzy Systems Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/FUZZY.2007.4295625","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

摘要

提出了一种同时进行模糊聚类和粗特征加权的半监督聚类和属性判别(S-SCAD)算法。S-SCAD中的监督信息由一小组约束组成,这些约束决定了实例应该或不应该驻留在同一个集群中。将特征集划分为特征的逻辑子集,并根据每个子集的部分不相似度动态分配相关程度。这些砝码有两个优点。首先,它们有助于将数据集划分为更有意义的集群。其次,它们可以作为更复杂的学习系统的一部分,以增强其学习行为。研究表明,部分监督可以指导算法学习原型参数和特征相关权值,从而改善最终划分。通过对一组彩色图像进行分类,说明了该算法的性能。我们使用四个特征子集来编码颜色、结构和纹理信息。结果与其他类似算法进行了比较。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Semi-Supervised Clustering and Feature Discrimination with Instance-Level Constraints
We propose a Semi-Supervised Clustering and Attribute Discrimination (S-SCAD) algorithm that performs fuzzy clustering and coarse feature weighting simultaneously. The supervision information in S-SCAD consists of a small set of constraints on which instances should or should not reside in the same cluster. The feature set is divided into logical subsets of features, and a degree of relevance is dynamically assigned to each subset based on its partial degree of dissimilarity. These weights have two advantages. First, they help in partitioning the data set into more meaningful clusters. Second, they can be used as part of a more complex learning system to enhance its learning behavior. We show that the partial supervision can guide the algorithm in learning the prototype parameters and the feature relevance weights, and thus, improve the final partition. The performance of the proposed algorithm is illustrated by using it to categorize a collection of color images. We use four feature subsets that encode color, structure, and texture information. The results are compared to other similar algorithms.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信