SVM-KM: speeding SVMs learning with a priori cluster selection and k-means

Proceedings. Vol.1. Sixth Brazilian Symposium on Neural Networks Pub Date : 2000-01-22 DOI:10.1109/SBRN.2000.889732

M. B. D. Almeida, A. Braga, J. P. Braga

引用次数: 135

Abstract

A procedure called SVM-KM, based on clustering by k-means and to accelerate the training of support vector machines, is the main objective of the work. During the support vector machines (SVMs) optimization phase, training vectors near the separation margins, are likely to become support vector and must be preserved. Conversely, training vectors far from the margins are not in general taken into account for the SVM's design process. SVM-KM groups the training vectors in many clusters. Clusters formed only by a vector that belongs to the same class label can be disregard and only cluster centers are used. On the other hand, clusters with more than one class label are unchanged and all training vectors belonging to them are considered. Clusters with mixed composition are likely to happen near the separation margins and they may hold some support vectors. Consequently, the number of vectors in a SVM training is smaller and the training time can be decreased without compromising the generalization capability of the SVM.

查看原文本刊更多论文

SVM-KM:利用先验聚类选择和k-means加速svm学习

基于k-means聚类和加速支持向量机训练的SVM-KM过程是该工作的主要目标。在支持向量机优化阶段，靠近分离边界的训练向量很可能成为支持向量，必须保留。相反，在SVM的设计过程中，通常不会考虑远离边缘的训练向量。SVM-KM将训练向量分组在多个聚类中。仅由属于同一类标签的向量形成的聚类可以忽略，只使用聚类中心。另一方面，具有多个类标签的聚类保持不变，并考虑属于它们的所有训练向量。混合组成的簇可能发生在分离边缘附近，它们可能拥有一些支持向量。因此，在不影响支持向量机泛化能力的情况下，支持向量机训练中的向量数量更少，训练时间也更短。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings. Vol.1. Sixth Brazilian Symposium on Neural Networks

自引率

0.00%

发文量