Self-organizing Map vs. Spectral Clustering on Visual Feature Extraction for Human Interface

N. Tsuruta, S. Aly, S. Maeda, S. Takahashi, T. Morimoto
{"title":"Self-organizing Map vs. Spectral Clustering on Visual Feature Extraction for Human Interface","authors":"N. Tsuruta, S. Aly, S. Maeda, S. Takahashi, T. Morimoto","doi":"10.1109/IFOST.2006.312245","DOIUrl":null,"url":null,"abstract":"Tasks of image recognition become important components for multi-modal interface. For developing feasible components, problems of huge dimensionality and non-linearity must be resolved. Image recognition consists of three stages: calibration stage, feature extraction (or representation) stage and recognition stage. For recognition stage, state of the art methods including nonlinear methods were proposed. On the other hand, linear methods, such as principle component analysis and linear discriminant method, are commonly used yet for feature extraction stage. Self-organizing feature map and spectral clustering are candidates of the non-linear feature extraction. Both methods have many empirical successes because of their simplicity and non-linearity. In this paper, we analyze characteristic of those methods. A summary of their characteristics shows the possibility to combine the both methods into a new approach. To clarify the importance of this topic, we also describe an overview of our multi-modal interface including lip-reading.","PeriodicalId":103784,"journal":{"name":"2006 International Forum on Strategic Technology","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2006-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2006 International Forum on Strategic Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IFOST.2006.312245","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Tasks of image recognition become important components for multi-modal interface. For developing feasible components, problems of huge dimensionality and non-linearity must be resolved. Image recognition consists of three stages: calibration stage, feature extraction (or representation) stage and recognition stage. For recognition stage, state of the art methods including nonlinear methods were proposed. On the other hand, linear methods, such as principle component analysis and linear discriminant method, are commonly used yet for feature extraction stage. Self-organizing feature map and spectral clustering are candidates of the non-linear feature extraction. Both methods have many empirical successes because of their simplicity and non-linearity. In this paper, we analyze characteristic of those methods. A summary of their characteristics shows the possibility to combine the both methods into a new approach. To clarify the importance of this topic, we also describe an overview of our multi-modal interface including lip-reading.
自组织映射与光谱聚类在人机界面视觉特征提取中的应用
图像识别任务成为多模态界面的重要组成部分。为了开发可行部件,必须解决巨大的量纲和非线性问题。图像识别包括三个阶段:校准阶段、特征提取(或表示)阶段和识别阶段。在识别阶段,提出了包括非线性方法在内的最新方法。另一方面,在特征提取阶段,常用的是线性方法,如主成分分析和线性判别法。自组织特征映射和光谱聚类是非线性特征提取的候选方法。这两种方法都因其简单和非线性而获得了许多经验上的成功。本文对这些方法的特点进行了分析。对这两种方法的特点进行总结,表明将这两种方法结合成一种新方法的可能性。为了阐明这个主题的重要性,我们还描述了包括唇读在内的多模态界面的概述。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信