Extraction of frame-difference features based on PCA and ICA for lip-reading
K. Lee, M. Lee, Soo-Young Lee
DOI: 10.1109/IJCNN.2005.1555835
Published in: Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005.
Publication date: 2005-12-27
Citations: 9
Abstract
Features of human lip motion are extracted from video clips by principal component analysis (PCA) and independent component analysis (ICA). Unlike many other features extracted from single-frame static images or multi-frame dynamic images, our features are extracted from the differences of consecutive frames. PCA yields global features, while ICA extracts local features. The features are extracted from several consecutive multi-frame differences as well as from single-frame differences; the dynamic nature of the motion is more evident in the multi-frame differences. The resulting features may be applicable to lip-reading and to the synthesis of lip-motion video with text-to-speech capability.
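The pipeline described above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the video data is synthetic, the frame size and component counts are arbitrary assumptions, and scikit-learn's PCA and FastICA stand in for whatever PCA/ICA routines the paper used.

```python
import numpy as np
from sklearn.decomposition import PCA, FastICA

# Hypothetical lip-region clip: 20 frames of 16x16 grayscale pixels.
rng = np.random.default_rng(0)
frames = rng.random((20, 16, 16))

# Single-frame differences: subtract each frame from the next,
# then flatten each difference image into a vector.
diffs = np.diff(frames, axis=0).reshape(19, -1)  # shape (19, 256)

# Multi-frame differences (here with a gap of 3 frames) capture
# slower lip-motion dynamics over a longer time span.
gap = 3
multi_diffs = (frames[gap:] - frames[:-gap]).reshape(20 - gap, -1)

# PCA on the difference vectors yields global (holistic) features.
pca = PCA(n_components=5)
global_feats = pca.fit_transform(diffs)

# ICA yields local features: spatially localized motion components.
ica = FastICA(n_components=5, random_state=0, max_iter=1000)
local_feats = ica.fit_transform(diffs)

print(global_feats.shape, local_feats.shape)  # (19, 5) (19, 5)
```

Each row of `global_feats` and `local_feats` is then a low-dimensional descriptor of the lip motion between frames, usable as input to a downstream lip-reading classifier.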