A lip reading application on MS Kinect camera

Alper Yargic, M. Dogan
{"title":"A lip reading application on MS Kinect camera","authors":"Alper Yargic, M. Dogan","doi":"10.1109/INISTA.2013.6577656","DOIUrl":null,"url":null,"abstract":"Hearing-impaired people can read lips and lip reading applications may help them to improve their lip imitation skills. Speech of normal people can be recognized by even cellular phones but lip reading systems using only visual features remain important for hearing-impaired people. This paper aims to develop an application using MS Kinect camera to recognize Turkish color names to be used in the education of hearing-impaired children. Predefined lip points are located with depth information by the MS Kinect Face Tracking SDK. Words are segmented from the speech and the angles between the lip points are used as features to classify the words. Angles are computed using the 3D coordinates of the lip points. The KNN classifier is used to classify the words with Manhattan and Euclidean distances and the best feature vectors are tried to be found. As a result, the isolated words are classified with the success rate of 78.22%.","PeriodicalId":301458,"journal":{"name":"2013 IEEE INISTA","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"24","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 IEEE INISTA","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INISTA.2013.6577656","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 24

Abstract

Hearing-impaired people can read lips and lip reading applications may help them to improve their lip imitation skills. Speech of normal people can be recognized by even cellular phones but lip reading systems using only visual features remain important for hearing-impaired people. This paper aims to develop an application using MS Kinect camera to recognize Turkish color names to be used in the education of hearing-impaired children. Predefined lip points are located with depth information by the MS Kinect Face Tracking SDK. Words are segmented from the speech and the angles between the lip points are used as features to classify the words. Angles are computed using the 3D coordinates of the lip points. The KNN classifier is used to classify the words with Manhattan and Euclidean distances and the best feature vectors are tried to be found. As a result, the isolated words are classified with the success rate of 78.22%.
微软Kinect摄像头上的唇读应用
听力受损的人可以读唇语,唇读应用程序可以帮助他们提高嘴唇模仿技能。正常人的语言甚至可以被手机识别,但仅使用视觉特征的唇读系统对听障人士来说仍然很重要。本文旨在开发一款使用MS Kinect摄像头识别土耳其语颜色名称的应用程序,用于听障儿童的教育。预定义的唇点由MS Kinect Face Tracking SDK定位深度信息。从语音中分割单词,并使用唇点之间的角度作为特征对单词进行分类。角度是使用唇点的三维坐标计算的。利用KNN分类器对具有曼哈顿距离和欧几里得距离的词进行分类,并试图找到最佳特征向量。结果表明,孤立词的分类成功率为78.22%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信