Research and implementation of a real time approach to lip detection in video sequences

Jianming Zhang, Liang-min Wang, Dejiao Niu, Y. Zhan
DOI: https://doi.org/10.1109/ICMLC.2003.1260027
Published: 2003-11-02
Source: 计算机工程 (Computer Engineering)
Citations: 12

Research and implementation of a real time approach to lip detection in video sequences
Locating the lips in video sequences is one of the first steps in an automatic lipreading system. This paper presents a new approach to lip detection based on Red Exclusion and the Fisher transform. First, we locate the face region using a skin-color model and motion correlation, then trisect the face image and keep only the lowest part, which contains the lips, for further processing. Second, we exclude the R component of the RGB color space and use the G and B components as the Fisher transform vector to enhance the lip image. Finally, we adaptively set a threshold on the enhanced image, based on the normal distribution of its gray-value histogram, to separate lip color from skin color. Experimental results show that this fast approach detects the whole lip effectively and is not affected by illumination or by differences between speakers.
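
The three stages after face localization can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not say how the Fisher vector is trained or how the threshold is derived from the histogram, so the sketch assumes a two-class Fisher linear discriminant fitted on hand-labeled (G, B) samples of lip and skin pixels, and a threshold of mean plus `k` standard deviations of the enhanced gray values. The function names, the parameter `k`, and the image layout (a list of rows of `(R, G, B)` tuples) are all hypothetical. Face localization by skin-color model and motion correlation is omitted.

```python
from statistics import mean, pstdev


def lower_third(face):
    """Keep the bottom third of a face crop given as a list of pixel rows."""
    return face[2 * len(face) // 3:]


def fisher_direction(lip_gb, skin_gb):
    """Fisher discriminant direction w = Sw^-1 (m_lip - m_skin) for (G, B) samples.

    Assumption: labeled lip and skin samples are available; the abstract
    does not describe the training data.
    """
    def mean_gb(samples):
        return (mean(g for g, _ in samples), mean(b for _, b in samples))

    m_lip, m_skin = mean_gb(lip_gb), mean_gb(skin_gb)
    # Within-class scatter matrix [[sgg, sgb], [sgb, sbb]], summed over both classes.
    sgg = sgb = sbb = 0.0
    for samples, (mg, mb) in ((lip_gb, m_lip), (skin_gb, m_skin)):
        for g, b in samples:
            sgg += (g - mg) ** 2
            sgb += (g - mg) * (b - mb)
            sbb += (b - mb) ** 2
    dg, db = m_lip[0] - m_skin[0], m_lip[1] - m_skin[1]
    det = sgg * sbb - sgb * sgb  # zero only for degenerate sample sets
    # Apply the inverse of the 2x2 scatter matrix to the mean difference.
    wg = (sbb * dg - sgb * db) / det
    wb = (sgg * db - sgb * dg) / det
    norm = (wg * wg + wb * wb) ** 0.5
    return (wg / norm, wb / norm)


def enhance(region, w):
    """Red Exclusion: ignore R and project each pixel's (G, B) onto w."""
    return [[w[0] * g + w[1] * b for _r, g, b in row] for row in region]


def lip_mask(gray, k=1.5):
    """Mark pixels above mean + k * std of the enhanced values (assumed rule)."""
    values = [v for row in gray for v in row]
    threshold = mean(values) + k * pstdev(values)
    return [[v > threshold for v in row] for row in gray]
```

Because the direction points from the skin mean toward the lip mean, lip pixels project to higher values than skin pixels, which is why the mask keeps values above the threshold. A suitable `k` depends on how much of the lower face the lips occupy, so it would need tuning on real footage.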