Computer Vision Lip Reading (CV)

Somireddy Sumanth, Kadiyam Jyosthana, Jonnala Karthik Reddy, G. Geetha
Published in: 2022 International Conference on Advancements in Smart, Secure and Intelligent Computing (ASSIC)
Publication date: 2022-11-19
DOI: 10.1109/ASSIC55218.2022.10088386 (https://doi.org/10.1109/ASSIC55218.2022.10088386)
Citations: 0

Abstract

This work proposes recovering the pitch and content of speech from lip movements. We investigate how lip motion and speech combine, that is, how to learn which word was uttered from the motion of the lips alone. The emphasis is on decoding the full content of speech produced by different categories of speakers. Speakers are identified not only from facial attributes such as age, gender, and nationality, but also from lip shape and lip movement, so that speaker identity is treated as a perceptible expression. We present a new approach for capturing accurate lip movement in unconstrained settings. Comprehensive evaluations are carried out using quantitative and qualitative indicators and individual tests.