计算机视觉唇读(CV)

2022 International Conference on Advancements in Smart, Secure and Intelligent Computing (ASSIC) Pub Date : 2022-11-19 DOI:10.1109/ASSIC55218.2022.10088386

Somireddy Sumanth, Kadiyam Jyosthana, Jonnala Karthik Reddy, G. Geetha

{"title":"计算机视觉唇读(CV)","authors":"Somireddy Sumanth, Kadiyam Jyosthana, Jonnala Karthik Reddy, G. Geetha","doi":"10.1109/ASSIC55218.2022.10088386","DOIUrl":null,"url":null,"abstract":"The pitch and content of the speech in this proposed work can be picked up by lip movements. We investigate the function of lip and speech combinations that is, Learn the word uttered only by the motion of lips. Emphasis is to decode the full content of speech produced by different categories of speakers. Identification of speakers is caught not only from facial features such as age, gender, and nationality, but also from shape and lip movements, making the identification of speaker as a perceptible expression. Here, we present a new approach to gain proper lip movement in unrestrained situations. Different comprehensive examinations are carried out based on quantity, quality indicators and individual tests.","PeriodicalId":441406,"journal":{"name":"2022 International Conference on Advancements in Smart, Secure and Intelligent Computing (ASSIC)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Computer Vision Lip Reading(CV)\",\"authors\":\"Somireddy Sumanth, Kadiyam Jyosthana, Jonnala Karthik Reddy, G. Geetha\",\"doi\":\"10.1109/ASSIC55218.2022.10088386\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The pitch and content of the speech in this proposed work can be picked up by lip movements. We investigate the function of lip and speech combinations that is, Learn the word uttered only by the motion of lips. Emphasis is to decode the full content of speech produced by different categories of speakers. Identification of speakers is caught not only from facial features such as age, gender, and nationality, but also from shape and lip movements, making the identification of speaker as a perceptible expression. Here, we present a new approach to gain proper lip movement in unrestrained situations. Different comprehensive examinations are carried out based on quantity, quality indicators and individual tests.\",\"PeriodicalId\":441406,\"journal\":{\"name\":\"2022 International Conference on Advancements in Smart, Secure and Intelligent Computing (ASSIC)\",\"volume\":\"16 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 International Conference on Advancements in Smart, Secure and Intelligent Computing (ASSIC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ASSIC55218.2022.10088386\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Advancements in Smart, Secure and Intelligent Computing (ASSIC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ASSIC55218.2022.10088386","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

在这个提议的工作中，讲话的音高和内容可以通过嘴唇的运动来拾取。我们研究唇部和语音组合的功能，即学习仅通过唇部运动说出的单词。重点是解码不同类别的说话者所产生的言语的全部内容。说话人的身份识别不仅来自年龄、性别、国籍等面部特征，还来自形状和嘴唇的运动，使说话人的身份识别成为一种可感知的表情。在这里，我们提出了一种新的方法来获得适当的嘴唇运动在不受约束的情况下。根据数量、质量指标和个别测试进行不同的综合考试。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Computer Vision Lip Reading(CV)

The pitch and content of the speech in this proposed work can be picked up by lip movements. We investigate the function of lip and speech combinations that is, Learn the word uttered only by the motion of lips. Emphasis is to decode the full content of speech produced by different categories of speakers. Identification of speakers is caught not only from facial features such as age, gender, and nationality, but also from shape and lip movements, making the identification of speaker as a perceptible expression. Here, we present a new approach to gain proper lip movement in unrestrained situations. Different comprehensive examinations are carried out based on quantity, quality indicators and individual tests.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 International Conference on Advancements in Smart, Secure and Intelligent Computing (ASSIC)

自引率

0.00%

发文量