S. Chalamala, Balakrishna Gudla, B. Yegnanarayana, K. Sheela
{"title":"改进的唇轮廓提取用于视觉语音识别","authors":"S. Chalamala, Balakrishna Gudla, B. Yegnanarayana, K. Sheela","doi":"10.1109/ICCE.2015.7066486","DOIUrl":null,"url":null,"abstract":"Automatic speech recognition systems perform on acoustic speech signals and therefore they are unreliable in noisy environments. Visual speech features such as lip movements of a speaker can make the speech recognition system robust. To track the lip movements, lip contour extraction is a necessary step and plays a crucial role in the visual speech recognition. In this paper, we propose a new method for lip contour extraction using fuzzy clustering with elliptic shape information and active contour model. In this method, we combined both image and model based methods to improve the performance of lip contour extraction. Our proposed lip contour extraction method outperforms few of existing lip contour extraction methods. We applied our lip contour extraction method on 3600 lip images from VidTimit database and results are found better than the few existing lip contour extraction methods.","PeriodicalId":169402,"journal":{"name":"2015 IEEE International Conference on Consumer Electronics (ICCE)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":"{\"title\":\"Improved lip contour extraction for visual speech recognition\",\"authors\":\"S. Chalamala, Balakrishna Gudla, B. Yegnanarayana, K. Sheela\",\"doi\":\"10.1109/ICCE.2015.7066486\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Automatic speech recognition systems perform on acoustic speech signals and therefore they are unreliable in noisy environments. Visual speech features such as lip movements of a speaker can make the speech recognition system robust. To track the lip movements, lip contour extraction is a necessary step and plays a crucial role in the visual speech recognition. In this paper, we propose a new method for lip contour extraction using fuzzy clustering with elliptic shape information and active contour model. In this method, we combined both image and model based methods to improve the performance of lip contour extraction. Our proposed lip contour extraction method outperforms few of existing lip contour extraction methods. We applied our lip contour extraction method on 3600 lip images from VidTimit database and results are found better than the few existing lip contour extraction methods.\",\"PeriodicalId\":169402,\"journal\":{\"name\":\"2015 IEEE International Conference on Consumer Electronics (ICCE)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-03-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"8\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 IEEE International Conference on Consumer Electronics (ICCE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCE.2015.7066486\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 IEEE International Conference on Consumer Electronics (ICCE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCE.2015.7066486","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Improved lip contour extraction for visual speech recognition
Automatic speech recognition systems perform on acoustic speech signals and therefore they are unreliable in noisy environments. Visual speech features such as lip movements of a speaker can make the speech recognition system robust. To track the lip movements, lip contour extraction is a necessary step and plays a crucial role in the visual speech recognition. In this paper, we propose a new method for lip contour extraction using fuzzy clustering with elliptic shape information and active contour model. In this method, we combined both image and model based methods to improve the performance of lip contour extraction. Our proposed lip contour extraction method outperforms few of existing lip contour extraction methods. We applied our lip contour extraction method on 3600 lip images from VidTimit database and results are found better than the few existing lip contour extraction methods.