{"title":"Temporal asymmetry in relations of acoustic and visual features of speech","authors":"G. Feldhoffer, T. Bárdi, G. Takács, A. Tihanyi","doi":"10.5281/ZENODO.40683","DOIUrl":null,"url":null,"abstract":"The fine temporal structure of relations of acoustic and visual features has been investigated to improve our speech to facial animation conversion system. Mutual information of acoustic and visual features has been calculated with different time shifts. Our result shows that the movement of feature points on the face of professional lip-speakers can precede the changes of acoustic parameters even by 100 milliseconds. Considering the measured time-shifts in synchrony in our system design the quality of our speech driven animations can be improved.","PeriodicalId":176384,"journal":{"name":"2007 15th European Signal Processing Conference","volume":"291 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2007 15th European Signal Processing Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5281/ZENODO.40683","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9
Abstract
The fine temporal structure of relations of acoustic and visual features has been investigated to improve our speech to facial animation conversion system. Mutual information of acoustic and visual features has been calculated with different time shifts. Our result shows that the movement of feature points on the face of professional lip-speakers can precede the changes of acoustic parameters even by 100 milliseconds. Considering the measured time-shifts in synchrony in our system design the quality of our speech driven animations can be improved.