{"title":"基于音视频积分的实时声源定位","authors":"Tokuo Tsuji, Kenichi Yamamoto, I. Ishii","doi":"10.1109/ICPR.2006.967","DOIUrl":null,"url":null,"abstract":"We propose a pixelwise sound source localization algorithm based on audiovisual frequency integration. The localization is realized by detecting the common vibration dynamics of sound sources in the audio and the brightness signal. In order to detect the common vibration dynamics, temporal correlation values between the two signals are calculated in the algorithm. Several experimental results are shown for vibrated objects, and the pixelwise sound source localization images are obtained","PeriodicalId":236033,"journal":{"name":"18th International Conference on Pattern Recognition (ICPR'06)","volume":"95 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2006-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Real-time Sound Source Localization Based on Audiovisual Frequency Integration\",\"authors\":\"Tokuo Tsuji, Kenichi Yamamoto, I. Ishii\",\"doi\":\"10.1109/ICPR.2006.967\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We propose a pixelwise sound source localization algorithm based on audiovisual frequency integration. The localization is realized by detecting the common vibration dynamics of sound sources in the audio and the brightness signal. In order to detect the common vibration dynamics, temporal correlation values between the two signals are calculated in the algorithm. Several experimental results are shown for vibrated objects, and the pixelwise sound source localization images are obtained\",\"PeriodicalId\":236033,\"journal\":{\"name\":\"18th International Conference on Pattern Recognition (ICPR'06)\",\"volume\":\"95 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2006-08-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"18th International Conference on Pattern Recognition (ICPR'06)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICPR.2006.967\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"18th International Conference on Pattern Recognition (ICPR'06)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICPR.2006.967","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Real-time Sound Source Localization Based on Audiovisual Frequency Integration
We propose a pixelwise sound source localization algorithm based on audiovisual frequency integration. The localization is realized by detecting the common vibration dynamics of sound sources in the audio and the brightness signal. In order to detect the common vibration dynamics, temporal correlation values between the two signals are calculated in the algorithm. Several experimental results are shown for vibrated objects, and the pixelwise sound source localization images are obtained