Principal appearance and motion from boosted spatiotemporal descriptors
Guoying Zhao, M. Pietikäinen
2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, 2008-06-23
DOI: 10.1109/CVPRW.2008.4563174 (https://doi.org/10.1109/CVPRW.2008.4563174)
Feature definition and feature selection are two important aspects of the visual analysis of motion. This paper proposes spatiotemporal local binary patterns computed at multiple resolutions for describing dynamic events, combining static and dynamic information from different spatiotemporal resolutions. Appearance and motion are the key components of visual analysis related to movement. The AdaBoost algorithm is used to learn the principal appearance and motion from spatiotemporal descriptors derived from three orthogonal planes, which provides important information about the locations and types of features for further analysis. In addition, learners are designed to select the most important features for each specific pair of classes. Experiments on two diverse visual analysis tasks, facial expression recognition and visual speech recognition, show the effectiveness of the approach.
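The idea of local binary pattern descriptors taken from three orthogonal planes can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a grayscale video stored as a NumPy array of shape (T, H, W), uses 8 neighbours at radius 1 without interpolation, works at a single resolution, and omits the AdaBoost selection step. The function names `lbp_codes` and `lbp_top_histogram` are hypothetical.

```python
import numpy as np

# 8 neighbours at radius 1 on the pixel grid (assumption: no interpolation).
OFFSETS = [(-1, -1), (-1, 0), (-1, 1), (0, 1), (1, 1), (1, 0), (1, -1), (0, -1)]


def lbp_codes(plane):
    """Return 8-bit LBP codes for the interior pixels of a 2-D array."""
    h, w = plane.shape
    center = plane[1:h - 1, 1:w - 1]
    codes = np.zeros(center.shape, dtype=np.int64)
    for bit, (dy, dx) in enumerate(OFFSETS):
        # Shifted view of the plane aligned with the interior pixels.
        neighbour = plane[1 + dy:h - 1 + dy, 1 + dx:w - 1 + dx]
        codes |= (neighbour >= center).astype(np.int64) << bit
    return codes


def lbp_top_histogram(volume):
    """Concatenate normalised LBP histograms from the XY, XT and YT planes.

    volume: array of shape (T, H, W); every dimension must be at least 3.
    """
    volume = np.asarray(volume)
    hists = []
    for axis in range(3):
        # axis 0: XY slices (one per frame), appearance information
        # axis 1: XT slices (fixed row), horizontal motion
        # axis 2: YT slices (fixed column), vertical motion
        hist = np.zeros(256, dtype=np.float64)
        for i in range(volume.shape[axis]):
            plane = np.take(volume, i, axis=axis)
            hist += np.bincount(lbp_codes(plane).ravel(), minlength=256)
        hists.append(hist / hist.sum())
    return np.concatenate(hists)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    video = rng.integers(0, 256, size=(10, 16, 16))
    descriptor = lbp_top_histogram(video)
    print(descriptor.shape)  # 3 planes x 256 bins
```

In a pipeline like the one described in the abstract, a descriptor of this kind would typically be computed per spatiotemporal block, and a boosting learner would then rank the resulting histogram features; that stage is left out of the sketch.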