Multimedia content classification using motion and audio information
Authors: Yao Wang, Jincheng Huang, Zhu Liu, Tsuhan Chen
DOI: 10.1109/ISCAS.1997.622200 (https://doi.org/10.1109/ISCAS.1997.622200)
Journal: 电路与系统学报, pp. 1488-1491 vol.2
Publication date: 1997-06-09
Citations: 23
Abstract
Content-based video segmentation and classification are key to the success of future multimedia databases. Research in this area over the past several years has focused on speech recognition and image analysis techniques. As a complementary effort to prior research, we have focused on the use of motion and audio characteristics. Fundamental to both segmentation and classification is the characterization of a given video segment by a set of features. In this paper, we describe several audio and motion features that have been found effective in distinguishing the motion and audio characteristics of different types of scenes.