{"title":"Hierarchical Sound Classification using Mpeg-7","authors":"H. Crysandt","doi":"10.1109/MMSP.2005.248606","DOIUrl":null,"url":null,"abstract":"Due to the increasing amount of multimedia contents such as images, audio signals and videos in digital form the need for automatic or semi-automatic classification applications become more and more important. This paper describes a new sound classification technique based on the sound classification algorithm included in the MPEG-7 standard without extending or modifying it. There sequential classification is turned into a hierarchical. Thereby it is possible to use more linear transformations for (lossy) feature vector compression. Thus it is possible to work out differences between sound classes more precisely. This paper also gives a detailed view on how the algorithm is implemented using a XML database to store and request content information of the audio signals and model descriptions of sound classes using the MPEG-7 standard","PeriodicalId":191719,"journal":{"name":"2005 IEEE 7th Workshop on Multimedia Signal Processing","volume":"68 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2005-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2005 IEEE 7th Workshop on Multimedia Signal Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MMSP.2005.248606","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
Due to the increasing amount of multimedia contents such as images, audio signals and videos in digital form the need for automatic or semi-automatic classification applications become more and more important. This paper describes a new sound classification technique based on the sound classification algorithm included in the MPEG-7 standard without extending or modifying it. There sequential classification is turned into a hierarchical. Thereby it is possible to use more linear transformations for (lossy) feature vector compression. Thus it is possible to work out differences between sound classes more precisely. This paper also gives a detailed view on how the algorithm is implemented using a XML database to store and request content information of the audio signals and model descriptions of sound classes using the MPEG-7 standard