11th International Multimedia Modelling Conference最新文献_第5页

Generic Audio Classification Using a Hybrid Model Based on GMMs and HMMs 基于GMMs和hmm混合模型的通用音频分类

11th International Multimedia Modelling Conference Pub Date : 2005-01-12 DOI: 10.1109/MMMC.2005.44

Menaka Rajapakse, L. Wyse

{"title":"Generic Audio Classification Using a Hybrid Model Based on GMMs and HMMs","authors":"Menaka Rajapakse, L. Wyse","doi":"10.1109/MMMC.2005.44","DOIUrl":"https://doi.org/10.1109/MMMC.2005.44","url":null,"abstract":"A hybrid model comprised of Gaussian Mixtures Models (GMMs) and Hidden Markov Models (HMMs) is used to model generic sounds with large intra class perceptual variations. Each class has variable number of mixture components in the GMM. The number of mixture components is derived using the Minimum Description Length (MDL) criterion. The overall performance of the hybrid model was compared against models based on HMMs and GMMs with a fixed number of mixture components across all classes. We show that a hybrid model outperforms both class-based GMMs, HMMs, and GMMs based on fixed number of components. Further, our experiments revealed that the contribution of transitions between states in HMMs has no significant effect on the overall classification performance of generic sounds when large intra class perceptual variations are present among sounds in the training and test datasets. Sounds that show multi-event structure with events that tend to be similar (repetitive) indicated improved performance when modeled with HMMs that can be attributed to HMM’s state transition property. Conversely, GMMs indicate better performance when the sound samples show subtle or no repetitive behavior. These results were validated using the MuscleFish sound database.","PeriodicalId":121228,"journal":{"name":"11th International Multimedia Modelling Conference","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129118188","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 23

Improved Perceptual Tempo Detection of Music 改进的音乐感知速度检测

11th International Multimedia Modelling Conference Pub Date : 2005-01-12 DOI: 10.1109/MMMC.2005.49

Bee Yong Chua, Guojun Lu

引用次数: 10

WS-QBE: A QBE-Like Query Language for Complex Multimedia Queries WS-QBE:用于复杂多媒体查询的类qbe查询语言

11th International Multimedia Modelling Conference Pub Date : 2005-01-12 DOI: 10.1109/MMMC.2005.72

I. Schmitt, Nadine Schulz, Thomas Herstel

引用次数: 19

Fast Screening in Large Face Databases Using Merit-Based Dominant Points 基于优势点的大型人脸数据库快速筛选

11th International Multimedia Modelling Conference Pub Date : 2005-01-12 DOI: 10.1109/MMMC.2005.38

Yongsheng Gao

引用次数: 0

A Community-Based Recommendation System to Reveal Unexpected Interests 基于社区的推荐系统揭示意外兴趣

11th International Multimedia Modelling Conference Pub Date : 2005-01-12 DOI: 10.1109/MMMC.2005.5

J. Kamahara, T. Asakawa, S. Shimojo, H. Miyahara

引用次数: 66

A Fuzzy Expert System for Concept-Based Image Indexing and Retrieval 基于概念的图像索引与检索模糊专家系统

11th International Multimedia Modelling Conference Pub Date : 2005-01-12 DOI: 10.1109/MMMC.2005.8

I. A. Azzam, C. Leung, J. F. Horwood

引用次数: 17

Browsing Texture Image Databases 浏览纹理图像数据库

11th International Multimedia Modelling Conference Pub Date : 2005-01-12 DOI: 10.1109/MMMC.2005.25

Suryani Lim, Lianping Chen, Guojun Lu, Ray Smith

引用次数: 3

A Wireless Layered Multicast Congestion Control Protocol for Multimedia 多媒体无线分层多播拥塞控制协议

11th International Multimedia Modelling Conference Pub Date : 2005-01-12 DOI: 10.1109/MMMC.2005.16

Dongsheng Yin, F. Zhang, Guangzhao Zhang

引用次数: 0

Semantic-Sensitive Classification for Large Image Libraries 大型图像库的语义敏感分类

11th International Multimedia Modelling Conference Pub Date : 2005-01-12 DOI: 10.1109/MMMC.2005.66

Jialie Shen, J. Shepherd, A. Ngu

引用次数: 14

Modeling of Output Constraints in Multimedia Database Systems 多媒体数据库系统中输出约束的建模

11th International Multimedia Modelling Conference Pub Date : 2005-01-12 DOI: 10.1109/MMMC.2005.54

Thomas Heimrich

引用次数: 3