{"title":"Uploader models for video concept detection","authors":"B. Mérialdo, U. Niaz","doi":"10.1109/CBMI.2014.6849847","DOIUrl":"https://doi.org/10.1109/CBMI.2014.6849847","url":null,"abstract":"In video indexing, it has been noticed that a simple uploader model was able to improve the MAP of concept detection in the TRECVID Semantic Concept Indexing (SIN) task. In this paper, we explore this idea further by comparing different types of uploader models and different types of score/rank distribution. We evaluate the performance of these combinations on the best SIN 2012 runs, and explore the impact of their parameters. We observe that the improvement is generally lower for the best runs than for the weaker runs. We also observe that tuning the models for each concept independently produces a much more significant improvement.","PeriodicalId":103056,"journal":{"name":"2014 12th International Workshop on Content-Based Multimedia Indexing (CBMI)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131748087","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Annotation of still images by multiple visual concepts","authors":"Abdelkader Hamadi, P. Mulhem, G. Quénot","doi":"10.1109/CBMI.2014.6849844","DOIUrl":"https://doi.org/10.1109/CBMI.2014.6849844","url":null,"abstract":"The automatic indexing of images and videos is a highly relevant and important research area in the field of multimedia information retrieval. The difficulty of this task is no longer something to prove. The majority of the efforts of the research community have been focused in the past on the detection of single concepts in images/videos, which is already a hard task. With the evolution of the information retrieval systems, users needs are more abstract, and lead to a larger number of words composing the queries. It is sensible to think about indexing multimedia documents by more than one concept, to help retrieval systems to answer such complex queries. Few studies addressed specifically the problem of detecting multiple concepts (multi-concept) in images and videos, most of them concern the detection of concept pairs. These studies showed that such challenge is even greater than the one of single concept detection. In this work, we address this problematic of mult-concept detection in still images. Two types of approaches are considered : 1) building models per multi-concept and 2) fusion of single concepts detectors. We conducted our evaluation on PASCAL VOC'12 collection regarding the detection of pairs and triplets of concepts. Our results show that the two types of approaches give globally comparable results, but they differ for specific kinds of pairs/triplets.","PeriodicalId":103056,"journal":{"name":"2014 12th International Workshop on Content-Based Multimedia Indexing (CBMI)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115069807","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Searching images with MPEG-7 (& MPEG-7-like) Powered Localized dEscriptors: The SIMPLE answer to effective Content Based Image Retrieval","authors":"C. Iakovidou, N. Anagnostopoulos, Athanasios Ch. Kapoutsis, Y. Boutalis, S. Chatzichristofis","doi":"10.1109/CBMI.2014.6849821","DOIUrl":"https://doi.org/10.1109/CBMI.2014.6849821","url":null,"abstract":"In this paper we propose and evaluate a new technique that localizes the description ability of the well established MPEG-7 and MPEG-7-like global descriptors. We employ the SURF detector to define salient image patches of blob-like textures and use the MPEG-7 Scalable Color (SC), Color Layout (CL) and Edge Histogram (EH) descriptors and the global MPEG-7-like Color and Edge Directivity Descriptor (CEDD), to produce the final local features' vectors. In order to test the new descriptors in the most straightforward fashion, we use the Bag-Of-Visual-Words framework for indexing and retrieval. The experimental results conducted on two different benchmark databases with varying codebook sizes, revealed an astonishing boost in the retrieval performance of the proposed descriptors compared both to their own performance (in their original form) and to other state-of-the-art methods of local and global descriptors. Open-source implementation of the proposed descriptors is available in c#, Java and MATLAB.","PeriodicalId":103056,"journal":{"name":"2014 12th International Workshop on Content-Based Multimedia Indexing (CBMI)","volume":"58 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123640862","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Scalable video summarization of cultural video documents in cross-media space based on data cube approach","authors":"Karina Ruby Perez-Daniel, M. Nakano-Miyatake, J. Benois-Pineau, S. Maabout, G. Sargent","doi":"10.1109/CBMI.2014.6849824","DOIUrl":"https://doi.org/10.1109/CBMI.2014.6849824","url":null,"abstract":"Video summarization has been a core problem to manage the growing amount of content in multimedia databases. An efficient video summary should display an overview of the video content and most of existing approaches fulfil this goal. However the information does not allow user to get all details of interest selectively and progressively. This paper proposes a scalable video summarization approach which provides multiple views and levels of details. Our method relies on the usage of cross media space and consensus clustering method. A video document is modelled as a data cube where the level of details is refined over nonconsensual features of the space. The method is designed for weakly structured content such as cultural documentaries and was tested on the INA corpus of cultural archives.","PeriodicalId":103056,"journal":{"name":"2014 12th International Workshop on Content-Based Multimedia Indexing (CBMI)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125868252","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A robust audio fingerprinting method for content-based copy detection","authors":"Chahid Ouali, P. Dumouchel, Vishwa Gupta","doi":"10.1109/CBMI.2014.6849814","DOIUrl":"https://doi.org/10.1109/CBMI.2014.6849814","url":null,"abstract":"This paper presents a novel audio fingerprinting method that is highly robust to a variety of audio distortions. It is based on unconventional audio fingerprints generation scheme. The robustness is achieved by generating different versions of the spectrogram matrix of the audio signal by using a threshold based on the average of the spectral values to prune this matrix. We transform each version of this pruned spectrogram matrix into a 2-D binary image. Multiple 2-D images suppress noise to a varying degree. This varying degree of noise suppression improves likelihood of one of the images matching a reference image. To speed up matching, we convert each image into an n-dimensional vector, and perform a nearest neighbor search based on this n-dimensional vector. We test this method on TRECVID 2010 content-based copy detection evaluation dataset. Experimental results show the effectiveness of such fingerprints even when the audio is distorted. We compare the proposed method to a state-of-the-art audio copy detection system. Results of this comparison show that our method achieves an improvement of 22% in localization accuracy, and lowers minimal normalized detection cost rate (min NDCR) by half for audio transformations T1 and T2.","PeriodicalId":103056,"journal":{"name":"2014 12th International Workshop on Content-Based Multimedia Indexing (CBMI)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129481398","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Online multimodal matrix factorization for human action video indexing","authors":"F. Páez, Jorge A. Vanegas, F. González","doi":"10.1109/CBMI.2014.6849823","DOIUrl":"https://doi.org/10.1109/CBMI.2014.6849823","url":null,"abstract":"This paper addresses the problem of searching for videos containing instances of specific human actions. The proposed strategy builds a multimodal latent space representation where both visual content and annotations are simultaneously mapped. The hypothesis behind the method is that such a latent space yields better results when built from multiple data modalities. The semantic embedding is learned using matrix factorization through stochastic gradient descent, which makes it suitable to deal with large-scale collections. The method is evaluated on a large-scale human action video dataset with three modalities corresponding to action labels, action attributes and visual features. The evaluation is based on a query-by-example strategy, where a sample video is used as input to the system. A retrieved video is considered relevant if it contains an instance of the same human action present in the query. Experimental results show that the learned multimodal latent semantic representation produces improved performance when compared with an exclusively visual representation.","PeriodicalId":103056,"journal":{"name":"2014 12th International Workshop on Content-Based Multimedia Indexing (CBMI)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121087815","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Ultrasound image processing based on machine learning for the fully automatic evaluation of the Carotid Intima-Media Thickness","authors":"R. Menchón-Lara, J. Sancho-Gómez","doi":"10.1109/CBMI.2014.6849839","DOIUrl":"https://doi.org/10.1109/CBMI.2014.6849839","url":null,"abstract":"Atherosclerosis is responsible for a large proportion of cardiovascular diseases (CVD), which are the leading cause of death in the world. The atherosclerotic process, mainly affecting the medium- and large-size arteries, is a degenerative condition that causes thickening and the reduction of elasticity in the blood vessels. The Intima-Media Thickness (IMT) of the Common Carotid Artery (CCA) is a reliable early indicator of atherosclerosis. Usually, it is manually measured by marking pairs of points on a B-mode ultrasound scan image of the CCA. This paper proposes an automatic image segmentation procedure for the measurement of the IMT, avoiding the user dependence and the inter-rater variability. In particular, Radial Basis Function (RBF) Networks are designed and trained by means of the Optimally Pruned-Extreme Learning Machine (OP-ELM) algorithm to classify pixels from a given ultrasound image, allowing the extraction of IMT boundaries. The suggested approach has been validated on a set of 25 ultrasound images by comparing the automatic segmentations with manual tracings.","PeriodicalId":103056,"journal":{"name":"2014 12th International Workshop on Content-Based Multimedia Indexing (CBMI)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127461359","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Inverse square rank fusion for multimodal search","authors":"André Mourão, Flávio Martins, João Magalhães","doi":"10.1109/CBMI.2014.6849825","DOIUrl":"https://doi.org/10.1109/CBMI.2014.6849825","url":null,"abstract":"Rank fusion is the task of combining multiple ranked document lists (ranks) into a single ranked list. It is a late fusion approach designed to improve the rankings produced by individual systems. Rank fusion techniques have been applied throughout multiple domains: e.g. combining results from multiple retrieval functions, or multimodal search where several feature spaces are common. In this paper, we present the Inverse Square Rank fusion method family, a set of novel fully unsupervised rank fusion methods based on quadratic decay and on logarithmic document frequency normalization. Our experiments created with standard Information Retrieval datasets (image and text fusion) and image datasets (image features fusion), show that ISR outperforms existing rank fusion algorithms. Thus, the proposed technique has comparable or better performance than existing state-of-the-art approaches, while maintaining a low computational complexity and avoiding the need for document scores or training data.","PeriodicalId":103056,"journal":{"name":"2014 12th International Workshop on Content-Based Multimedia Indexing (CBMI)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122794753","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Bag of morphological words for content-based geographical retrieval","authors":"E. Aptoula","doi":"10.1109/CBMI.2014.6849837","DOIUrl":"https://doi.org/10.1109/CBMI.2014.6849837","url":null,"abstract":"Placed in the context of geographical content-based image retrieval, in this paper we explore the description potential of morphological texture descriptors when combined with the popular bag-of-visual-words paradigm. In particular, we adapt existing global morphological texture descriptors, so that they are computed within local sub-windows and then form a vocabulary of “visual morphological words” through clustering. The resulting image features, are thus visual word histograms and are evaluated using the UC Merced Land Use-Land Cover dataset. Moreover, the local approach under study is compared against alternative global and local descriptors across a variety of settings. Despite being one of the initial attempts at localized morphological content description, the retrieval scores indicate that vocabulary based morphological content description possesses a significant discriminatory potential.","PeriodicalId":103056,"journal":{"name":"2014 12th International Workshop on Content-Based Multimedia Indexing (CBMI)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129569178","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Automatic object annotation from weakly labeled data with latent structured SVM","authors":"Christian X. Ries, Fabian Richter, Stefan Romberg, R. Lienhart","doi":"10.1109/CBMI.2014.6849838","DOIUrl":"https://doi.org/10.1109/CBMI.2014.6849838","url":null,"abstract":"In this paper we present an approach to automatic object annotation. We are given a set of positive images which all contain a certain object and our goal is to automatically determine the position of said object in each image. Our approach first applies a heuristic to identify initial bounding boxes based on color and gradient features. This heuristic is based on image and feature statistics. Then, the initial boxes are refined by a latent structured SVM training algorithm which is based on the CCCP training algorithm. We show that our approach outperforms previous work on multiple datasets.","PeriodicalId":103056,"journal":{"name":"2014 12th International Workshop on Content-Based Multimedia Indexing (CBMI)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114408563","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}