2012 IEEE International Symposium on Multimedia: Latest Publications

Batch Mode Active Learning for Multimedia Pattern Recognition
2012 IEEE International Symposium on Multimedia Pub Date : 2012-12-10 DOI: 10.1109/ISM.2012.101
Shayok Chakraborty, V. Balasubramanian, S. Panchanathan
Multimedia applications such as face recognition and facial expression recognition inherently rely on the availability of a large amount of labeled data to train a robust recognition system. To induce a reliable classification model for a multimedia pattern recognition application, the data is typically labeled by human experts based on domain knowledge. However, manual annotation of a large number of images is expensive in terms of time, labor, and human expertise. This has led to the development of active learning algorithms, which automatically identify the salient instances in a set of unlabeled data and are effective in reducing the human annotation effort needed to train a classification model. Further, to exploit the possible presence of multiple labeling oracles, there have been efforts toward batch mode active learning (BMAL), where a set of unlabeled images is selected for labeling simultaneously rather than one image at a time. Existing BMAL algorithms concentrate only on the development of a batch selection criterion and assume that the batch size (the number of samples to be queried from the unlabeled set) is specified in advance. However, in multimedia applications like face/facial expression recognition, it is difficult to fix a batch size in advance because of the dynamic nature of video streams. Moreover, applications like facial expression recognition involve a fuzzy label space because of the imprecision and vagueness of class label boundaries. This necessitates a BMAL framework for fuzzy label problems.

To address these fundamental challenges, we propose two novel BMAL techniques in this work: (i) a framework for dynamic batch mode active learning, which adaptively selects both the batch size and the specific instances to be queried based on the complexity of the data stream being analyzed, and (ii) a BMAL algorithm for fuzzy label classification problems. To the best of our knowledge, this is the first attempt to develop such techniques in the active learning literature.

Citations: 2
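The core selection loop behind batch mode active learning can be sketched with a standard entropy-based uncertainty criterion. This is a generic illustration, not the paper's algorithm: the `entropy` scoring, the fixed-size `select_batch`, and the threshold-driven `dynamic_batch` (which lets the batch grow or shrink with the stream's difficulty, echoing the adaptive batch-size idea) are all assumptions for exposition.

```python
import math

def entropy(probs):
    """Shannon entropy of a predicted class distribution (higher = more uncertain)."""
    return -sum(p * math.log(p) for p in probs if p > 0)

def select_batch(unlabeled, predict_proba, batch_size):
    """Classic fixed-size BMAL: query the batch_size most uncertain instances."""
    scored = [(entropy(predict_proba(x)), i) for i, x in enumerate(unlabeled)]
    scored.sort(reverse=True)
    return [i for _, i in scored[:batch_size]]

def dynamic_batch(unlabeled, predict_proba, threshold=0.5):
    """Adaptive variant: query every instance whose uncertainty exceeds a
    threshold, so the batch size follows the complexity of the data stream."""
    return [i for i, x in enumerate(unlabeled)
            if entropy(predict_proba(x)) > threshold]
```

With a confident classifier the dynamic batch stays small; when a hard segment of the stream arrives, more instances cross the threshold and the batch expands automatically.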
EDContours: High-Speed Parameter-Free Contour Detector Using EDPF
2012 IEEE International Symposium on Multimedia Pub Date : 2012-12-10 DOI: 10.1109/ISM.2012.37
C. Akinlar, C. Topal
We present a high-speed contour detector, which we name EDContours, that works by running our real-time parameter-free edge segment detector, Edge Drawing Parameter Free (EDPF), at different scale-space representations of an image. By combining the edge segments detected by EDPF at different scales, EDContours generates a soft contour map for a given image. EDContours works on gray-scale images, is parameter-free, runs very fast, and achieves an F-measure score of 0.62 on the Berkeley Segmentation Dataset (BSDS300).

Citations: 6
3D Model Hypotheses for Player Segmentation and Rendering in Free-Viewpoint Soccer Video
2012 IEEE International Symposium on Multimedia Pub Date : 2012-12-10 DOI: 10.1109/ISM.2012.47
Haopeng Li, M. Flierl
This paper presents a player segmentation approach based on 3D model hypotheses for soccer games. We use a hyperplane model for player modeling and a collection of piecewise geometric models for background modeling. To determine the assignment of each pixel in the image plane, we test it against two model hypotheses. We construct a cost function that measures the fitness of the model hypotheses for each pixel. To fully utilize the perspective diversity of the multiview imagery, we propose a three-step strategy to choose the best model for each pixel. Experimental results show that our segmentation approach based on 3D model hypotheses outperforms conventional temporal median and graph cut methods in both subjective and objective evaluation.

Citations: 0
TEEVE-Remote: A Novel User-Interaction Solution for 3D Tele-immersive System
2012 IEEE International Symposium on Multimedia Pub Date : 2012-12-10 DOI: 10.1109/ISM.2012.77
Pengye Xia, K. Nahrstedt, M. A. Jurik
A 3D tele-immersion (3DTI) system enables geographically distributed users to interact with each other in a virtual 3D space. Many 3DTI applications require users to make frequent physical movements (e.g., 3D interactive exergaming, remote therapy). However, the traditional user interaction (UI) solution for 3DTI systems, a large display with mouse and keyboard, does not give users much freedom to move during the interaction and thus struggles to meet this requirement. In this work, we design and implement a novel UI solution, TEEVE-Remote, which utilizes state-of-the-art camera, mobile phone, and display technologies to overcome these difficulties and thereby significantly improve the user experience of 3DTI systems.

Citations: 1
Automated Visual Quality Analysis for Media Production
2012 IEEE International Symposium on Multimedia Pub Date : 2012-12-10 DOI: 10.1109/ISM.2012.82
Hannes Fassold, Stefanie Wechtitsch, Albert Hofmann, W. Bailer, P. Schallauer, R. Borgotallo, A. Messina, Mohan Liu, P. Ndjiki-Nya, Peter Altendorf
Automatic quality control for audiovisual media is an important tool in the media production process. In this paper we present tools for assessing the quality of audiovisual content in order to decide on the reusability of archive content. We first discuss automatic detectors for the common impairments: noise and grain, video breakups, sharpness, image dynamics, and blocking. For efficient viewing and verification of the automatic results by an operator, three approaches to user interfaces are presented. Finally, we discuss the integration of the tools into a service-oriented architecture, focusing on the recent standardization efforts by EBU and AMWA's Joint Task Force on a Framework for Interoperability of Media Services in TV Production (FIMS).

Citations: 4
Detailed Comparative Analysis of VP8 and H.264
2012 IEEE International Symposium on Multimedia Pub Date : 2012-12-10 DOI: 10.1109/ISM.2012.33
Yousef O. Sharrab, Nabil J. Sarhan
VP8 has recently been offered by Google as an open video compression format in an attempt to compete with the widely used H.264 video compression standard. This paper describes the major differences between VP8 and H.264 and provides detailed comparative evaluations through extensive experiments. We use 29 raw video sequences offering a wide spectrum of resolutions and content characteristics, with resolutions ranging from 176×144 (QCIF) to 3840×2160 (2160p). To ensure a fair study, we use 3 coding presets in H.264, each with three types of tuning, and 7 presets in VP8. The presets cover a variety of achieved quality and complexity levels. The performance metrics include accuracy of bit rate handling, encoding speed, decoding speed, and perceptual video quality.

Citations: 19
Automatic Classification of Teeth in Bitewing Dental Images Using OLPP
2012 IEEE International Symposium on Multimedia Pub Date : 2012-12-10 DOI: 10.1109/ISM.2012.26
Nourdin Al-sherif, G. Guo, H. Ammar
Teeth classification is an important component in building an Automated Dental Identification System (ADIS), as part of creating a data structure that guides tooth-to-tooth matching. This helps avoid illogical comparisons that both inefficiently consume the limited computational resources and mislead decision-making. We tackle this problem by using the low-computational-cost, appearance-based Orthogonal Locality Preserving Projection (OLPP) algorithm to assign an initial class, i.e., molar or premolar, to the teeth in bitewing dental images. After this initial classification, we use a string matching technique based on teeth neighborhood rules to validate the initial teeth classes and thus assign each tooth a number corresponding to its location in the dental chart. On a large dataset of bitewing films containing 622 teeth, the proposed approach achieves a classification accuracy of 89%, and teeth class validation enhances the overall classification accuracy to 92%.

Citations: 5
Understanding Your Needs: An Adaptive VoD System
2012 IEEE International Symposium on Multimedia Pub Date : 2012-12-10 DOI: 10.1109/ISM.2012.55
Mu Mu, W. Knowles, N. Race
Video-on-demand (VoD) is becoming a popular service for commercial content distribution, offering end users the freedom to access recorded programmes. The management of on-demand assets is essential to maximise the efficiency of storage and network utilisation as well as advertisement. This paper introduces our recent efforts in the design and implementation of an adaptive VoD archive system in an IPTV infrastructure. The system exploits live statistics on user behaviour as well as the dynamic popularity of VoD programmes. Using the modelled programme popularity function, the VoD archive manages the VoD repository by adapting to the most recent user requests. The design has greatly improved the utility of the VoD repository and the user experience of on-demand services.

Citations: 5
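A popularity-driven archive of the kind the abstract describes can be sketched with a recency-weighted request counter. This is a hedged illustration only: the paper's actual popularity function is not given here, and the exponential half-life decay, the `PopularityArchive` class, and its eviction rule are assumptions chosen to show the adapt-to-recent-requests idea.

```python
import math

class PopularityArchive:
    """Keep the `capacity` most popular assets, where popularity is an
    exponentially decayed request count (recent requests weigh more)."""

    def __init__(self, capacity, half_life=3600.0):
        self.capacity = capacity
        self.decay = math.log(2) / half_life   # decay rate from half-life
        self.scores = {}                       # asset -> (score, last_update_time)

    def request(self, asset, now):
        """Record a request: decay the old score to `now`, then add 1."""
        score, t = self.scores.get(asset, (0.0, now))
        score = score * math.exp(-self.decay * (now - t)) + 1.0
        self.scores[asset] = (score, now)
        self._evict(now)

    def _evict(self, now):
        """Drop the least popular asset once the archive exceeds capacity."""
        if len(self.scores) <= self.capacity:
            return
        current = {a: s * math.exp(-self.decay * (now - t))
                   for a, (s, t) in self.scores.items()}
        del self.scores[min(current, key=current.get)]
```

Because every score decays toward zero, an asset that was popular last month but is no longer requested eventually loses its slot to newly trending programmes, which is the adaptive behaviour the system aims for.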
Incorporating Fuzziness in Extended Local Ternary Patterns
2012 IEEE International Symposium on Multimedia Pub Date : 2012-12-10 DOI: 10.1109/ISM.2012.36
W. Liao
Local binary/ternary patterns are widely employed to describe the structure of an image region. However, local patterns are very sensitive to noise due to the thresholding process. In this paper, we propose two different approaches to incorporating fuzziness in extended local ternary patterns (ELTP) to enhance the robustness of this class of operator to interference. The first approach replaces the ternary mapping mechanism with fuzzy membership functions to arrive at a fuzzy ELTP representation. The second approach modifies the clustering operation used in formulating ELTP into a fuzzy C-means procedure to construct soft histograms in the final feature representation, denoted FCM-ELTP. In experiments designed to compare ELTP with the newly proposed fuzzy ELTP and FCM-ELTP, both fuzzy descriptors exhibit better resistance to noise.

Citations: 2
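The noise sensitivity the abstract refers to, and the fuzzy fix, can be sketched on a single neighborhood. This is a generic illustration, not the paper's ELTP operator: the band width `t` and the triangular membership functions are assumptions. In the crisp version a neighbor one gray level past the band flips its code outright; in the soft version it merely shifts some weight between adjacent bins.

```python
def ltp_codes(neighbors, center, t=5):
    """Crisp local ternary pattern: each neighbor maps to +1, 0, or -1
    depending on whether it lies above, within, or below a band of
    half-width t around the center pixel."""
    return [1 if n >= center + t else -1 if n <= center - t else 0
            for n in neighbors]

def fuzzy_memberships(n, center, t=5):
    """Soft alternative: triangular membership degrees for the
    (-1, 0, +1) states, summing to 1, so a neighbor near a band edge
    contributes to two histogram bins instead of flipping a code."""
    d = n - center
    if d >= t:
        return (0.0, 0.0, 1.0)
    if d <= -t:
        return (1.0, 0.0, 0.0)
    if d >= 0:
        hi = d / t                 # partial membership in the +1 state
        return (0.0, 1.0 - hi, hi)
    lo = -d / t                    # partial membership in the -1 state
    return (lo, 1.0 - lo, 0.0)
```

For example, a neighbor at 102 against a center of 100 with `t=5` gets memberships (0.0, 0.6, 0.4): mostly "equal", partly "brighter", so one unit of sensor noise moves the descriptor only slightly rather than toggling a whole pattern bit.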
Color-Weakness Compensation Using Riemann Normal Coordinates
2012 IEEE International Symposium on Multimedia Pub Date : 2012-12-10 DOI: 10.1109/ISM.2012.42
S. Oshima, Rika Mochizuki, R. Lenz, J. Chao
We introduce normal coordinates in Riemann spaces as a tool for constructing color-weak compensation methods. We use them to compute color stimuli for a color-weak observer that result in the same color perception as the original image presented to a color-normal observer, in the sense that perceived color differences are identical for both. The compensation is obtained through a color-difference-preserving map, i.e., an isometry between the 3D color spaces of a color-normal and any given color-weak observer. This approach uses discrimination threshold data and is free from approximation errors due to local linearization. Performance is evaluated with the help of semantic differential (SD) tests.

Citations: 3