2014 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)最新文献

筛选
英文 中文
Chebyshev and conjugate gradient filters for graph image denoising 图图像去噪的切比雪夫和共轭梯度滤波器
2014 IEEE International Conference on Multimedia and Expo Workshops (ICMEW) Pub Date : 2014-07-14 DOI: 10.1109/ICMEW.2014.6890711
Dong Tian, H. Mansour, A. Knyazev, A. Vetro
{"title":"Chebyshev and conjugate gradient filters for graph image denoising","authors":"Dong Tian, H. Mansour, A. Knyazev, A. Vetro","doi":"10.1109/ICMEW.2014.6890711","DOIUrl":"https://doi.org/10.1109/ICMEW.2014.6890711","url":null,"abstract":"In 3D image/video acquisition, different views are often captured with varying noise levels across the views. In this paper, we propose a graph-based image enhancement technique that uses a higher quality view to enhance a degraded view. A depth map is utilized as auxiliary information to match the perspectives of the two views. Our method performs graph-based filtering of the noisy image by directly computing a projection of the image to be filtered onto a lower dimensional Krylov subspace of the graph Laplacian. We discuss two graph spectral denoising methods: first using Chebyshev polynomials, and second using iterations of the conjugate gradient algorithm. Our framework generalizes previously known polynomial graph filters, and we demonstrate through numerical simulations that our proposed technique produces subjectively cleaner images with about 1-3 dB improvement in PSNR over existing polynomial graph filters.","PeriodicalId":178700,"journal":{"name":"2014 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131000320","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 22
Interactive body part contrast mining for human interaction recognition 面向人机交互识别的交互式身体部位对比挖掘
2014 IEEE International Conference on Multimedia and Expo Workshops (ICMEW) Pub Date : 2014-07-14 DOI: 10.1109/ICMEW.2014.6890714
Yanli Ji, Guo Ye, Hong Cheng
{"title":"Interactive body part contrast mining for human interaction recognition","authors":"Yanli Ji, Guo Ye, Hong Cheng","doi":"10.1109/ICMEW.2014.6890714","DOIUrl":"https://doi.org/10.1109/ICMEW.2014.6890714","url":null,"abstract":"The recognition of multi-person interactions still remains a challenge because of the mutual occlusion and redundant poses. We propose an interactive body part contrast mining method based on joints for human interaction recognition. To efficiently describe interactions, we propose an interactive body part model which connects the interactive limbs of different participants to represent the relationship of interactive body parts. Then we calculate the spatial-temporal joint features for 8 interactive limb pairs in a short frame set for motion description (poselets). Employing contrast mining, we determine the essential interactive pairs and poselets for each interaction class to delete the redundant action information, and use these poselets to generate a poselet dictionary for interaction representation following bag-of-words. SVM with RBF kernel is adopted for recognition. We evaluate the proposed algorithm on two databases, the SBU interaction database and a newly collected RGBD-skeleton interaction database. Experiment results indicate the effectiveness of the proposed algorithm. The recognition accuracy reaches 85.4% on our interaction database, and 86.8% on SBU interaction database, 6% higher than the method in [1].","PeriodicalId":178700,"journal":{"name":"2014 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)","volume":"86 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132373347","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 125
Smart authoring and sharing of multimedia content in personal area networks based on Subject of Interest 基于兴趣主题的个人局域网多媒体内容智能创作与共享
2014 IEEE International Conference on Multimedia and Expo Workshops (ICMEW) Pub Date : 2014-07-14 DOI: 10.1109/ICMEW.2014.6890603
Belal Abu-Naim, W. Klas
{"title":"Smart authoring and sharing of multimedia content in personal area networks based on Subject of Interest","authors":"Belal Abu-Naim, W. Klas","doi":"10.1109/ICMEW.2014.6890603","DOIUrl":"https://doi.org/10.1109/ICMEW.2014.6890603","url":null,"abstract":"The evolution of smart phones' hardware and operating systems, users tendency to join social networks and to share multimedia content and daily life events, well-established methods and technologies of Semantic web, and the increasing establishment of Linked Open Data (LOD) APIs, motivate us to introduce a new approach in multimedia content composition and sharing in personal area networks that automatically analyzes, selects, composes, and shares the authored content. The capabilities of social network applications and the applications that address multimedia document composition, retrieval and presentation, and multimedia content sharing, do not go beyond allowing the users to share text, pictures, or other types of media content in social networks, performing manual or semi-automatic multimedia document composition, retrieving a list of pre-composed multimedia documents that eventually include datasets retrieved from DBpedia based on the geographic location. There is a lack of applications that are capable to automatically analyze the multimedia content on the devices of the users, compose multimedia documents about the Subject of Interest (SOI), retrieve and use additional data from LOD sources, and achieve a cross-multimedia document models authoring. In this paper we introduce our innovative approach of automatic analysis, composition, and sharing of multimedia content driven by a user's subject of interest (SOI). Our new approach enables us to achieve a smart multimedia authoring and sharing by incorporating new phases within the authoring process, which have not yet been applied by other applications.","PeriodicalId":178700,"journal":{"name":"2014 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)","volume":"433 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132864273","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Butterfly-like D-tree fusion strategy for real-time speech and music classification 实时语音和音乐分类的类蝴蝶d树融合策略
2014 IEEE International Conference on Multimedia and Expo Workshops (ICMEW) Pub Date : 2014-07-14 DOI: 10.1109/ICMEW.2014.6890706
Min Lu, W. Dou
{"title":"Butterfly-like D-tree fusion strategy for real-time speech and music classification","authors":"Min Lu, W. Dou","doi":"10.1109/ICMEW.2014.6890706","DOIUrl":"https://doi.org/10.1109/ICMEW.2014.6890706","url":null,"abstract":"Aimed at the problem of real-time speech and music discrimination, this paper proposes a frame-level classification method by using a novel “butterfly-like” fusion strategy based on decision tree (D-Tree).In our method, some homotypes of long-term features but in different time lengths are extracted to train each sub-classifier and make the fusion resultful. A testing experiment indicates our approach can achieve the desirable performance in reducing the misclassification and the imbalance of decision tree model. Meanwhile, superiorities in low overheads of computational complexity and memory resource make it competitive in practical applications.","PeriodicalId":178700,"journal":{"name":"2014 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128411523","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Two dimensional non-negative sparse Partial Least Squares for face recognition 二维非负稀疏偏最小二乘人脸识别
2014 IEEE International Conference on Multimedia and Expo Workshops (ICMEW) Pub Date : 2014-07-14 DOI: 10.1109/ICMEW.2014.6890696
Yongxin Ge, Sheng Huang, Xin Feng, Jiehui Zhang, Wenbin Bu, Dan Yang
{"title":"Two dimensional non-negative sparse Partial Least Squares for face recognition","authors":"Yongxin Ge, Sheng Huang, Xin Feng, Jiehui Zhang, Wenbin Bu, Dan Yang","doi":"10.1109/ICMEW.2014.6890696","DOIUrl":"https://doi.org/10.1109/ICMEW.2014.6890696","url":null,"abstract":"The Partial Least Squares (PLS) algorithm has been widely applied in face recognition in recent years. However, all the improved algorithms of PLS did not utilize non-negativity and sparsity synchronously to improve the recognition accuracy and robustness. In order to solve these problems, this paper proposes a novel algorithm named Two-Dimension Non-negative Sparse Partial Least Squares (2DNSPLS), which incorporates the constraints of non-negativity and sparse to 2DPLS while extracting the facial features. Consequently, not only do the features extracted by 2DNSPLS contain the label information, as well as the internal structure of image matrix, but they also contain local non-negative interpretability and sparsity. For evaluating the approach's performance, a series of experiments are conducted on the Yale and the PIE face databases, which demonstrate that the proposed approach outperforms the state-of-art algorithms and has good robustness to occlusion.","PeriodicalId":178700,"journal":{"name":"2014 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128540725","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Visualization of user interests in online music services 可视化用户对在线音乐服务的兴趣
2014 IEEE International Conference on Multimedia and Expo Workshops (ICMEW) Pub Date : 2014-07-14 DOI: 10.1109/ICMEW.2014.6890536
Jingxian Zhang, Dong Liu
{"title":"Visualization of user interests in online music services","authors":"Jingxian Zhang, Dong Liu","doi":"10.1109/ICMEW.2014.6890536","DOIUrl":"https://doi.org/10.1109/ICMEW.2014.6890536","url":null,"abstract":"Online music services have been popular for end users to obtain music, where user interests, as reflected by their downloading records, are crucial for service providers to understand users and thus to provide personalization. However, the raw downloading records are of huge volume and difficult to analyze intuitively. We study a visualization approach to analyzing downloading records so as to present user interests. To reveal the underlying relevance between music tracks, we utilized not only the metadata of music (especially genres), but also collaborative relevance that is voted by users. To present time varying user interests, we designed several new figures, namely Bean plot, Instrument plot, and Transitional Pie plot, that are capable in displaying different aspects of user interests variation. We have performed experiments with a real-world data set, and the results show the effectiveness of our proposed visualization method. Our work is also inspiring for visualization of time varying data in other applications.","PeriodicalId":178700,"journal":{"name":"2014 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121062088","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
A panoramic video system by direct manipulation video navigation 一个通过直接操作视频导航的全景视频系统
2014 IEEE International Conference on Multimedia and Expo Workshops (ICMEW) Pub Date : 2014-07-14 DOI: 10.1109/ICMEW.2014.6890611
Chi-Cheng Ju, Ding-Yun Chen, Chen-Tsai Ho, Chung-Hung Tsai
{"title":"A panoramic video system by direct manipulation video navigation","authors":"Chi-Cheng Ju, Ding-Yun Chen, Chen-Tsai Ho, Chung-Hung Tsai","doi":"10.1109/ICMEW.2014.6890611","DOIUrl":"https://doi.org/10.1109/ICMEW.2014.6890611","url":null,"abstract":"This paper presents a panoramic video system by direct manipulation video navigation without any image stitching. Our panoramic video browsing is interacted with user by displaying overlapped region of consecutive input video frames with corresponding viewing angle. That is, when user slides, the panoramic video will switch video frames to corresponding viewing angle according to sliding distance. When user stops to slide, the video will display the overlapped region of consecutive video frames in the same viewing angle. The switching of viewing angle and cropping of the overlapped region are based on video frame registration. The major advantage of our method is to provide a panoramic video navigation from hand-held mobile camera phone with low computation and high quality. Comparing to other panoramic video stitching approaches, our method can guarantee no ghosting and no distortion. Our demo video is in http://youtu.be/XYpPAdypIuI for detail.","PeriodicalId":178700,"journal":{"name":"2014 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)","volume":"45 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128763540","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Recognition by detection: Perceiving human motion through part-configured feature maps 通过检测识别:通过部分配置的特征映射来感知人体运动
2014 IEEE International Conference on Multimedia and Expo Workshops (ICMEW) Pub Date : 2014-07-14 DOI: 10.1109/ICMEW.2014.6890599
Lei Wang, Jun Wu, Zhimin Zhou, Yuncai Liu, Xu Zhao
{"title":"Recognition by detection: Perceiving human motion through part-configured feature maps","authors":"Lei Wang, Jun Wu, Zhimin Zhou, Yuncai Liu, Xu Zhao","doi":"10.1109/ICMEW.2014.6890599","DOIUrl":"https://doi.org/10.1109/ICMEW.2014.6890599","url":null,"abstract":"Visually perceiving human motion at semantic level is an important however challenging problem in multimedia area. In this work, we propose a novel approach to map the low-level responses from visual detection to semantically sensitive description to human actions. The feature map is triggered by the output of deformable part model detection, in which the critical information about body parts configuration is contained implicitly under the specific human actions. We map the filter responses of the detectors to an effective feature description, which encodes the position and appearance information of the root and every body parts simultaneously. Statistically, the obtained feature map captures the significance of relative configuration of body parts, therefore is robust to the false detections occurred in the individual part detectors. We conduct comprehensive experiments and the results show that the method generates discriminative action features and achieves remarkable performance in most of the cases.","PeriodicalId":178700,"journal":{"name":"2014 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116824701","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Online video session progress prediction using low-rank matrix completion 使用低秩矩阵补全的在线视频会话进度预测
2014 IEEE International Conference on Multimedia and Expo Workshops (ICMEW) Pub Date : 2014-07-14 DOI: 10.1109/ICMEW.2014.6890668
Gang Wu, Viswanathan Swaminathan, Saayan Mitra, Ratnesh Kumar
{"title":"Online video session progress prediction using low-rank matrix completion","authors":"Gang Wu, Viswanathan Swaminathan, Saayan Mitra, Ratnesh Kumar","doi":"10.1109/ICMEW.2014.6890668","DOIUrl":"https://doi.org/10.1109/ICMEW.2014.6890668","url":null,"abstract":"The prediction of online video session progress is useful for both optimizing and personalizing end-user experience. Our approach for online video recommendation is to use the session progress information instead of using a traditional rating system. We approach the prediction of session progress as a matrix completion problem, and complete the session progress matrix using noisy low-rank matrix completion (NLMC). Events collected from the end-user video sessions are tracked and logged in a server. We process a large number of logs, represent them as a partially observed user by video matrix, and use regularized nuclear norm minimization for matrix completion. Our initial results show improvement over baseline methods of prediction using just the means. We further investigate the reason for the difference in performance for the same prediction methods between our dataset and the dataset used in the Netflix challenge. Our experiments indicate that the number of observed entries at a given sparsity is a good indicator of the performance of the Singular Value Decomposition (SVD) based matrix completion methods. This implies that the results for our dataset would further improve by either observing more entries for the same set of users and videos or by including new users or videos at the same sparsity level. Moreover, we introduce an algorithm to generate submatrices of any required sparsity and size from a given matrix to fairly compare algorithm performances on datasets of varying characteristics.","PeriodicalId":178700,"journal":{"name":"2014 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)","volume":"104 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115759931","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Efficient image reranking by leveraging click data 有效的图像重新排序利用点击数据
2014 IEEE International Conference on Multimedia and Expo Workshops (ICMEW) Pub Date : 2014-07-14 DOI: 10.1109/ICMEW.2014.6890605
Shusheng Cen, Lezi Wang, Yanchao Feng, Hongliang Bai, Yuan Dong
{"title":"Efficient image reranking by leveraging click data","authors":"Shusheng Cen, Lezi Wang, Yanchao Feng, Hongliang Bai, Yuan Dong","doi":"10.1109/ICMEW.2014.6890605","DOIUrl":"https://doi.org/10.1109/ICMEW.2014.6890605","url":null,"abstract":"This paper introduces our system competing in MSR-Bing Image Retrieval Challenge at ICME 2014. The task of the challenge is to rank images by their relevance to a given topic, by leveraging cues hidden in search engine's click log. With the successful trial in last year's challenge, search-based method is shown to be effective in this task. We reserve the basic idea of search-based method in our new system, and there are also some improvements made this time. The first one is an adjustment in textual search algorithm for related clicked images in database. We simplified the previous scheme and make it more straight-forward and effient. The second inovation is using support vector machines to predict the relevance of query-image pair.","PeriodicalId":178700,"journal":{"name":"2014 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127359784","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信