Bart Pieters, Charles-Frederik Hollemeersch, J. D. Cock, P. Lambert, R. Walle, Patrice Rondao-Alface, C. Stevens
{"title":"Multiview Video Coding Using Video Game Context Information","authors":"Bart Pieters, Charles-Frederik Hollemeersch, J. D. Cock, P. Lambert, R. Walle, Patrice Rondao-Alface, C. Stevens","doi":"10.1109/ICMEW.2012.8","DOIUrl":"https://doi.org/10.1109/ICMEW.2012.8","url":null,"abstract":"Remote rendering of video games for 3DTV becomes a hot topic with the emergence of 3D-enabled mobile devices and cloud-based services. It is however a very challenging task that requires live encoding at very low latency for user interactivity as well as optimal encoding decisions for an acceptable QoE. One key-aspect is that most video games make use of a 3D engine, which is typically accelerated on a GPU, containing information on the composition of the 3D scene and its objects as well as their motion. In this paper, we explore how to extract this information from the GPU and how to exploit it in order to successfully offload the most time consuming tasks of a multiview video encoder. We show that near-optimal encoding decisions can be taken while minimizing the encoder computational complexity as well as the total delay.","PeriodicalId":385797,"journal":{"name":"2012 IEEE International Conference on Multimedia and Expo Workshops","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126186233","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Real-Time Polyphonic Score Following System","authors":"Tingting Chou, Wen-Chieh Chen, Siang-An Wang, Ken-Ning Chang, Herng-Yow Chen","doi":"10.1109/ICMEW.2012.42","DOIUrl":"https://doi.org/10.1109/ICMEW.2012.42","url":null,"abstract":"This paper proposes an efficient score tracking system that can track musical performance on a score in real time. This kind of technology is called score following. It can be used in wide range of applications. Our algorithm is like Dannenberg's Dynamic Programming algorithm but extends his algorithm to process polyphony music. Ideally, the notes of polyphony have to be played at the same time. But in fact, it is impossible. When the notes are played, there are tiny differences among the time. We group nearly played note and classify them into leading notes and following notes. The algorithm, adopting Oshima's coping with four types of errors, also takes in consideration some performer's habits and circumstances, such as repeating unfamiliar parts or playing the wrong note.","PeriodicalId":385797,"journal":{"name":"2012 IEEE International Conference on Multimedia and Expo Workshops","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126749929","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
R. Tusch, Felix Pletzer, Vijay Mudunuri, A. Krätschmer, Karuna Sabbavarapu, M. Kogler, L. Böszörményi, B. Rinner, M. Harrer, Thomas Mariacher, Peter Hrassnig
{"title":"LOOK2 - A Video-based System for Real-time Notification of Relevant Traffic Events","authors":"R. Tusch, Felix Pletzer, Vijay Mudunuri, A. Krätschmer, Karuna Sabbavarapu, M. Kogler, L. Böszörményi, B. Rinner, M. Harrer, Thomas Mariacher, Peter Hrassnig","doi":"10.1109/ICMEW.2012.126","DOIUrl":"https://doi.org/10.1109/ICMEW.2012.126","url":null,"abstract":"We demonstrate our novel video-based real-time traffic event notification and verification system LOOK2. It generates fast and reliable traffic information about relevant traffic state and road conditions changes on observed roads. It utilizes installed road-side sensors providing low-level traffic and environmental data, as well as video sensors which gain high-level traffic information from live video analysis. Spatio-temporal data fusion is applied on all available traffic and environmental data to gain reliable traffic information. This traffic information is published by a DATEXII compliant web service to a web-based traffic desk application. Road network and traffic channel operators receive real-time and relevant traffic event notifications by using this application. The system also enables a visual verification of the notified situations.","PeriodicalId":385797,"journal":{"name":"2012 IEEE International Conference on Multimedia and Expo Workshops","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128149398","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Po-Yen Su, Chieh-Kai Kao, Tsung-Yau Huang, Homer H. Chen
{"title":"Adopting Perceptual Quality Metrics in Video Encoders: Progress and Critiques","authors":"Po-Yen Su, Chieh-Kai Kao, Tsung-Yau Huang, Homer H. Chen","doi":"10.1109/ICMEW.2012.20","DOIUrl":"https://doi.org/10.1109/ICMEW.2012.20","url":null,"abstract":"There is a need for video encoders to generate bitstreams of quality that matches human evaluation. This is often achieved by adopting perceptual quality metrics in video encoders. A huge body of work has been devoted to the development of perceptual video quality metrics that take the properties of human visual system into account. It is our interest in this paper to study how the existing video coding systems can benefit from such perceptual quality metrics. Specifically, we examine the existing perceptual video coding systems and introduce the concept of “codec-friendliness,” meaning how the perceptual quality metrics can be nicely incorporated into the coding process of standard video coding algorithms. We conclude the study by suggesting guidelines for the development of future quality metrics from the video coding perspective.","PeriodicalId":385797,"journal":{"name":"2012 IEEE International Conference on Multimedia and Expo Workshops","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128223296","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Tele-Medical Applications in Home-Based Health Care","authors":"Reem Al-Attas, A. Yassine, S. Shirmohammadi","doi":"10.1109/ICMEW.2012.83","DOIUrl":"https://doi.org/10.1109/ICMEW.2012.83","url":null,"abstract":"Tele-home-care systems are becoming more important for patients and society at large. Despite some surveys focusing on medical devices interoperability used on home-care systems, electronic measurements in rehabilitation, configuration of body area networks, a survey and taxonomy of enabling technologies for tele-home-care systems does not exist. This paper presents a survey and taxonomy of the design approaches. The discussion of open issues and suggestions for further research are detailed in this paper.","PeriodicalId":385797,"journal":{"name":"2012 IEEE International Conference on Multimedia and Expo Workshops","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124494992","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Video Description Length Guided Constant Quality Video Coding with Bitrate Constraint","authors":"Lei Yang, D. Mukherjee, D. Wu","doi":"10.1109/ICMEW.2012.70","DOIUrl":"https://doi.org/10.1109/ICMEW.2012.70","url":null,"abstract":"In this paper, we propose a new video encoding strategy - Video description length guided Constant Quality video coding with Bitrate Constraint (V-CQBC), for large scale video transcoding systems of video charing websites with varying unknown video contents. It provides smooth quality and saves bitrate and computation for transcoding millions of videos in both real time and batch mode. The new encoding strategy is based on the average bitrate-quality regression model and adapt to the encoded videos. Furthermore, three types of video description length (VDL), describing the video overall, spatial and temporal content complexity, are proposed to guide video coding. Experimental results show that the proposed coding strategy with saved computation could achieve better or similar RD performance than other coding strategies.","PeriodicalId":385797,"journal":{"name":"2012 IEEE International Conference on Multimedia and Expo Workshops","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115820356","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Cross-layer Optimized Coding Mode Selection for Wireless Video Communications","authors":"Yun Ye, S. Ci, Dalei Wu","doi":"10.1109/ICMEW.2012.24","DOIUrl":"https://doi.org/10.1109/ICMEW.2012.24","url":null,"abstract":"This paper proposed a coding mode selection method for video transmission over wireless networks. Unlike previous mode selection methods disregarding channel distortion or assuming constant packet loss rate (PLR), this method includes a cross-layer controller to collect both source and channel information. The mode selection process is formulated as a delay constrained distortion minimization problem. The three components in the resulting Lagrange cost function, namely distortion, Lagrange multiplier and packet delay, are estimated with online channel information feedback. Sub optimal coding decision and physical layer modulation and coding scheme (MCS) are determined by the controller for each packet. In our experiment, three coding modes, intra, inter and down sampling, are tested under various channel conditions. Compared to conventional method, 3.6dB to 7.5dB average reduction in distortion is achieved under different channel condition, while down sampling further gains up to 2.2dB distortion reduction in low data rate transmission.","PeriodicalId":385797,"journal":{"name":"2012 IEEE International Conference on Multimedia and Expo Workshops","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127713226","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Web-based Augmented Reality Video Streaming for Marketing","authors":"Ville Valjus, Sari Järvinen, Johannes Peltola","doi":"10.1109/ICMEW.2012.63","DOIUrl":"https://doi.org/10.1109/ICMEW.2012.63","url":null,"abstract":"This paper presents an Adobe Flash-based augmented reality video streaming application and its practical use in web marketing. The application enables augmenting the content of a web cam view by adding video content to it. Aller Media, a Nordic media company, used the application for advertising two Finnish movies with promotional video content. We have examined combining of conventional print media and digital media and the suitability of the augmented reality video streaming application for the web environment and marketing purposes. Additionally, we measured the technical performance of the application. The feedback from Aller Media and the end-users indicates that the application is useful for marketing purposes. In addition, the results show that the application is well suited for web environment as its performance is sufficient and as the distribution is more efficient compared to desktop or mobile applications.","PeriodicalId":385797,"journal":{"name":"2012 IEEE International Conference on Multimedia and Expo Workshops","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133075507","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Bin Jin, Weiyao Lin, Jianxin Wu, Tianhao Wu, Jun Huang, Chongyang Zhang
{"title":"Layout-expectation-based Model for Image Search Re-ranking","authors":"Bin Jin, Weiyao Lin, Jianxin Wu, Tianhao Wu, Jun Huang, Chongyang Zhang","doi":"10.1109/ICMEW.2012.28","DOIUrl":"https://doi.org/10.1109/ICMEW.2012.28","url":null,"abstract":"In this paper, a new algorithm is proposed for image re-ranking in Web search applications. The proposed algorithm introduces a new layout expectation model for improving the image search results. The motivation for using the expectation model is that users may often have potential expectations about the desired image during the search process. By including the layout expectation model to describe users' expectation on image layouts, the re-ranked search results can become more satisfactory to users. Experimental results demonstrate that our proposed algorithm can significantly improve the re-ranking precision compared with the state-of-the-art algorithms.","PeriodicalId":385797,"journal":{"name":"2012 IEEE International Conference on Multimedia and Expo Workshops","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133377171","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Content-Based Image Retrieval in P2P Networks with Bag-of-Features","authors":"Lelin Zhang, Zhiyong Wang, D. Feng","doi":"10.1109/ICMEW.2012.30","DOIUrl":"https://doi.org/10.1109/ICMEW.2012.30","url":null,"abstract":"Recently, the Bag-of-Features (BoF) model has emerged as a popular solution to scalable content-based image retrieval (CBIR), due to great success of the Bag-of-Words (BoW) model in textual information processing. While most of the existing algorithms on CBIR in P2P networks focus on indexing high dimensional low level features, we propose to address such an issue by employing the BoF model. However, it is not straightforward due to the fact that the BoF model depends on a global codebook and it is very challenging to create and maintain such a global codebook across the whole P2P network. We design a novel online sampling mechanism to create a codebook with low network cost. Since the number of features in each image is large, compared to a text query generally consisting of several keywords, information exchange between nodes for each query image generates high network cost. In order to further reduce the network cost, we implement two static index pruning policies to limit the document length and the returned term weights. Our comprehensive experimental results show that our proposed approach is able to scale up to medium size networks with performance comparable to the centralized environment.","PeriodicalId":385797,"journal":{"name":"2012 IEEE International Conference on Multimedia and Expo Workshops","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131157731","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}