IMMPD '11Pub Date : 2011-11-29DOI: 10.1145/2072561.2072564
Derek Pang, Sherif A. Halawa, Ngai-Man Cheung, B. Girod
{"title":"Mobile interactive region-of-interest video streaming with crowd-driven prefetching","authors":"Derek Pang, Sherif A. Halawa, Ngai-Man Cheung, B. Girod","doi":"10.1145/2072561.2072564","DOIUrl":"https://doi.org/10.1145/2072561.2072564","url":null,"abstract":"Small screen sizes, limited bandwidth, and low computational power often prohibit streaming of high-resolution videos to mobile devices over a wireless network. Recent advances in interactive region-of-interest (IRoI) video streaming technology allow users to interactively control pan/tilt/zoom, while providing bit-rate and complexity savings. In this paper, we present a mobile IRoI video streaming system that delivers high-quality interactive video to smartphones and tablets with multi-touch screens. One of the challenges in IRoI video streaming is to enable low-latency interaction when a user switches between different RoIs. We propose a crowd-driven RoI prediction scheme to prefetch future selected regions. Different from previous approaches that extrapolate past user inputs or perform video semantic analysis, our proposed scheme exploits user viewing statistics collected at the server to make RoI predictions. Our experiments show that a crowd-driven prefetching scheme can substantially reduce average RoI switching delays compared to a system without prefetching.","PeriodicalId":185203,"journal":{"name":"IMMPD '11","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124860678","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
IMMPD '11Pub Date : 2011-11-29DOI: 10.1145/2072561.2072569
Go Irie, T. Satou, Akira Kojima, T. Yamasaki, K. Aizawa
{"title":"Image collection summarization for search result overviewing on mobile devices","authors":"Go Irie, T. Satou, Akira Kojima, T. Yamasaki, K. Aizawa","doi":"10.1145/2072561.2072569","DOIUrl":"https://doi.org/10.1145/2072561.2072569","url":null,"abstract":"Due to small displays of mobile devices, overviewing an image search result that contains many and various images is difficult. To provide an overview of thousands of images, recent studies have tried to develop a framework for image collection summarization that extracts a smaller set of representative images from the original set. Most existing methods take (a) relevance and (b) coverage of each image into account. However, for the use on mobile devices, several important issues remain: generated summaries must be compact enough so as to suit the small mobile displays but the legibility of the summaries should be sufficient -- but how? Our focus in this paper is to extend the framework of image collection summarization to fit the context of overviewing image search results on mobile devices. The key advances of this paper are to introduce two primary factors of (c) compactness and (d) legibility when generating summaries. Our solution is a two-stage optimization method. Given a keyword query and display size, its first stage ranks the images by taking (a) relevance and (b) coverage into account. The second optimization stage takes into account (c) compactness and (d) legibility and determines the number and sizes of images included in the final summary so as to satisfy the display size constraint. Experiments conducted on over 240,000 images demonstrate the effectiveness of our method.","PeriodicalId":185203,"journal":{"name":"IMMPD '11","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125369416","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
IMMPD '11Pub Date : 2011-11-29DOI: 10.1145/2072561.2072570
J. Niu, Da Huo, Xiao Zeng, J. Mugan
{"title":"Interactive and real-time generation of home video summaries on mobile devices","authors":"J. Niu, Da Huo, Xiao Zeng, J. Mugan","doi":"10.1145/2072561.2072570","DOIUrl":"https://doi.org/10.1145/2072561.2072570","url":null,"abstract":"With the proliferation of mobile devices and multimedia, videos have become an indispensable part of life-logs for personal experiences. In this paper, we present a real-time and interactive application for home video summarization on mobile devices. The main challenge of this method is lack of information about the video content in the following frames, which we term \"partial-context\" in this paper. First of all, real-time segmentation algorithm based on partial-context is applied to decompose the captured video into segments in line with the change in dominant camera motion. Secondly, the main challenge to conventional video summarization is the semantic understanding of the video content. Thus, we leverage the fact that it is easy to get user input on a mobile device and attack this problem through the user interaction. The user preference is learned and modeled by a Gaussian Mixture Model (GMM), which is updated each time when users manually select key frames. Evaluation results demonstrate that our system significantly improves user experience and provides an efficient automatic/semi-automatic video summarization solution for mobile users.","PeriodicalId":185203,"journal":{"name":"IMMPD '11","volume":"67 3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130953352","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
IMMPD '11Pub Date : 2011-11-29DOI: 10.1145/2072561.2072571
Jose G. Moreno, G. Dias
{"title":"Using ephemeral clustering and query logs to organize web image search results on mobile devices","authors":"Jose G. Moreno, G. Dias","doi":"10.1145/2072561.2072571","DOIUrl":"https://doi.org/10.1145/2072561.2072571","url":null,"abstract":"The recent shift in human-computer interaction from desktop to mobile computing fosters the needs of new interfaces for web image search results exploration. In this paper, we present two different strategies to cluster results gathered from an image search engine and propose an adapted interface for handled devices. For that purpose, we suggest to expand the original query based on labels of Ephemeral Clusters and compare it to a Query Log based approach. Consistent results were obtained for both strategies from manual and automatic evaluations, confirming that organizing image search results into clusters can improve mobile image information retrieval.","PeriodicalId":185203,"journal":{"name":"IMMPD '11","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117204372","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
IMMPD '11Pub Date : 2011-11-29DOI: 10.1145/2072561.2072565
Yu-Ming Hsu, Ming-Kuang Tsai, Yen-Liang Lin, Winston H. Hsu
{"title":"Comp2Watch: enhancing the mobile video browsing experience","authors":"Yu-Ming Hsu, Ming-Kuang Tsai, Yen-Liang Lin, Winston H. Hsu","doi":"10.1145/2072561.2072565","DOIUrl":"https://doi.org/10.1145/2072561.2072565","url":null,"abstract":"The mobile devices have been widely spread and become frequently used equipment in daily life. Besides, watching videos on these devices has become a more and more popular activity. However, there are several challenges (e.g., small mobile screen size, low bandwidth, fragmented watching time) hindering mobile video watching: they either interrupt the watching process or limit users to browse many contents at the same time. Traditional video summarization techniques are suffering the small screen issue. Therefore, we propose a system, Comp2Watch which is pronounced like \"come to watch\". It implies the meaning of \"composing the frames into a collage\" and \"compressing the watching time\". It puts ROI factors into consideration in order to help users take a quick glance at videos. Also, we modify the cost function to incorporate the templates with variable aspect ratios. We also address the monotone layout problem caused by the limited space. The experimental results show that users can obtain clearer subject without losing many contexts.","PeriodicalId":185203,"journal":{"name":"IMMPD '11","volume":"45 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131521900","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
IMMPD '11Pub Date : 2011-11-29DOI: 10.1145/2072561.2072567
S. Deshpande, L. Kerofsky
{"title":"System level power allocation algorithm for mobile devices for full playback guarantee","authors":"S. Deshpande, L. Kerofsky","doi":"10.1145/2072561.2072567","DOIUrl":"https://doi.org/10.1145/2072561.2072567","url":null,"abstract":"We propose a system level power allocation algorithm for mobile devices for guaranteed playback duration. The algorithm takes as input the media playback duration and current battery charge available on the device. It calculates energy budget based on this. It finds a set of feasible operating points based on maximization of a joint quality function while meeting energy budget constraint. The joint quality function consists of contributions from backlight and playback frame rate quality functions. A side-by-side comparison of our algorithm with state-of-the-art native video player application on the smartphone shows similar audio-visual experience while providing 19% system level energy savings.","PeriodicalId":185203,"journal":{"name":"IMMPD '11","volume":"84 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131776915","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
IMMPD '11Pub Date : 2011-11-29DOI: 10.1145/2072561.2072563
Wenhong Yuan, Bin Li, Kongqiao Wang
{"title":"A novel PRO-CAM based interactive display surface","authors":"Wenhong Yuan, Bin Li, Kongqiao Wang","doi":"10.1145/2072561.2072563","DOIUrl":"https://doi.org/10.1145/2072561.2072563","url":null,"abstract":"Vision-based human-computer interaction (HCI) is a natural and human-centered way to make interaction between human and computer. Recently, with the miniaturization of projectors and the development of embedded systems, there has been an explosion of interest in systems which combine projection technology with computer vision. Associating a projector with a camera offers a cheap means to transform any surface into an interactive display surface. However, it is very hard to segment hand and recognize hand gesture due to self-occlusion, non-rigid tissue, even when the occlusion due to projection content can be avoided. In this paper, a novel approach is proposed to recognize finger stroke under the dynamic illumination circumstances. The approach is based on one projector and two heterogeneous cameras. First, an NIR camera is used to get the finger-tip and hand model, then, the hand model is used to get the interest points of the hand in the visible camera, then we can get the disparity of interest points from the two heterogeneous images. The disparity change is related to the depth change that can be used to determine when stroke event happens. Experiment results from a prototype system show that the approach can run in real-time without using special markers or gloves.","PeriodicalId":185203,"journal":{"name":"IMMPD '11","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129152176","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}