2012 IEEE International Conference on Multimedia and Expo最新文献

筛选
英文 中文
Enhanced Principal Component Using Polar Coordinate PCA for Stereo Audio Coding 基于极坐标PCA的增强主成分立体音频编码
2012 IEEE International Conference on Multimedia and Expo Pub Date : 2012-07-09 DOI: 10.1109/ICME.2012.22
Shi Dong, R. Hu, Weiping Tu, Xiang Zheng, Junjun Jiang, Song Wang
{"title":"Enhanced Principal Component Using Polar Coordinate PCA for Stereo Audio Coding","authors":"Shi Dong, R. Hu, Weiping Tu, Xiang Zheng, Junjun Jiang, Song Wang","doi":"10.1109/ICME.2012.22","DOIUrl":"https://doi.org/10.1109/ICME.2012.22","url":null,"abstract":"High efficiency audio compression is the basic technology in audio involved multimedia application. Down mixing and parametric coding are efficient coding scheme with widely applications in some up to date audio codecs such as PS in EAAC+ and MPEG-Surround, and PCA stereo coding followed this idea to map two channels to one channel with maximum energy and parameterize the secondary channel. This paper investigates the conventional PCA method performance under general stereo model with multiple sound sources and different directions, and then proposes a Polar Coordinate based PCA (PC-PCA) stereo coding method. It has been proved that when multiple sound sources exist with different directions, proposed method is better than the conventional PCA method in certain conditions. A stereo codec based on PC-PCA has also been proposed to validate the performance improvement of proposed method.","PeriodicalId":273567,"journal":{"name":"2012 IEEE International Conference on Multimedia and Expo","volume":"391 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121781902","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Adaptive Coding with CPU Energy Conservation for Mobile Video Calls 基于CPU节能的移动视频通话自适应编码
2012 IEEE International Conference on Multimedia and Expo Pub Date : 2012-07-09 DOI: 10.1109/ICME.2012.78
Haiyang Ma, Roger Zimmermann
{"title":"Adaptive Coding with CPU Energy Conservation for Mobile Video Calls","authors":"Haiyang Ma, Roger Zimmermann","doi":"10.1109/ICME.2012.78","DOIUrl":"https://doi.org/10.1109/ICME.2012.78","url":null,"abstract":"Video calling incurs very high power consumption on mobile platforms usually powered by capacity-constrained batteries. To remedy this problem, we develop a cross-layer energy-aware optimization framework to reduce CPU energy consumption for mobile video calls. We explore the texture similarity between neighboring macro blocks during H.264 coding and control the quality-complexity tradeoff with a single self-adaptive parameter. Dynamic Voltage and Frequency Scaling (DVFS) is then applied to reduce energy consumption and guarantee the stringent one-way delay in real-time applications. Experimental results show that considerable CPU energy saving is achieved while a high Quality of Service (QoS) is preserved for user satisfaction.","PeriodicalId":273567,"journal":{"name":"2012 IEEE International Conference on Multimedia and Expo","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121821365","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Band Codes: Controlled Complexity Network Coding for Peer-to-Peer Video Streaming 频带编码:点对点视频流的受控复杂性网络编码
2012 IEEE International Conference on Multimedia and Expo Pub Date : 2012-07-09 DOI: 10.1109/ICME.2012.84
A. Fiandrotti, Valerio Bioglio, E. Magli, Marco Grangetto, R. Gaeta
{"title":"Band Codes: Controlled Complexity Network Coding for Peer-to-Peer Video Streaming","authors":"A. Fiandrotti, Valerio Bioglio, E. Magli, Marco Grangetto, R. Gaeta","doi":"10.1109/ICME.2012.84","DOIUrl":"https://doi.org/10.1109/ICME.2012.84","url":null,"abstract":"We present Band Codes (BC), a novel class of rate less codes that makes possible to control the computational complexity of Network Coding (NC). NC increases throughput of the networks via packet recombinations at the network nodes. In a NC scenario based on rate less codes, the recombinations at the nodes alter the packet degree distribution selected at the source and increase the computational complexity of the packet decoding process. Unlike other classes of rate less codes, BC preserve the degree distribution of the encoded packets through the recombinations at the nodes. Furthermore, BC enable to control the decoding complexity of each network node independently from the rest of the network. We evaluate BC in a P2P scenario using a purposely designed random-push protocol for live video streaming. The experiments show that BC achieve high encoding efficiency, enable nodes with different computational capabilities to coexist within the same network and reduce the processor load on a real mobile device by nearly 50%.","PeriodicalId":273567,"journal":{"name":"2012 IEEE International Conference on Multimedia and Expo","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125976720","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Unsupervised Conversion of 3D Models for Interactive Metaverses 交互式元数据库中3D模型的无监督转换
2012 IEEE International Conference on Multimedia and Expo Pub Date : 2012-07-09 DOI: 10.1109/ICME.2012.186
J. Terrace, Ewen Cheslack-Postava, P. Levis, M. Freedman
{"title":"Unsupervised Conversion of 3D Models for Interactive Metaverses","authors":"J. Terrace, Ewen Cheslack-Postava, P. Levis, M. Freedman","doi":"10.1109/ICME.2012.186","DOIUrl":"https://doi.org/10.1109/ICME.2012.186","url":null,"abstract":"A virtual-world environment becomes a truly engaging platform when users have the ability to insert 3D content into the world. However, arbitrary 3D content is often not optimized for real-time rendering, limiting the ability of clients to display large scenes consisting of hundreds or thousands of objects. We present the design and implementation of an automatic, unsupervised conversion process that transforms 3D content into a format suitable for real-time rendering while minimizing loss of quality. The resulting progressive format includes a base mesh, allowing clients to quickly display the model, and a progressive portion for streaming additional detail as desired. Sirikata, an open virtual world platform, has processed over 700 models using this method.","PeriodicalId":273567,"journal":{"name":"2012 IEEE International Conference on Multimedia and Expo","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125143871","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Spread and Iterative Search: A High Quality Motion Estimation Algorithm for High Definition Videos and Its VLSI Design 扩展与迭代搜索:一种高质量的高清视频运动估计算法及其VLSI设计
2012 IEEE International Conference on Multimedia and Expo Pub Date : 2012-07-09 DOI: 10.1109/ICME.2012.53
G. Sanchez, L. Agostini, F. Sampaio, M. Porto, S. Bampi
{"title":"Spread and Iterative Search: A High Quality Motion Estimation Algorithm for High Definition Videos and Its VLSI Design","authors":"G. Sanchez, L. Agostini, F. Sampaio, M. Porto, S. Bampi","doi":"10.1109/ICME.2012.53","DOIUrl":"https://doi.org/10.1109/ICME.2012.53","url":null,"abstract":"This paper presents the Spread and Iterative Search (S&IS) motion estimation algorithm, which uses a random spread evaluation together with a central iterative evaluation to avoid local minima falls and to increase the image quality for high definition videos. Considering Full HD videos, S&IS reached an average PSNR gain of 1.41dB when compared to Diamond Search (DS), with an increase of about four times in the number of evaluated blocks. When compared to Full Search (FS), the S&IS achieved an average PSNR loss of 1.56 dB, evaluating 73 times less blocks than FS. An efficient architecture for the S&IS algorithm is also presented in this paper. The architecture was designed targeting in real time processing (30 frames per seconds) for QFHD videos (3840×2160 pixels). The architecture was described in VHDL and synthesized for and Altera Stratix 4 FPGA and for ST90nm standard cells technology. Booth syntheses show that the architecture is able to process QFHD frames in real time. The standard cells version is able to reach also a good trade-off among area, memory and power consumption, processing QFHD videos with 62.2 mW.","PeriodicalId":273567,"journal":{"name":"2012 IEEE International Conference on Multimedia and Expo","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130263969","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Learning Detectors from Large Datasets for Object Retrieval in Video Surveillance 基于大数据集学习检测器的视频监控对象检索
2012 IEEE International Conference on Multimedia and Expo Pub Date : 2012-07-09 DOI: 10.1109/ICME.2012.132
R. Feris, Sharath Pankanti, Behjat Siddiquie
{"title":"Learning Detectors from Large Datasets for Object Retrieval in Video Surveillance","authors":"R. Feris, Sharath Pankanti, Behjat Siddiquie","doi":"10.1109/ICME.2012.132","DOIUrl":"https://doi.org/10.1109/ICME.2012.132","url":null,"abstract":"We address the problem of learning robust and efficient multi-view object detectors for surveillance video indexing and retrieval. Our philosophy is that effective solutions for this problem can be obtained by learning detectors from huge amounts of training data. Along this research direction, we propose a novel approach that consists of strategically partitioning the training set and learning a large array of complementary, compact, deep cascade detectors. At test time, given a video sequence captured by a fixed camera, a small number of detectors is automatically selected per image location. We demonstrate our approach on the problem of vehicle detection in challenging surveillance scenarios, using a large training dataset composed of around one million images. Our system runs at an impressive average rate of 125 frames per second on a conventional laptop computer.","PeriodicalId":273567,"journal":{"name":"2012 IEEE International Conference on Multimedia and Expo","volume":"76 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129695977","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
View-Invariant Fall Detection System Based on Silhouette Area and Orientation 基于轮廓面积和方向的视不变跌落检测系统
2012 IEEE International Conference on Multimedia and Expo Pub Date : 2012-07-09 DOI: 10.1109/ICME.2012.193
Behzad Mirmahboub, S. Samavi, N. Karimi, S. Shirani
{"title":"View-Invariant Fall Detection System Based on Silhouette Area and Orientation","authors":"Behzad Mirmahboub, S. Samavi, N. Karimi, S. Shirani","doi":"10.1109/ICME.2012.193","DOIUrl":"https://doi.org/10.1109/ICME.2012.193","url":null,"abstract":"Population of old generation that live alone is growing in most countries. Surveillance systems help them stay home and reduce the burden on the healthcare system. Automatic visual surveillance systems have advantages over wearable devices. They extract features from video sequences and use them for event classification. But these features are dependent on the position of cameras relative to the person. Therefore they need multi-camera for more accuracy that increases cost and complexity. In this paper we propose using silhouette area combined with inclination angle as robust features that can be measured using only one camera with an arbitrary direction. Through rigorous simulations on a publicly available dataset the error rate of the system is found to be less than 1%.","PeriodicalId":273567,"journal":{"name":"2012 IEEE International Conference on Multimedia and Expo","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129296266","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Live Semantic Sport Highlight Detection Based on Analyzing Tweets of Twitter 基于Twitter推文分析的实时语义体育高光检测
2012 IEEE International Conference on Multimedia and Expo Pub Date : 2012-07-09 DOI: 10.1109/ICME.2012.135
Liang-Chi Hsieh, Ching-Wei Lee, Tzu-Hsuan Chiu, Winston H. Hsu
{"title":"Live Semantic Sport Highlight Detection Based on Analyzing Tweets of Twitter","authors":"Liang-Chi Hsieh, Ching-Wei Lee, Tzu-Hsuan Chiu, Winston H. Hsu","doi":"10.1109/ICME.2012.135","DOIUrl":"https://doi.org/10.1109/ICME.2012.135","url":null,"abstract":"Microblogging as a new form of communication on Internet, has attracted the attention from researchers recently. Relying the real-time and conversational properties of microblogging, its users update their statuses and share experience within their the social network. Those characteristics also make microblogging an important tool for users to share or discuss real world events such as earth quake or sport game. In this paper, we propose a novel and flexible solution to detect and recognize real-time events from sport games based on analyzing the messages posted on microblogging services. We take Twitter as the experiment platform and collect a large-scale dataset of Twitter messages that are called tweets for 18 prominent sport games covering four types of sports in 2011. We also collect corresponding sport videos for those games. The proposed solution applies moving-threshold burst detection on the volume of tweets to detect highlights in sport games. A tf-idf-based weighting method is applied on the tweets within detected highlights for semantic extraction. According to the experiments we perform on the tweet and video datasets, we find that the proposed methods can achieve competent performance in sport event detection and recognition. Besides, our method can find non pre-defined tidbits that are difficult to detect in previous works.","PeriodicalId":273567,"journal":{"name":"2012 IEEE International Conference on Multimedia and Expo","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121484007","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 40
A Fast and Robust Pedestrian Detection Framework Based on Static and Dynamic Information 基于静态和动态信息的快速鲁棒行人检测框架
2012 IEEE International Conference on Multimedia and Expo Pub Date : 2012-07-09 DOI: 10.1109/ICME.2012.66
T. Xu, Hong Liu, Yueliang Qian, Zhe Wang
{"title":"A Fast and Robust Pedestrian Detection Framework Based on Static and Dynamic Information","authors":"T. Xu, Hong Liu, Yueliang Qian, Zhe Wang","doi":"10.1109/ICME.2012.66","DOIUrl":"https://doi.org/10.1109/ICME.2012.66","url":null,"abstract":"With the powerful development of pedestrian detection technique based on sliding-window and machine-learning, detection-based tracking systems have become increasingly popular. Most of these systems rely on existing static pedestrian detectors only despite the obvious potential motion information for people detection. This paper proposes a novel pedestrian detection framework fusing static and dynamic features. Motion cue is firstly used to detect potential pedestrian regions. Secondly, static detector scans potential regions to get candidate pedestrian detections. Final detection results are improved by removing false detections based on their motion distribution. The proposed framework significantly raises detection speed and detection performance. Static detector of pedestrian in this paper is trained by AdaBoost with simplified HOG feature (1HOG). Additionally, we introduce a detection-window-pyramid based scanning strategy for quickly extracting 1HOG features. The experimental results on several public data sets show the effectiveness of the proposed approach.","PeriodicalId":273567,"journal":{"name":"2012 IEEE International Conference on Multimedia and Expo","volume":"60 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120960541","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Effective Spatial Data Broadcasting 有效的空间数据广播
2012 IEEE International Conference on Multimedia and Expo Pub Date : 2012-07-09 DOI: 10.1109/ICME.2012.100
Chung-Hua Chu
{"title":"Effective Spatial Data Broadcasting","authors":"Chung-Hua Chu","doi":"10.1109/ICME.2012.100","DOIUrl":"https://doi.org/10.1109/ICME.2012.100","url":null,"abstract":"Data broadcast is an advanced technique to realize large scalability and bandwidth utilization in a mobile computing environment. Three dimensional (3D) contents are emerging data in the data broadcast. However, traditional data broadcast did not consider 3D data to design a data schedule in a broadcast channel. Therefore, the above drawback leads to large access delay in the 3D data broadcast. In this paper, we remedy the problem by devising an indexing technique to index the 3D data of variant geometry shapes. We propose an indexing technique using a 3D data index tree to minimize average waiting time and average tuning time for broadcasting the 3D data. Experimental results show that our approach is able to generate broadcast programs including the 3D data indices with high quality and is very efficient in the 3D data broadcast.","PeriodicalId":273567,"journal":{"name":"2012 IEEE International Conference on Multimedia and Expo","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114263527","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信