{"title":"Integrated Feature Selection and Clustering from Multiple Views for a Taxonomic Problem","authors":"Huimin Chen, H. Bart, Shuqing Huang","doi":"10.1109/MMSP.2007.4412855","DOIUrl":"https://doi.org/10.1109/MMSP.2007.4412855","url":null,"abstract":"As computer and database technologies advance rapidly, biologists all over the world can share biologically meaningful data from images of specimens and use the data to classify the specimens taxonomically. Accurate shape analysis of a specimen from multiple views of 2D images is crucial for finding diagnostic features using geometric morphometric techniques. We propose an integrated feature selection and clustering framework that automatically identifies a set of feature variables to group specimens into a binary cluster tree. The candidate features are generated from reconstructed 3D shape and local saliency characteristics from 2D images of the specimen. We use a mixture model to estimate the significance value of each feature and control the false discovery rate in the feature selection process so that the clustering algorithm can efficiently partition the specimen samples into clusters that may correspond to different species. The experiments on a taxonomic problem involving species of suckers in the genus Carpiodes demonstrate promising results using the proposed framework with small sample size.","PeriodicalId":225295,"journal":{"name":"2007 IEEE 9th Workshop on Multimedia Signal Processing","volume":"58 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116279875","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Phase-Domain Statistical Analysis for Audio Source Localization","authors":"A. Said, T. Kalker, R. Schafer","doi":"10.1109/MMSP.2007.4412826","DOIUrl":"https://doi.org/10.1109/MMSP.2007.4412826","url":null,"abstract":"We consider the problem of estimating the time-difference-of-arrival (TDOA) for audio source localization in noisy environments, defining a framework for statistical analysis in the phase domain which enables more reliable estimates. This is motivated by the fact that with complex sources, noise, and interfering signals, different frequency bands have significantly different signal-to-noise ratios, creating non-uniform distributions of errors in the phase measurements. Through a new method for analysis of the variations of the phase in frequency windows, we first estimate the signal-to-noise ratio for frequency, and then use it in a maximum-likelihood estimation of the time difference of arrival. We show that this corresponds to a generalization of the Phase Transform method (PHAT), and provides a theoretical justification of why it works so well. Numerical results show how the proposed technique compares favorably with PHAT.","PeriodicalId":225295,"journal":{"name":"2007 IEEE 9th Workshop on Multimedia Signal Processing","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121730519","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Power-Rate-Distortion Optimization for Multi-Source Video Streaming under Energy Constraints over Ad Hoc Networks","authors":"Juntao Ouyang, Lifeng Sun, Y. Zhong, Shiqiang Yang","doi":"10.1109/MMSP.2007.4412834","DOIUrl":"https://doi.org/10.1109/MMSP.2007.4412834","url":null,"abstract":"We propose a dynamic rate allocation scheme based on power-rate-distortion (PRD) optimization model among multiple video sources over ad hoc networks. This work is an extension of the PRD model for single source video streaming. With a total rate constraint and different power consumption constraints for each node, our optimization algorithm minimizes the average video distortion for all sources. The optimization is performed at the receiver of the video streams. Experimental results for a video surveillance scenario demonstrate that the proposed scheme outperforms a simple fixed-QP scheme. The proposed scheme promotes the average PSNR by 0.32-0.45 dB without shortening the system lifetime, or prolongs system lifetime by more than 20% without cutting down the overall PSNR.","PeriodicalId":225295,"journal":{"name":"2007 IEEE 9th Workshop on Multimedia Signal Processing","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128058202","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"High-Speed Stream-Centric Dense Stereo and View Synthesis on Graphics Hardware","authors":"Jiangbo Lu, S. Rogmans, G. Lafruit, F. Catthoor","doi":"10.1109/MMSP.2007.4412863","DOIUrl":"https://doi.org/10.1109/MMSP.2007.4412863","url":null,"abstract":"This paper presents an efficient image-based rendering system capable of performing online stereo matching and view synthesis at high speed, completely on the graphics processing unit (GPU). Given two rectified stereo images, our algorithm first extracts the disparity map with a stream-centric dense depth estimation approach. For high-quality view synthesis, multi-label masks are then automatically generated to postprocess occlusions and ambiguously estimated regions adaptively. To allow even faster interactive view generation, an alternative forward warping method is also integrated. The experiments show that photorealistic intermediate views of high image quality are yielded by our algorithm. The optimized implementation also provides the state-of-the-art stereo analysis and view synthesis speed, achieving over 47 fps with 450x375 stereo images and 60 disparity levels on an Nvidia GeForce 7900 graphics card.","PeriodicalId":225295,"journal":{"name":"2007 IEEE 9th Workshop on Multimedia Signal Processing","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130862497","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Enhancing Social Communication in High-Functioning Children with Autism through a Co-Located Interface","authors":"Nirit Bauminger, Dina Goren-Bar, E. Gal, P. Weiss, Judi Kupersmitt, F. Pianesi, O. Stock, M. Zancanaro","doi":"10.1109/MMSP.2007.4412808","DOIUrl":"https://doi.org/10.1109/MMSP.2007.4412808","url":null,"abstract":"In this paper we describe a pilot study for an intervention aimed at enhancing social skills in high-functioning children with autism. We found initial evidence that the use of a social interaction and may lessen the repetitive behaviors typical of autism. These positive effects also appear to be transferred to other tasks following the intervention. We hypothesize that the effect is due to some unique characteristics of the interfaces used, in particular enforcing some tasks to be done together through the use of multiple-user GUI actions.","PeriodicalId":225295,"journal":{"name":"2007 IEEE 9th Workshop on Multimedia Signal Processing","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122016665","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Quality Measurement Modeling on Scalable Video Applications","authors":"Sung Ho Jin, C. Kim, Dong Jun Seo, Yong Man Ro","doi":"10.1109/MMSP.2007.4412835","DOIUrl":"https://doi.org/10.1109/MMSP.2007.4412835","url":null,"abstract":"For various mobile applications, measuring the grade of the video quality is needed in order to guarantee the optimal quality of video streaming service. As H.264/AVC scalable video coding (SVC) has emerged and been developed to support full scalability including spatial, temporal, and signal-to-noise ratio (SNR) scalability, each of which shows a different visual effect, it is necessary to measure video quality with full scalability. In this paper, we develop a novel video quality metric allowing full scalability through subjective quality assessment. Experimental results show that the proposed quality metric has high correlation with subjective quality and is useful for determining the video quality of SVC.","PeriodicalId":225295,"journal":{"name":"2007 IEEE 9th Workshop on Multimedia Signal Processing","volume":"204 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116391387","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Flexible Layered Authentication Graph for Multimedia Streaming","authors":"Xinglei Zhu, Zhishou Zhang, Zhi Li, Qibin Sun","doi":"10.1109/MMSP.2007.4412888","DOIUrl":"https://doi.org/10.1109/MMSP.2007.4412888","url":null,"abstract":"In this paper, a new flexible layered authentication graph (FLAG) algorithm is proposed for multimedia streaming authentication. While maximizing the verification probability by avoiding authentication path overlapping, this algorithm allows flexible communication overhead in terms of the number of hash links, as well as flexible authentication group size. These flexibilities make FLAG an excellent candidate for multimedia streaming authentication, in that (i) in the sender buffering mode, it allows elastic sending delay required by multimedia streaming congestion control; (ii) in the receiver buffering mode, it facilitates adaptation to effective network bandwidth; (iii) it also has the potential to provide unequal authentication protection (UAP), which is a natural solution for multimedia code stream. Our analysis and experiment results further confirm the validity of our algorithm.","PeriodicalId":225295,"journal":{"name":"2007 IEEE 9th Workshop on Multimedia Signal Processing","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116785683","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Object-Sensitive Query Analysis for Video Search","authors":"Jingjing Liu, Xiansheng Hua, Shipeng Li","doi":"10.1109/MMSP.2007.4412891","DOIUrl":"https://doi.org/10.1109/MMSP.2007.4412891","url":null,"abstract":"This paper is concerned with the problem of improving the performance of text search baseline in video retrieval, specifically for the search tasks in TRECVID. Given a query in plain text, we first implement syntactic segmentation and semantic expansion of the query, then identify the underlying \"targeted objects\" which should appear in the retrieved video shots, and scale up the weights of the video shots retrieved by the query terms that represent these targeted objects. We name the approaches as \"object-sensitive query analysis\" for video search. Specifically, we propose a set of methods to identify the specific terms representing the \"targeted objects\" in a video search query, and a modified object-centric BM25 algorithm to emphasize the impact of these specific object-terms. In practice, we place the process of object-sensitive query analysis before the text search stage, and verify the effectiveness of the proposed approaches with the TRECVID 2005 and 2006 datasets. The experimental results indicate that the proposed object-sensitive approaches to query analysis bring significant improvement upon the raw text search baseline of video search.","PeriodicalId":225295,"journal":{"name":"2007 IEEE 9th Workshop on Multimedia Signal Processing","volume":"45 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116588411","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Wyner-Ziv Stereo Video Coding using a Side Information Fusion Approach","authors":"J. D. Areia, J. Ascenso, Catarina Brites, F. Pereira","doi":"10.1109/MMSP.2007.4412914","DOIUrl":"https://doi.org/10.1109/MMSP.2007.4412914","url":null,"abstract":"Wyner-Ziv coding, also known as distributed video coding, is currently a very hot research topic in video coding due to the new opportunities it opens. This paper applies the distributed video coding principles to stereo video coding, to propose a practical solution for Wyner-Ziv stereo coding based on mask-based fusion of temporal and spatial side informations. The architecture includes a low-complexity encoder and avoids any communication between the cameras/encoders. While the rate-distortion (RD) performance strongly depends on the motion-based frame interpolation (MBFI) and disparity-based frame estimation (DBFE) solutions, first results show that the proposed approach is promising and there are still issues to address.","PeriodicalId":225295,"journal":{"name":"2007 IEEE 9th Workshop on Multimedia Signal Processing","volume":"68 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114491594","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Efficient Dependency Tracking in Packetised Media Streams","authors":"Alexander Eichhorn","doi":"10.1109/MMSP.2007.4412837","DOIUrl":"https://doi.org/10.1109/MMSP.2007.4412837","url":null,"abstract":"Scheduling and error control mechanisms for robust delivery of media streams over packet networks rely on distortion metrics to optimally allocate resources and protect streams from uncontrolled quality degradation. Current distortion metrics are accurate, but the actual distortion values are expensive to obtain. Therefore, distortion models often assume fixed dependency patterns and neglect fragmentation issues. While this decreases runtime complexity, it also limits the application of such models to special stream classes and network environments. In response, we present a practical, efficient and format-independent framework to reason about dependencies in media streams. Based on correlation analysis we show that the estimations made by our framework match traditional distortion metrics for a number of H.264 encoded streams. Performance benchmarks indicate that our framework is applicable at very low computational overheads.","PeriodicalId":225295,"journal":{"name":"2007 IEEE 9th Workshop on Multimedia Signal Processing","volume":"54 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130697775","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}