{"title":"Extraction of Individual Perception Features for imagery-based Image Retrieval","authors":"Ying Dai","doi":"10.1109/MMSP.2005.248543","DOIUrl":"https://doi.org/10.1109/MMSP.2005.248543","url":null,"abstract":"In this paper, with the objective to retrieving digital collections more intuitively and flexible, we proposed an approach of compact feature extraction and retrieval of images which matched the human perceptual image similarity criteria. For this, the eigen and difference SGLD (space gray level dependence) matrices were used to extract the features of images. The associations of the extracted features with the individual's perceptual similarity criteria regarding color and structure were analyzed. The user satisfaction-based evaluation illustrated the good performance of the proposed method","PeriodicalId":191719,"journal":{"name":"2005 IEEE 7th Workshop on Multimedia Signal Processing","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116988142","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Just-Noticeable Distortion Estimation for Image Pixels","authors":"Xiaohui Zhang, Weisi Lin, P. Xue","doi":"10.1109/MMSP.2005.248671","DOIUrl":"https://doi.org/10.1109/MMSP.2005.248671","url":null,"abstract":"This paper addresses the issue of incorporating contrast sensitivity function (CSF) into just-noticeable distortion (JND) estimation for image pixels in spatial domain. Based upon our earlier work on DCT-subband JND estimation, the resultant new pixel-wise JND model fully incorporates the spatial CSF, in addition to considerations of luminance adaptation and local contrast masking effect. Various experiments confirm the improved accuracy of the proposed model over the existing relevant JND estimators","PeriodicalId":191719,"journal":{"name":"2005 IEEE 7th Workshop on Multimedia Signal Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123948175","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Slice header reconstruction for H.264/AVC robust decoders","authors":"G. Gennari, D. Bagni, A. Borneo, L. Pezzoni","doi":"10.1109/MMSP.2005.248605","DOIUrl":"https://doi.org/10.1109/MMSP.2005.248605","url":null,"abstract":"For H.264/AVC encoded video sequences, a transmission error in the bit-stream can affect either the underlying codeword or even the subsequent symbols, thus resulting in a great degradation of the received images. In this paper we present a powerful algorithm of slice-header reconstruction for H.264/AVC decoders: after having decoded the corrupted header of a slice, it can recover most of the related slice data without requiring any interaction from the encoder side. Our method exploits the redundancy available when slice partitioning is applied, that is, the repetition of the most important header parameters in every slice of a picture. The proposed approach allows to recover high quality pictures from the corresponding corrupted slices for a wide range of bit error rates (BER)","PeriodicalId":191719,"journal":{"name":"2005 IEEE 7th Workshop on Multimedia Signal Processing","volume":"65 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131650276","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Trajectory Matching and Classification of Video Moving Objects","authors":"Jiang-bin Zheng, D. Feng, R. Zhao","doi":"10.1109/MMSP.2005.248553","DOIUrl":"https://doi.org/10.1109/MMSP.2005.248553","url":null,"abstract":"Trajectory matching is an important way to describe and classify behaviors of moving objects in a computer visual system. In this paper, we present two trajectory description methods, time-sampling sequence and space-sampling sequence, which can be used in different matching applications. We then propose two general trajectory matching schemes based on Levenshtein distance and relaxation matching respectively. Trajectory Levenshtein distance scheme is a good way to compare the topological shapes and directions of trajectories, and can be performed quickly. Trajectory relaxation matching scheme can gain the statistical optimal matching. Finally, we propose a top-to-bottom hierarchical clustering algorithm to classify trajectories, and several experiments demonstrate that our schemes are efficient in matching and classifying different shape and direction trajectories","PeriodicalId":191719,"journal":{"name":"2005 IEEE 7th Workshop on Multimedia Signal Processing","volume":"506 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115320295","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Invariant Salient Region Selection and Scale Normalization of Image","authors":"Xianfeng Yang, P. Xue, Q. Tian","doi":"10.1109/MMSP.2005.248592","DOIUrl":"https://doi.org/10.1109/MMSP.2005.248592","url":null,"abstract":"Scale estimation is important in image and vision computing. We propose in this paper an invariant salient region selection and scale normalization method which is robust to rotation, scaling, translation and cropping. This new method is based on the first and second order invariant geometric moments calculated from an intensity difference map. The first-order moments are used to obtain invariant circular regions for different scale hypotheses, while a second-order moment is chosen as region descriptor to select the most salient scale. The image is normalized by scale of the selected salient region. Experiments demonstrate effectiveness of this method","PeriodicalId":191719,"journal":{"name":"2005 IEEE 7th Workshop on Multimedia Signal Processing","volume":"2018 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121665013","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Automatic Feature Extraction and Semantic Feature Matrix for VRML Building Database Retrieval","authors":"Hsuan T. Chang, Kwang Y. Chang","doi":"10.1109/MMSP.2005.248570","DOIUrl":"https://doi.org/10.1109/MMSP.2005.248570","url":null,"abstract":"A semantic content based information retrieval system for a three-dimensional (3-D) database is proposed in this paper. Here the studied database is composed of 3-D building objects defined by virtual reality modeling language (VRML). First of all, the specific low-level features for building objects are defined and then searched and extracted from the content described in the VRML file. Then, a semantic feature matrix (SFM) is constructed with the middle-level features that determined from the low-level features of all the objects in the database. For a query object, a similar process is applied such that the low-level features and the corresponding semantic vector can be obtained. By multiplying the SFM with the query vector, the scores corresponding to the similarities between the query and all objects in the database can be calculated. Simulation results show that the desired 3-D objects can be successfully and efficiently retrieved with high recall and precision rates","PeriodicalId":191719,"journal":{"name":"2005 IEEE 7th Workshop on Multimedia Signal Processing","volume":"62 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121497708","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Yan Li, A. Markopoulou, J. Apostolopoulos, N. Bambos
{"title":"Joint Packet Scheduling and Content-Aware Playout Control for Video Streaming over Wireless Links","authors":"Yan Li, A. Markopoulou, J. Apostolopoulos, N. Bambos","doi":"10.1109/MMSP.2005.248594","DOIUrl":"https://doi.org/10.1109/MMSP.2005.248594","url":null,"abstract":"Media streaming over wireless links is a challenging problem due to both the unreliable, time-varying nature of the wireless channel and the stringent delivery requirements of media traffic. In this paper, we use joint control of packet scheduling at the transmitter and content-aware playout at the receiver, so as to maximize the quality of media streaming over a wireless link. Our contributions are twofold. First, we formulate and study the problem of joint scheduling and playout control within a dynamic programming framework. Second, we propose a novel content-aware playout control, that takes into account the content of a video sequence, and in particular the motion characteristics of different scenes. We find that the joint scheduling and playout control can significantly improve the quality of the received video, at the expense of only a small amount of playout slowdown. Furthermore, thanks to the content-aware playout, the slowdown takes place mainly in the low-motion scenes, where its perceived effect is limited","PeriodicalId":191719,"journal":{"name":"2005 IEEE 7th Workshop on Multimedia Signal Processing","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115151781","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Bit Allocation for Variable Bitrate Video","authors":"M. Rezaei, M. Gabbouj","doi":"10.1109/MMSP.2005.248639","DOIUrl":"https://doi.org/10.1109/MMSP.2005.248639","url":null,"abstract":"In this paper we propose a special bit allocation method which can be used in most rate control algorithms in variable rate video applications. In real-time video communication applications, we need a constant short-term average bitrate, while in variable bitrate applications such as streaming and local recording applications, a constant long-term average bitrate is sufficient and more short-term variation in bitrate is acceptable. In comparison with constant bitrate video, a variable bitrate video can provide better visual quality and coding efficiency for compressed video sequences. Furthermore, while more variation in bitrate is possible we have additional degrees of freedom to control the encoding parameters. We propose a special bit allocation algorithm to take advantage of this freedom in variable bitrate video. We introduce a new type of frame namely SPP frame (SPecial P frame) that can be used in combination with I, P, B and other types of frames in different encoders including H.263, MPEG-4 and H.264/AVC encoders. We propose a simple method to implement the SPP frames independently of the rate control algorithm. The experimental results show that the SPP frames can considerably increase the total average quality of variable rate encoded video","PeriodicalId":191719,"journal":{"name":"2005 IEEE 7th Workshop on Multimedia Signal Processing","volume":"385 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127768296","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Unequal Forced Intra-Refresh for Real-time Multicast Video","authors":"Hao Liu, Wenjun Zhang, Yutao Dong, Xiangzhong Fang","doi":"10.1109/MMSP.2005.248659","DOIUrl":"https://doi.org/10.1109/MMSP.2005.248659","url":null,"abstract":"In motion-compensated video coding, the errors caused by packet loss not only impair the reconstruction quality of current frame, but also lead to error propagation to subsequent frames. Based on the error-propagation analysis in a group of pictures (GOP), we propose an unequal forced intra-refresh scheme to increase error resilience of multicast video. According to a GOP-level error-propagation model, the proposed scheme can distribute the unequal number of forced intra-mode MBs to different P-frames of a GOP. Experimental results show that the proposed scheme can effectively mitigate the error-propagation effect and achieve about 0.1~1.1 dB gains over the traditional average scheme in H.264/AVC","PeriodicalId":191719,"journal":{"name":"2005 IEEE 7th Workshop on Multimedia Signal Processing","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133970993","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Perception Principles Guided Video Segmentation","authors":"Cheng Chen, Guoliang Fan","doi":"10.1109/MMSP.2005.248664","DOIUrl":"https://doi.org/10.1109/MMSP.2005.248664","url":null,"abstract":"In this paper, we present a perception principles-guided video segmentation method, where statistical modeling and graph-theoretic approaches are combined in a multi-layer classification architecture. Various visual cues are effectively incorporated in a sequential segmentation process. Specifically, low-level pixel-wise features are used in the first layer where a joint spatio-temporal statistical modeling approach is used to construct entry-level visual units in space-time. In the second layer, all units are first classified into dynamic or static units based their motion magnitudes. Then dynamic units are further parsed into over-segmented moving regions that are connected in space and time, and a mid-level feature, motion trajectory, is extracted for each moving region. In the third layer, still and moving regions are merged into background and moving objects by a graph-based approach with different similarity metrics. The proposed algorithm employs both long-range motion information, i.e., trajectory, and short-range motion information, i.e., change detection, to retain temporal continuity and spatial homogeneity of moving objects. The proposed multi-layer structure ensembles the joint spatio-temporal and cascade process of perception principles and support efficient and accurate object segmentation","PeriodicalId":191719,"journal":{"name":"2005 IEEE 7th Workshop on Multimedia Signal Processing","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123163694","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}