{"title":"Performance of MPEG-7 low level audio descriptors with compressed data","authors":"J. Lukasiak, D. Stirling, N. Harders, S. Perrow","doi":"10.1109/ICME.2003.1221301","DOIUrl":"https://doi.org/10.1109/ICME.2003.1221301","url":null,"abstract":"This paper presents a detailed analysis of lossy compression effects on a set of the MPEG-7 low-level audio descriptors. The analysis results show that lossy compression has a detrimental effect on the integrity of practical search and retrieval schemes that utilize the low level audio descriptors. Methods are then proposed to reduce the detrimental effects of compression in searching schemes. These proposed methods include multi-frame searching and machine learning derived prediction. The proposed mechanisms greatly reduce the effect of compression on the set of MPEG-7 descriptors; however, future scope is identified to develop new audio descriptors that account for compression effects in their structure.","PeriodicalId":118560,"journal":{"name":"2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131227235","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A protocol with transcoding to support QoS over Internet for multimedia traffic","authors":"Rajeev Kumar","doi":"10.1109/ICME.2003.1220955","DOIUrl":"https://doi.org/10.1109/ICME.2003.1220955","url":null,"abstract":"The growth of the Internet has brought with it a tremendous volume of multimedia traffic, which is bursty in nature. Providing a required QoS as well as modeling multimedia traffic has been a challenging task. In this work, we transcode multimedia data to cater for low bandwidth availability and different end-user requirements. We propose a protocol architecture which has been developed by the amalgamation of well-known components and that would provide guaranteed multimedia communication over the Internet. We model multimedia traffic using the M/Pareto distribution in an attempt to represent realistic traffic pattern. We use semantics of multimedia data-streams for transcoding to avoid network congestion and to ensure optimal use of network resources. The impact of transcoding the multimedia data to suit it to the network load and the end user requirements is also studied. The simulation results are presented and compared.","PeriodicalId":118560,"journal":{"name":"2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698)","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131544579","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Time interval maximum entropy based event indexing in soccer video","authors":"Cees G. M. Snoek, M. Worring","doi":"10.1109/ICME.2003.1221353","DOIUrl":"https://doi.org/10.1109/ICME.2003.1221353","url":null,"abstract":"Multimodal indexing of events in video documents poses problems with respect to representation, inclusion of contextual information, and synchronization of the heterogeneous information sources involved. In this paper, we present the time interval maximum entropy (TIME) framework that tackles aforementioned problems. To demonstrate the viability of TIME for event classification in multimodal video, an evaluation was performed on the domain of soccer broadcasts. It was found that by applying TIME, the amount of video a user has to watch in order to see almost all highlights is reduced considerably.","PeriodicalId":118560,"journal":{"name":"2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698)","volume":"116 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116523573","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Efficient buffering control for a software-only, high-level, high-profile, MPEG-2 decoder","authors":"Ju Wang, Jonathan C. L. Liu, Yishu He","doi":"10.1109/ICME.2003.1221355","DOIUrl":"https://doi.org/10.1109/ICME.2003.1221355","url":null,"abstract":"A high-quality MPEG-2 software decoder should support a good scalability performance for a wide range of video format, especially for the high-resolution MPEG-2 video (e.g., HDTV). However, it is found that the existing parallel decoder suffers significant performance degradation when decoding high-level MPEG-2 video with the full system configuration, due to inefficient management mechanism such that the memory space in the decoder. We propose an efficient buffer management mechanism such that the memory requirement is reduced by 50%. This is approached by two steps: first we use an ST scheme to minimize the transmission buffer in a slave node by allowing dynamic sharing between frames in one group of picture (GOP). Then we further reduce the buffer space by a dynamic buffer allocation according to image type. The revised parallel decode showed a satisfactory scale-up performance when decoding the high-resolution video formats.","PeriodicalId":118560,"journal":{"name":"2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121884605","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A hybrid pagoda broadcasting protocol: fixed-delay pagoda broadcasting protocol with partial preloading","authors":"Hong Kee Sul, Hyunchul Kim, K. Chon","doi":"10.1109/ICME.2003.1221039","DOIUrl":"https://doi.org/10.1109/ICME.2003.1221039","url":null,"abstract":"Broadcasting protocols offer an efficient and scalable method to provide video-on-demand. We present a new broadcasting protocol that assumes that some portions of the video are already preloaded in the set-top-box (STB), and at the same time, requires the user to wait for a fixed-delay before viewing. As a result, there is a trade-off between the size of the consumed local storage, user waiting time, and bandwidth consumption. This trade-off makes this protocol very flexible, in that we can control the consumption of one resource by adjusting the use of the other two. Also, the performance of the proposed protocol is not very far from the theoretical minimum. We also present a heuristic version of the protocol, in which the performance is improved a little.","PeriodicalId":118560,"journal":{"name":"2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698)","volume":"85 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127735190","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
M. Lara, A. Orozco-Lugo, D. McLernon, Hugo J. Muro-Lemus
{"title":"Blind recovery of multiple packets in ad hoc mobile networks using polynomial phase modulating sequences","authors":"M. Lara, A. Orozco-Lugo, D. McLernon, Hugo J. Muro-Lemus","doi":"10.1109/ICME.2003.1221694","DOIUrl":"https://doi.org/10.1109/ICME.2003.1221694","url":null,"abstract":"We consider multiple packet reception (MPR) for asynchronous random access wireless mobile ad hoc networks. An interference cancellation algorithm is proposed that exploits the base-band cyclostationarity properties of the signal, which are induced at the transmitters by means of modulating the symbols with distinct polynomial phase sequences. In contrast to the method presented in [A.J. van der Veen and L. tong, 2002], the proposed technique docs not require knowledge of the starting time of transmission of the desired signal and can be applied to dispersive multipath channels. Also a practical way of assigning the modulating sequences via the use of a common codebook known to all nodes is proposed, and the impact on local throughput of such scheme is analyzed.","PeriodicalId":118560,"journal":{"name":"2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698)","volume":"59 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132625631","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Latest arrival time leaky bucket for HRD constrained video coding","authors":"Lujun Yuan, Wen Gao, Yan Lu","doi":"10.1109/ICME.2003.1221731","DOIUrl":"https://doi.org/10.1109/ICME.2003.1221731","url":null,"abstract":"Hypothetical reference decoder (HRD) is a mathematical model of a decoder and its input buffer. It represents a set of normative requirements on bitstream for the purpose of avoiding buffer overflow and underflow. In other words, any coded video bitstream shall meet constraints imposed by HRD model. Two HRD models, i.e. the earliest arrival time leaky bucket (EAT- LB) and the constrained arrival time leaky bucket (CAT-LB), have been proposed for the JVT standard jointly developed by ISO/IEC and ITU-T. EAT-LB has the lower initial delay, whereas the CAT-LB model has the lower maximum delay. This paper proposes an improved leaky bucket model called latest arrival time leaky bucket (LAT-LB) so as to achieve the advantages of EAT-LB and CAT-LB simultaneously. The proposed LAT-LB first defines the data transmission schedule according to a set of resume points and stop points, and then derives the HRD parameters according to this schedule. Experimental results demonstrate that the proposed LAT-LB outperforms EAT-LB and CAT-LB in terms of initial delay and maximum delay.","PeriodicalId":118560,"journal":{"name":"2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134229437","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A drift-free motion-compensated predictive encoding technique for multiple description coding","authors":"Yen-Chi Lee, Y. Altunbasak, R. Mersereau","doi":"10.1109/ICME.2003.1221378","DOIUrl":"https://doi.org/10.1109/ICME.2003.1221378","url":null,"abstract":"Multiple description coding (MDC) is a source coding technique that exploits path diversity to increase the robustness of transmitting a compressed signal over error-prone channels. However, the applicant of MDC to video coding is still problematic because a prediction mismatch problem, called drift, may occur at the decoder when one description is lost. In this paper, we propose a drift free motion-compensated predictive coding method for multiple description scalar quantizers. The proposed method maintains two more prediction loops at the encoder side to produce all possible predictions. Then, the two descriptions are generated from these two additional prediction loops in such a way that the drift can be prevented when only one description is received. By receiving both descriptions, the decoder can still combine these two descriptions in the central prediction loop to improve the video quality. Experimental results show that the proposed method can effectively prevent drift and improve the quality of single-description reconstruction by 0.2-0.8 dB as compared to the method in [A. Reibman et al., 1999].","PeriodicalId":118560,"journal":{"name":"2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698)","volume":"58 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134493272","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Progressive image transmission by adaptive interpolation","authors":"T. Shih, Louis H. Lin, Jen-Shiun Chiang","doi":"10.1109/ICME.2003.1221573","DOIUrl":"https://doi.org/10.1109/ICME.2003.1221573","url":null,"abstract":"Progressive image transmission is a mechanism that transmits the most significant portion of an image, followed by its less important parts. Applications of such a mechanism include browsing large image files on the Internet. We propose an adaptive mechanism, based on the characteristics of images. The mechanism use neighbor pixels to guess a target pixel value, without actually transmitting the target pixel. An error correction scheme is also designed to cope with a failure guessing. The prototype is tested on 1500 bit-mapped pictures of different categories. Preliminary results should that the transmission rate is lower than others, with reasonable PSNR values of the transmitted images. Interested readers can find the prototype tool and our evaluations at http://www.mine.tku.edu.tw/demos/ProgTransmission.","PeriodicalId":118560,"journal":{"name":"2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133888839","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Gaurav Harit, S. Chaudhury, Gaurav Garg, P. Sharma
{"title":"A framework for video representation and transcoding using appearance spaces","authors":"Gaurav Harit, S. Chaudhury, Gaurav Garg, P. Sharma","doi":"10.1109/ICME.2003.1221381","DOIUrl":"https://doi.org/10.1109/ICME.2003.1221381","url":null,"abstract":"We present a novel scheme for object-based video sequence presentation using appearance spaces. Our scheme enables fully automatic extraction of semantic video objects for a class of sequences, and their supervised organization in an object-class hierarchy. The hierarchy can be used for generic classification of query video objects, and transcoding using semantics of video objects.","PeriodicalId":118560,"journal":{"name":"2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130345732","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}