{"title":"Fuzzy coded trellis quantization","authors":"D. Gleich, P. Planinsic, Z. Cucej","doi":"10.1109/VIPROM.2002.1026626","DOIUrl":"https://doi.org/10.1109/VIPROM.2002.1026626","url":null,"abstract":"In this paper we propose an image compression algorithm based on embedded trellis quantization and inventive fuzzy context based modeling (TCQFCB). The indices produced by trellis-coded quantization are adaptively arithmetically coded, bit plane by bit plane. The fuzzy logic is used to assign the probability of the coded symbol for an arithmetic coder. The probability of the coded bit depends on the previous coded bits. The fuzzy logic is used in order to attribute the highest possible probability of the coded bit. The performance of the proposed method is comparable with the state-of-the-art methods.","PeriodicalId":223771,"journal":{"name":"International Symposium on VIPromCom Video/Image Processing and Multimedia Communications","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124800823","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Scale correlation-based edge detection","authors":"P. Bao, Lei Zhang","doi":"10.1109/VIPROM.2002.1026680","DOIUrl":"https://doi.org/10.1109/VIPROM.2002.1026680","url":null,"abstract":"This paper proposes an effective edge detection scheme based on the scale correlation in the wavelet transform that is equivalent to Canny edge detection. A correlation function is defined as the product of two adjacent wavelet subbands to magnify edges while filtering noise. Unlike many multi-scale techniques that form the edge maps on different scales and then synthesize them into a spatial edge map, our scheme detects edges as local maxima directly in the correlation function. The scheme totally avoids the potentially ill-posed synthesizing operation. The product of detection and localization criteria is higher than that of a single scale, which results in better edge detection. The dislocation of neighboring edges is also improved. The performance of the presented scheme is illustrated by synthetic and natural images.","PeriodicalId":223771,"journal":{"name":"International Symposium on VIPromCom Video/Image Processing and Multimedia Communications","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130915170","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A drift compensation architecture for DCT-pyramid video coding","authors":"R. Atta, M. Ghanbari","doi":"10.1109/VIPROM.2002.1026675","DOIUrl":"https://doi.org/10.1109/VIPROM.2002.1026675","url":null,"abstract":"A new layered video coding scheme is proposed for efficient spatial scalability based on the DCT pyramid. The proposed scheme is introduced to eliminate the drift error associated with reduced resolution video. A new algorithm to predict the motion vector of the reduced resolution video from the higher resolution motion vectors is presented. Based on simulation results, the proposed scheme achieves better coding efficiency and lower complexity than H.263+ spatial scalability.","PeriodicalId":223771,"journal":{"name":"International Symposium on VIPromCom Video/Image Processing and Multimedia Communications","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115991418","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The design and implementation of a Chinese financial invoice recognition system","authors":"Ming Delie, Liu Jian, Tian Jinwen","doi":"10.1109/VIPROM.2002.1026632","DOIUrl":"https://doi.org/10.1109/VIPROM.2002.1026632","url":null,"abstract":"This paper designs and implements a financial invoice recognition system based on the features of the Chinese financial invoice. By using the linear whole block moving method in each vertical segment, a new fast algorithm is put forward to detect and rectify slanted images. To distinguish the different form types (the foundation necessary for locating the form fields, filtering the form lines, etc), several representative form features are discussed and an invoice-type features library is built by using a semi-automatic machine study method. On the basis of the recognized invoice type, a real invoice form is re-oriented against the corresponding blank form according to the invoice type feature, solving the problem of adhesion of characters and form lines, as well as the problem of character segmentation and recognition. Based on the financial Chinese invoice image feature, a mutual rectification mechanism founded on the recognition results of financial Chinese characters and Arabic numerals is put forward to raise the recognition rate. Finally, experimental results and conclusions are presented.","PeriodicalId":223771,"journal":{"name":"International Symposium on VIPromCom Video/Image Processing and Multimedia Communications","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129048625","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Embedded lossy image compression based on wavelet transform","authors":"M.B. Pardo, C.T. van der Reijden","doi":"10.1109/VIPROM.2002.1026654","DOIUrl":"https://doi.org/10.1109/VIPROM.2002.1026654","url":null,"abstract":"This paper describes an algorithm for embedded lossy image compression based on the discrete wavelet transform (DWT) and the zerotree coding scheme (EZW). In this algorithm there are two lossy stages, a rounding operation after the transform and a truncation of the coding output. A comparative study of different filter banks (from different wavelet families and with different filter lengths) for the wavelet transform and different image scans for the encoding is presented. We show their impact on realistic image compression results for our algorithm, especially in compression ratio and reconstruction error.","PeriodicalId":223771,"journal":{"name":"International Symposium on VIPromCom Video/Image Processing and Multimedia Communications","volume":"1981 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130495999","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Implementation of the Hough transform by the on-line mode","authors":"H. Bessalah, F. Alim, S. Seddiki","doi":"10.1109/VIPROM.2002.1026649","DOIUrl":"https://doi.org/10.1109/VIPROM.2002.1026649","url":null,"abstract":"In this paper, the implementation of a new algorithm for the calculation of the Hough transform (HT) with on-line arithmetic is introduced. This algorithm considerably reduces the use of multipliers and tables of transfer. The main idea consists of using a combination of incremental method with the usual HT expression calling on-line calculation mode, which is efficient for real time applications.","PeriodicalId":223771,"journal":{"name":"International Symposium on VIPromCom Video/Image Processing and Multimedia Communications","volume":"1941 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129121085","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"How to plan, develop and evaluate multimedia applications - a simple model","authors":"A. Prata, P. Lopes","doi":"10.1109/VIPROM.2002.1026638","DOIUrl":"https://doi.org/10.1109/VIPROM.2002.1026638","url":null,"abstract":"This document describes a model for the planning, development and evaluation of educational multimedia applications (which resulted in the author's master's thesis in the area of multimedia systems). In view of the results obtained with this model, it is being used at the Escola Superior de Ciencias Empresariais (Superior School of Management Sciences in the city of Setubal, Portugal), where the author teaches.","PeriodicalId":223771,"journal":{"name":"International Symposium on VIPromCom Video/Image Processing and Multimedia Communications","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115974805","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Combating geometrical attacks in a DWT based blind video watermarking system","authors":"C. Serdean, M. Ambroze, M. Tomlinson, G. Wade","doi":"10.1109/VIPROM.2002.1026666","DOIUrl":"https://doi.org/10.1109/VIPROM.2002.1026666","url":null,"abstract":"This paper describes a high capacity blind video watermarking system invariant to geometrical attacks such as shift, rotation, scaling and cropping. A spatial domain reference watermark is used to obtain invariance to geometric attacks by employing image registration techniques to determine and invert the attacks. A second, high capacity watermark, which carries the data payload, is embedded in the wavelet domain according to a human visual system (HVS) model. This is protected by a state-of-the-art error correction code (turbo code). The proposed system is invariant to scaling up to 180%, rotation up to 70° and arbitrary aspect ratio changes up to 200% on both axes. Furthermore, the system is virtually invariant to any shifting, cropping, or combined shifting and cropping.","PeriodicalId":223771,"journal":{"name":"International Symposium on VIPromCom Video/Image Processing and Multimedia Communications","volume":"115 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115434916","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Tracking moving objects with co-evolutionary snakes","authors":"P. Liatsis, C. Ooi","doi":"10.1109/VIPROM.2002.1026677","DOIUrl":"https://doi.org/10.1109/VIPROM.2002.1026677","url":null,"abstract":"A new symbiotic genetic algorithm (SGA) based active contour model (snake) is proposed to track the B-spline contour of obstacles. It exploits the local control properties of the B-spline to decompose the contour into subcontours and optimizes each subcontour in separate genetic algorithms (GA). Unlike the GA-based snake, an SGA snake can track the obstacle's outline more robustly. Application-specific inter-population genetic operators are introduced to reinforce the symbiotic relationship via migration of genetic material. The use of symbiosis dramatically reduces the combinatorics of the search space, when compared to GAs. Results of tracking objects in real road scenarios demonstrate its robustness to noise and stability of convergence when compared to its GA counterpart.","PeriodicalId":223771,"journal":{"name":"International Symposium on VIPromCom Video/Image Processing and Multimedia Communications","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128101503","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Adaptive transport protocol for broadband fixed-wireless systems","authors":"T. Zahariadis, N. Vogiatzis, I. Kitroser, A. Andritsou","doi":"10.1109/VIPROM.2002.1026690","DOIUrl":"https://doi.org/10.1109/VIPROM.2002.1026690","url":null,"abstract":"New interactive multimedia services have increased the urgent requirement for more bandwidth and made broadband wireless technology one of the most competitive solutions for the last mile problem. This paper presents an adaptive transport protocol design and evaluation for a multicarrier point-to-multipoint outdoor broadband wireless access system, in the 5.8 GHz and 10.5 GHz frequency bands. With minimal overhead, the protocol is able to support multiple terminals and services that range from video broadcasting to full symmetric traffic. Moreover, adaptive modulation and coding schemes can be applied per terminal per time frame, in either the downlink or the uplink direction. In the proposed system various innovative techniques are used for efficient utilization of the available bandwidth. Finally, the protocol gain is evaluated.","PeriodicalId":223771,"journal":{"name":"International Symposium on VIPromCom Video/Image Processing and Multimedia Communications","volume":"56 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126259567","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}