International Symposium on VIPromCom Video/Image Processing and Multimedia Communications最新文献

筛选
英文 中文
Fuzzy coded trellis quantization 模糊编码网格量化
D. Gleich, P. Planinsic, Z. Cucej
{"title":"Fuzzy coded trellis quantization","authors":"D. Gleich, P. Planinsic, Z. Cucej","doi":"10.1109/VIPROM.2002.1026626","DOIUrl":"https://doi.org/10.1109/VIPROM.2002.1026626","url":null,"abstract":"In this paper we propose an image compression algorithm based on embedded trellis quantization and inventive fuzzy context based modeling (TCQFCB). The indices produced by trellis-coded quantization are adaptively arithmetically coded, bit plane by bit plane. The fuzzy logic is used to assign the probability of the coded symbol for an arithmetic coder. The probability of the coded bit depends on the previous coded bits. The fuzzy logic is used in order to attribute the highest possible probability of the coded bit. The performance of the proposed method is comparable with the state-of-the-art methods.","PeriodicalId":223771,"journal":{"name":"International Symposium on VIPromCom Video/Image Processing and Multimedia Communications","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124800823","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Scale correlation-based edge detection 基于尺度相关的边缘检测
P. Bao, Lei Zhang
{"title":"Scale correlation-based edge detection","authors":"P. Bao, Lei Zhang","doi":"10.1109/VIPROM.2002.1026680","DOIUrl":"https://doi.org/10.1109/VIPROM.2002.1026680","url":null,"abstract":"This paper proposes an effective edge detection scheme based on the scale correlation in the wavelet transform that is equivalent to Canny edge detection. A correlation function is defined as the product of two adjacent wavelet subbands to magnify edges while filtering noise. Unlike many multi-scale techniques that form the edge maps on different scales and then synthesize them into a spatial edge map, our scheme detects edges as local maxima directly in the correlation function. The scheme totally avoids the potentially ill-posed synthesizing operation. The product of detection and localization criteria is higher than that of a single scale, which results in means better edge detection. The dislocation of neighboring edges is also improved. The performance of the presented scheme is illustrated by synthetic and natural images.","PeriodicalId":223771,"journal":{"name":"International Symposium on VIPromCom Video/Image Processing and Multimedia Communications","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130915170","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A drift compensation architecture for DCT-pyramid video coding dct -金字塔视频编码的漂移补偿结构
R. Atta, M. Ghanbari
{"title":"A drift compensation architecture for DCT-pyramid video coding","authors":"R. Atta, M. Ghanbari","doi":"10.1109/VIPROM.2002.1026675","DOIUrl":"https://doi.org/10.1109/VIPROM.2002.1026675","url":null,"abstract":"A new-layered video coding scheme is proposed for efficient spatial scalability based on the DCT pyramid. The proposed scheme is introduced to eliminate the drift error associated with reduced resolution video. A new algorithm to predict the motion vector of the reduced resolution video from the higher resolution motion vectors is presented. Based on simulation results, the proposed scheme achieves better coding efficiency and lower complexity than H.263+ spatial scalability.","PeriodicalId":223771,"journal":{"name":"International Symposium on VIPromCom Video/Image Processing and Multimedia Communications","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115991418","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
The design and implementation of a Chinese financial invoice recognition system 中国财务发票识别系统的设计与实现
Ming Delie, Liu Jian, Tian Jinwen
{"title":"The design and implementation of a Chinese financial invoice recognition system","authors":"Ming Delie, Liu Jian, Tian Jinwen","doi":"10.1109/VIPROM.2002.1026632","DOIUrl":"https://doi.org/10.1109/VIPROM.2002.1026632","url":null,"abstract":"This paper designs and implements a financial invoice recognition system based on the features of the Chinese financial invoice. By using the linear whole block moving method in each vertical segment, a new fast algorithm is put forward to detect and rectify slanted images. To distinguish the different form types (the foundation necessary for locating the form fields, filtering the form lines, etc), several representative form features are discussed and an invoice-type features library is built by using a semi-automatic machine study method. On the basis of the recognized invoice type, a real invoice form is re-oriented against the corresponding blank form according to the invoice type feature, solving the problem of adhesion of characters and form lines, as well as the problem of character segmentation and recognition. Based on the financial Chinese invoice image feature, a mutual rectification mechanism founded on the recognition results of financial Chinese characters and Arabic numerals is put forward to raise the recognition rate. Finally, experimental results and conclusions are presented.","PeriodicalId":223771,"journal":{"name":"International Symposium on VIPromCom Video/Image Processing and Multimedia Communications","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129048625","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Embedded lossy image compression based on wavelet transform 基于小波变换的嵌入式有损图像压缩
M.B. Pardo, C.T. van der Reijden
{"title":"Embedded lossy image compression based on wavelet transform","authors":"M.B. Pardo, C.T. van der Reijden","doi":"10.1109/VIPROM.2002.1026654","DOIUrl":"https://doi.org/10.1109/VIPROM.2002.1026654","url":null,"abstract":"This paper describes an algorithm for embedded lossy image compression based on the discrete wavelet transform (DWT) and the zerotree coding scheme (EZW). In this algorithm there are two lossy stages, a rounding operation after the transform and a truncation of the coding output. A comparative study of different filter banks (from different wavelet families and with different filter lengths) for the wavelet transform and different image scans for the encoding is presented. We show their impact in realistic image compression results for our algorithm, specially in compression ratio and reconstruction error.","PeriodicalId":223771,"journal":{"name":"International Symposium on VIPromCom Video/Image Processing and Multimedia Communications","volume":"1981 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130495999","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Implementation of the Hough transform by the on-line mode 利用在线模式实现霍夫变换
H. Bessalah, F. Alim, S. Seddiki
{"title":"Implementation of the Hough transform by the on-line mode","authors":"H. Bessalah, F. Alim, S. Seddiki","doi":"10.1109/VIPROM.2002.1026649","DOIUrl":"https://doi.org/10.1109/VIPROM.2002.1026649","url":null,"abstract":"In this paper, the implementation of a new algorithm for the calculation of the Hough transform (HT) with on-line arithmetic is introduced. This algorithm allows reducing considerably the use of multipliers and tables of transfer. The main idea consists of using a combination of incremental method with the usual HT expression calling on-line calculation mode, which is efficient for real time applications.","PeriodicalId":223771,"journal":{"name":"International Symposium on VIPromCom Video/Image Processing and Multimedia Communications","volume":"1941 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129121085","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
How to plan, develop and evaluate multimedia applications - a simple model 如何规划、开发和评估多媒体应用程序——一个简单的模型
A. Prata, P. Lopes
{"title":"How to plan, develop and evaluate multimedia applications - a simple model","authors":"A. Prata, P. Lopes","doi":"10.1109/VIPROM.2002.1026638","DOIUrl":"https://doi.org/10.1109/VIPROM.2002.1026638","url":null,"abstract":"This document describes a model for the planning, development and evaluation of educational multimedia applications (which resulted in the author's master's thesis in the area of multimedia systems). Having in count the results obtained with the referred model, it is being used at the Escola Superior de Ciencias Empresariais (Superior School of Management Sciences in the city of Setubal, Portugal), where the author teaches.","PeriodicalId":223771,"journal":{"name":"International Symposium on VIPromCom Video/Image Processing and Multimedia Communications","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115974805","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Combating geometrical attacks in a DWT based blind video watermarking system 基于小波变换的盲视频水印系统抗几何攻击
C. Serdean, M. Ambroze, M. Tomlinson, G. Wade
{"title":"Combating geometrical attacks in a DWT based blind video watermarking system","authors":"C. Serdean, M. Ambroze, M. Tomlinson, G. Wade","doi":"10.1109/VIPROM.2002.1026666","DOIUrl":"https://doi.org/10.1109/VIPROM.2002.1026666","url":null,"abstract":"This paper describes a high capacity blind video watermarking system invariant to geometrical attacks such as shift, rotation, scaling and cropping. A spatial domain reference watermark is used to obtain invariance to geometric attacks by employing image registration techniques to determine and invert the attacks. A second, high capacity watermark, which carries the data payload, is embedded in the wavelet domain according to a human visual system (HVS) model. This is protected by a state-of-the-art error correction code (turbo code). The proposed system is invariant to scaling up to 180%, rotation up to 70/spl deg/ and arbitrary aspect ratio changes up to 200% on both axes. Furthermore, the system is virtually invariant to any shifting, cropping, or combined shifting and cropping.","PeriodicalId":223771,"journal":{"name":"International Symposium on VIPromCom Video/Image Processing and Multimedia Communications","volume":"115 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115434916","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 22
Tracking moving objects with co-evolutionary snakes 用共同进化的蛇追踪移动的物体
P. Liatsis, C. Ooi
{"title":"Tracking moving objects with co-evolutionary snakes","authors":"P. Liatsis, C. Ooi","doi":"10.1109/VIPROM.2002.1026677","DOIUrl":"https://doi.org/10.1109/VIPROM.2002.1026677","url":null,"abstract":"A new symbiotic genetic algorithm (SGA) based active contour model (snake) is proposed to track the B-spline contour of obstacles. It exploits the local control properties of the B-spline to decompose the contour into subcontours and optimizes each subcontour in separate genetic algorithms (GA). Unlike the GA-based snake, an SGA snake can track the obstacle's outline more robustly. Application-specific inter-population genetic operators are introduced to reinforce the symbiotic relationship via migration of genetic material. The use of symbiosis dramatically reduces the combinatorics of the search space, when compared to GAs. Results of tracking objects in real road scenarios demonstrate its robustness to noise and stability of convergence when compared to its GA counterpart.","PeriodicalId":223771,"journal":{"name":"International Symposium on VIPromCom Video/Image Processing and Multimedia Communications","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128101503","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Adaptive transport protocol for broadband fixed-wireless systems 宽带固定无线系统的自适应传输协议
T. Zahariadis, N. Vogiatzis, I. Kitroser, A. Andritsou
{"title":"Adaptive transport protocol for broadband fixed-wireless systems","authors":"T. Zahariadis, N. Vogiatzis, I. Kitroser, A. Andritsou","doi":"10.1109/VIPROM.2002.1026690","DOIUrl":"https://doi.org/10.1109/VIPROM.2002.1026690","url":null,"abstract":"New interactive multimedia services have increased the urgent requirement for more bandwidth and turn broadband wireless technology as one of the most competitive solutions for the last mile problem. This paper presents an adaptive transport protocol design and evaluation for a multicarrier point-to-multipoint outdoor broadband wireless access system, in the 5.8 GHz and 10.5 GHz frequency bands. With minimal overhead, the protocol is able to support multiple terminals and services that range from video broadcasting to full symmetric traffic. Moreover, adaptive modulation and coding schemes can be applied per terminal per time frame, in either the downlink or the uplink direction. In the proposed system various innovative techniques are used for efficient utilization of the available bandwidth. Finally, the protocol gain is evaluated.","PeriodicalId":223771,"journal":{"name":"International Symposium on VIPromCom Video/Image Processing and Multimedia Communications","volume":"56 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126259567","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信