Proceedings of the International Conference on Signal Processing and Multimedia Applications: Latest Articles

Estimation-decoding on LDPC-based 2D-barcodes
W. Proß, M. Otesteanu, F. Quint
{"title":"Estimation-decoding on LDPC-based 2D-barcodes","authors":"W. Proß, M. Otesteanu, F. Quint","doi":"10.5220/0003457400340039","DOIUrl":"https://doi.org/10.5220/0003457400340039","url":null,"abstract":"In this paper we propose an extension of the Estimation-Decoding algorithm for the decoding of our Data Matrix Code (DMC), which is based on Low-Density-Parity-Check (LDPC) codes and is designed for use in industrial environment. To include possible damages in the channel-model, a Markov-modulated Gaussian channel (MMGC) was chosen to represent everything in between the embossing of a LDPC-based DMC and the camera-based acquisition. The MMGC is based on a Hidden-Markov-Model (HMM) that turns into a two-dimensional model when used in the context of DMCs. The proposed ED2D-algorithm (Estimation-Decoding in two dimensions) is implemented to operate on a 2D-LDPC-Markov factor graph that comprises of a LDPC code's Tanner-graph and a 2D-HMM. For a subsequent comparison between different barcodes in industrial environment, a simulation of typical damages has been implemented. Tests showed a superior decoding behavior of our LDPC-based DMC decoded with the ED2D-decoder over the standard Reed-Solomon-based DMC.","PeriodicalId":103791,"journal":{"name":"Proceedings of the International Conference on Signal Processing and Multimedia Applications","volume":"41 5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129880185","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
Wireless in-vehicle complaint driver environment recorder
O. Siordia, Isaac Martín de Diego, C. Conde, E. Cabello
{"title":"Wireless in-vehicle complaint driver environment recorder","authors":"O. Siordia, Isaac Martín de Diego, C. Conde, E. Cabello","doi":"10.5220/0003567300520058","DOIUrl":"https://doi.org/10.5220/0003567300520058","url":null,"abstract":"In this paper, an in-vehicle complaint recording device is presented. The device is divided in independent systems for image and audio data acquisition and storage. The systems, designed to work under in-vehicle complaint devices, use existent in-vehicle wireless architectures for its communication. Several tests of the recording device in a highly realistic truck simulator show the reliability of the developed system to acquire and store driver related data. The acquired data will be used for the development of a valid methodology for the reconstruction and study of traffic accidents.","PeriodicalId":103791,"journal":{"name":"Proceedings of the International Conference on Signal Processing and Multimedia Applications","volume":"114 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124561579","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 5
A non-uniform real-time speech time-scale stretching method
A. Kupryjanow, A. Czyżewski
{"title":"A non-uniform real-time speech time-scale stretching method","authors":"A. Kupryjanow, A. Czyżewski","doi":"10.5220/0003456300270033","DOIUrl":"https://doi.org/10.5220/0003456300270033","url":null,"abstract":"An algorithm for non-uniform real-time speech stretching is presented. It provides a combination of typical SOLA algorithm (Synchronous Overlap and Add) with the vowels, consonants and silence detectors. Based on the information about the content and the estimated value of the rate of speech (ROS), the algorithm adapts the scaling factor value. The ability of real-time speech stretching and the resultant quality of voice were analysed. Subjective tests were performed in order to compare the quality of the proposed method with the output of the standard SOLA algorithm. Accuracy of the ROS estimation was assessed to prove its robustness.","PeriodicalId":103791,"journal":{"name":"Proceedings of the International Conference on Signal Processing and Multimedia Applications","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122709416","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 4
Managing multiple media streams in HTML5: The IEEE 1599-2008 case study
Stefano Baldan, L. A. Ludovico, D. Mauro
{"title":"Managing multiple media streams in HTML5: The IEEE 1599-2008 case study","authors":"Stefano Baldan, L. A. Ludovico, D. Mauro","doi":"10.5220/0003651401930199","DOIUrl":"https://doi.org/10.5220/0003651401930199","url":null,"abstract":"This paper deals with the problem of managing multiple multimedia streams in a Web environment. Multimedia types to support are pure audio, video with no sound, and audio/video. Data streams refer to the same event or performance, consequently they both have and should maintain mutual synchronization. Besides, a Web player should be able to play different multimedia streams simultaneously, as well as to switch from one to another in real time. The clarifying example of a music piece encoded in IEEE 1599 format will be presented as a case study.","PeriodicalId":103791,"journal":{"name":"Proceedings of the International Conference on Signal Processing and Multimedia Applications","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115583542","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 4
Image matching algorithms in stereo vision using address-event-representation: A theoretical study and evaluation of the different algorithms
M. Domínguez-Morales, Elena Cerezuela-Escudero, A. Jiménez-Fernandez, R. Paz-Vicente, Juan Luis Font-Calvo, P. Iñigo-Blasco, A. Linares-Barranco, G. Jiménez-Moreno
{"title":"Image matching algorithms in stereo vision using address-event-representation: A theoretical study and evaluation of the different algorithms","authors":"M. Domínguez-Morales, Elena Cerezuela-Escudero, A. Jiménez-Fernandez, R. Paz-Vicente, Juan Luis Font-Calvo, P. Iñigo-Blasco, A. Linares-Barranco, G. Jiménez-Moreno","doi":"10.5220/0003518500790084","DOIUrl":"https://doi.org/10.5220/0003518500790084","url":null,"abstract":"Image processing in digital computer systems usually considers the visual information as a sequence of frames. These frames are from cameras that capture reality for a short period of time. They are renewed and transmitted at a rate of 25–30 fps (typical real-time scenario). Digital video processing has to process each frame in order to obtain a filter result or detect a feature on the input. In stereo vision, existing algorithms use frames from two digital cameras and process them pixel by pixel until it is found a pattern match in a section of both stereo frames. Spike-based processing is a relatively new approach that implements the processing by manipulating spikes one by one at the time they are transmitted, like a human brain. The mammal nervous system is able to solve much more complex problems, such as visual recognition by manipulating neuron's spikes. The spike-based philosophy for visual information processing based on the neuro-inspired Address-Event-Representation (AER) is achieving nowadays very high performances. In this work we study the existing digital stereo matching algorithms and how do they work. After that, we propose an AER stereo matching algorithm using some of the principles shown in digital stereo methods.","PeriodicalId":103791,"journal":{"name":"Proceedings of the International Conference on Signal Processing and Multimedia Applications","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130827799","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 8
Latent topic visual language model for object categorization
Lei Wu, Nenghai Yu, J. Liu, Mingjing Li
{"title":"Latent topic visual language model for object categorization","authors":"Lei Wu, Nenghai Yu, J. Liu, Mingjing Li","doi":"10.5220/0003491601490158","DOIUrl":"https://doi.org/10.5220/0003491601490158","url":null,"abstract":"This paper presents a latent topic visual language model to handle variation problem in object categorization. Variations including different views, styles, poses, etc., have greatly affected the spatial arrangement and distribution of visual features, on which previous categorization models largely depend. Taking the object variations as hidden topics within each category, the proposed model explores the relationship between object variations and visual feature arrangement in the traditional visual language modeling process. With this improvement, the accuracy of object categorization is further boosted. Experiments on Caltech 101 dataset have shown that this model makes sense and is effective.","PeriodicalId":103791,"journal":{"name":"Proceedings of the International Conference on Signal Processing and Multimedia Applications","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114172186","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 2
Two-dimensional codes on mobile devices and the development of the platform
José Manuel Fornés Rumbao, F. R. Rubio
{"title":"Two-dimensional codes on mobile devices and the development of the platform","authors":"José Manuel Fornés Rumbao, F. R. Rubio","doi":"10.5220/0003481200190022","DOIUrl":"https://doi.org/10.5220/0003481200190022","url":null,"abstract":"In the last times, the mobile terminals have experienced an accelerated technological development. This evolution has provided numerous advances in presentation and interactivity in general and it has given rise to the generation of numerous applications for it. Following this line; this article shows how to incorporate on mobile terminals a simple interaction with the environment across the technological successor of the bar codes: the two-dimensional codes. We will use three basic elements-camera quality, growth in data traffic and increased bandwidth in mobile phones-to create a platform that provides to the user an easy and useful way of obtaining information multimedia that improves his relation with the environment. We will look for a complete and global development of the system, that is, the generation of the two-dimensional code; his interaction with the platform and final obtaining of the information in the terminal.","PeriodicalId":103791,"journal":{"name":"Proceedings of the International Conference on Signal Processing and Multimedia Applications","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121668337","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
3D visualization of single images using patch level depth
Shahrouz Yousefi, Farid Abedan Kondori, Haibo Li
{"title":"3D visualization of single images using patch level depth","authors":"Shahrouz Yousefi, Farid Abedan Kondori, Haibo Li","doi":"10.5220/0003511800610066","DOIUrl":"https://doi.org/10.5220/0003511800610066","url":null,"abstract":"In this paper we consider the task of 3D photo visualization using a single monocular image. The main idea is to use single photos taken by capturing devices such as ordinary cameras, mobile phones, tablet PCs etc. and visualize them in 3D on normal displays. Supervised learning approach is hired to retrieve depth information from single images. This algorithm is based on the hierarchical multi-scale Markov Random Field (MRF) which models the depth based on the multi-scale global and local features and relation between them in a monocular image. Consequently, the estimated depth image is used to allocate the specified depth parameters for each pixel in the 3D map. Accordingly, the multi-level depth adjustments and coding for color anaglyphs is performed. Our system receives a single 2D image as input and provides a anaglyph coded 3D image in output. Depending on the coding technology the special low-cost anaglyph glasses for viewers will be used.","PeriodicalId":103791,"journal":{"name":"Proceedings of the International Conference on Signal Processing and Multimedia Applications","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126562324","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 3
Video surveillance at an industrial environment using an address event vision sensor: Comparative between two different video sensor based on a bioinspired retina
F. Perez-Peña, Arturo Morgado Estévez, R. Montero-Gonzalez, A. Linares-Barranco, G. Jiménez-Moreno
{"title":"Video surveillance at an industrial environment using an address event vision sensor: Comparative between two different video sensor based on a bioinspired retina","authors":"F. Perez-Peña, Arturo Morgado Estévez, R. Montero-Gonzalez, A. Linares-Barranco, G. Jiménez-Moreno","doi":"10.5220/0003521701310134","DOIUrl":"https://doi.org/10.5220/0003521701310134","url":null,"abstract":"Nowadays we live in very industrialization world that turns worried about surveillance and with lots of occupational hazards. The aim of this paper is to supply a surveillance video system to use at ultra fast industrial environments. We present an exhaustive timing analysis and comparative between two different Address Event Representation (AER) retinas, one with 64×64 pixel and the other one with 128×128 pixel in order to know the limits of them. Both are spike based image sensors that mimic the human retina and designed and manufactured by Delbruck's lab. Two different scenarios are presented in order to achieve the maximum frequency of light changes for a pixel sensor and the maximum frequency of requested pixel addresses on the AER output. Results obtained are 100 Hz and 1.88 MHz at each case for the 64×64 retina and peaks of 1.3 KHz and 8.33 MHz for the 128×128 retina. We have tested the upper spin limit of an ultra fast industrial machine and found it to be approximately 6000 rpm for the first retina and no limit achieve at top rpm for the second retina. It has been tested that in cases with high light contrast no AER data is lost.","PeriodicalId":103791,"journal":{"name":"Proceedings of the International Conference on Signal Processing and Multimedia Applications","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124433921","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 2
Transmission of low-motion JPEG2000 image sequences using client-driven conditional replenishment
J. J. Sánchez-Hernández, J. Ortiz, V. Ruiz, I. García, D. Muller
{"title":"Transmission of low-motion JPEG2000 image sequences using client-driven conditional replenishment","authors":"J. J. Sánchez-Hernández, J. Ortiz, V. Ruiz, I. García, D. Muller","doi":"10.5220/0003518600110016","DOIUrl":"https://doi.org/10.5220/0003518600110016","url":null,"abstract":"This work proposes a strategy for browsing interactively sequences of high resolution JPEG 2000 remote images. These sequences can be displayed in any order (forward and backward) and following any play/timing pattern. In order to increase the quality of the reconstructions where the retrieved images are only known at the moment of the visualization, this work has proposed and evaluated a novel technique based on conditional replenishment. This solution profits from the SNR/Spatial scalability of JPEG 2000 to determine which regions of the next image should be transmitted and what regions should be reused from the previously reconstructed image. Experimental results demonstrate that, even without motion compensation and with a transmission exclusively controlled by the client, the reconstructions are consistently better, both visually and from a rate-distortion point of view, than those that only remove the spatial redundancy (such as Motion JPEG 2000). Other advantages of our approach are that no data overhead is generated, the computational complexity is very small compared to similar techniques, and the fact that it can be used with any JPIP server.","PeriodicalId":103791,"journal":{"name":"Proceedings of the International Conference on Signal Processing and Multimedia Applications","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122569188","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0