V. Velisavljevic, V. Stanković, Jacob Chakareski, Gene Cheung
{"title":"View and rate scalable multiview image coding with depth-image-based rendering","authors":"V. Velisavljevic, V. Stanković, Jacob Chakareski, Gene Cheung","doi":"10.1109/ICDSP.2011.6005019","DOIUrl":"https://doi.org/10.1109/ICDSP.2011.6005019","url":null,"abstract":"“Texture plus depth” refers to the format where a sender encodes both texture and depth maps at multiple camera-captured viewpoints. Having received such a representation, the decoder can synthesize novel intermediate view images via depth-image-based rendering (DIBR), using as anchors the texture and depth maps of the two closest captured viewpoints. Ideally then, one would optimally allocate available source coding bits among the encoded texture and depth maps, such that the synthesized view distortion is minimized. However, in many practical application scenarios the precise rate constraint may either: i) be unknown at encoding time, or ii) it can take on multiple values for clients of heterogeneous connectivities. In this paper, we propose a flexible codec and an associated bit allocation strategy to address both of these scenarios. In particular, we first present an edge-adaptive wavelet multiview image codec capable of producing a scalable bitstream from which proper subsets can be extracted and decoded at different bit-rates. Given our scalable codec, we then propose a rate allocation algorithm that performs one of the following two actions. The algorithm will either incrementally increase the number of bits for encoding texture or depth maps of already encoded viewpoints, or it will introduce into the scalable representation new texture or depth maps of previously uncoded captured viewpoints. The incremental choice of either refining an existing view or introducing a new one is carried out one layer at a time, such that the associated rate-distortion tradeoff is locally optimized. By employing our novel bit allocation strategy the proposed coder outperforms the state-of-the-art H.264/SVC codec as well as the same wavelet-based coder when armed with a simple suboptimal bit allocation with the same rate allocated to each map, in all coding scenarios studied in our experiments. Furthermore, our coder can achieve an arbitrarily fine granularity of encoding bit rates, while providing the additional functionality of view embedded encoding, unlike the other related coders that we examined.","PeriodicalId":360702,"journal":{"name":"2011 17th International Conference on Digital Signal Processing (DSP)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131015292","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Admission and power control for cognitive radio networks by sequential geometric programming","authors":"E. Dall’Anese, Seung-Jun Kim, G. Giannakis","doi":"10.1109/ICDSP.2011.6005012","DOIUrl":"https://doi.org/10.1109/ICDSP.2011.6005012","url":null,"abstract":"In cognitive radio (CR) networks, power control is an effective means to limit the interference caused by the CRs upon the incumbent primary users (PUs) to ensure cohabitation of the two systems. When all CR links can not be active at the same time due to excessive interference, an admission control mechanism is necessary to schedule the CR links. Key to both tasks is accurate knowledge of the CR-to-PU channel gains. However, CRs generally face difficulties in estimating the channel gains very accurately, often due to lack of explicit support from the PU systems. In this work, admission and power control algorithms are developed to account for channel uncertainty through probabilistic interference constraints. Both log-normal shadowing and small-scale fading effects are considered through suitable approximations. The resulting problems can be solved via sequential geometric programming. The admission control is based on solving feasibility problems, whereby CR links violating the interference constraints the most are dropped progressively. The feasible point thus found to initialize the power control iterative solver.","PeriodicalId":360702,"journal":{"name":"2011 17th International Conference on Digital Signal Processing (DSP)","volume":"643 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129180241","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A collection of benchmark images for traffic sign research","authors":"Xiaohong W. Gao","doi":"10.1109/ICDSP.2011.6004883","DOIUrl":"https://doi.org/10.1109/ICDSP.2011.6004883","url":null,"abstract":"This paper details a collection of traffic signs that has been made available online to meet the current need of lack of standard datasets for evaluation and comparison of existing and future developed recognition systems of traffic signs, leading to the exploration of the breath of innovations and applications in the field. Examples on how to apply these data are also described with comparisons between a number of approaches on segmentations. This collection aims at improving the accuracy of sign recognition systems and expediting the processing speed. Within this remit, the images in the collection opt for 2D still pictures and are taken under controlled viewing conditions on London roads at the UK. The standard sign database employs British road signs (143 images), which are available at http://www.mitime.org/traffic_signs/, together with the collection (128 images) in this study.","PeriodicalId":360702,"journal":{"name":"2011 17th International Conference on Digital Signal Processing (DSP)","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125583732","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Upmixing from mono - A source separation approach","authors":"D. Fitzgerald","doi":"10.1109/ICDSP.2011.6004991","DOIUrl":"https://doi.org/10.1109/ICDSP.2011.6004991","url":null,"abstract":"We present a system for upmixing mono recordings to stereo through the use of sound source separation techniques. The use of sound source separation has the advantage of allowing sources to be placed at distinct points in the stereo field, resulting in more natural sounding upmixes. The system separates an input signal into a number of sources, which can then be imported into a digital audio workstation for upmixing to stereo. Considerations to be taken into account when upmixing are discussed, and a brief overview of the various sound source separation techniques used in the system are given. The effectiveness of the proposed system is then demonstrated on real-world mono recordings.","PeriodicalId":360702,"journal":{"name":"2011 17th International Conference on Digital Signal Processing (DSP)","volume":"143 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123248558","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Wall clutter mitigation based on eigen-analysis in through-the-wall radar imaging","authors":"F. Tivive, M. Amin, A. Bouzerdoum","doi":"10.1109/ICDSP.2011.6004992","DOIUrl":"https://doi.org/10.1109/ICDSP.2011.6004992","url":null,"abstract":"This paper presents an effective approach for mitigating the wall EM returns in through-the-wall radar imaging. The wall returns tend to obscure indoor targets, rendering target detection and classification difficult, if not impossible. The proposed approach recognizes the relative strength of the front wall returns compared to behind-the-wall targets, and uses eigen-structure methods to identify, and then remove the wall subspace that is typically associated with the dominant eigenvalues. The paper provides analyses of wall and target subspace characteristics, dwelling on the underlying property that the wall and target subspaces are, in most cases, spanned by complex sinusoidal components. It is shown that both the wall and the target subspaces can be of multiple dimensions. The paper demonstrates, using simulated and real data, the effectiveness of the proposed approach and compares its performance to that of background subtraction.","PeriodicalId":360702,"journal":{"name":"2011 17th International Conference on Digital Signal Processing (DSP)","volume":"97 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116072944","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
G. Mileounis, N. Kalouptsidis, B. Babadi, V. Tarokh
{"title":"Blind identification of sparse channels and symbol detection via the EM algorithm","authors":"G. Mileounis, N. Kalouptsidis, B. Babadi, V. Tarokh","doi":"10.1109/ICDSP.2011.6004920","DOIUrl":"https://doi.org/10.1109/ICDSP.2011.6004920","url":null,"abstract":"In this paper, we address the problem of blind identification of sparse channels. For this purpose, the Expectation- Maximization is modified to accommodate channel sparsity. The resulting algorithm is applicable for linear and nonlinear channels. Computer simulations on various channel set ups illustrate that the proposed algorithm achieves performance close to the genie-aided estimator.","PeriodicalId":360702,"journal":{"name":"2011 17th International Conference on Digital Signal Processing (DSP)","volume":"98 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123752259","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
S. Yasakethu, S. Worrall, D. V. S. X. D. Silva, W.A.C. Fernando, A. Kondoz
{"title":"A compound depth and image quality metric for measuring the effects of packet loss on 3D video","authors":"S. Yasakethu, S. Worrall, D. V. S. X. D. Silva, W.A.C. Fernando, A. Kondoz","doi":"10.1109/ICDSP.2011.6004998","DOIUrl":"https://doi.org/10.1109/ICDSP.2011.6004998","url":null,"abstract":"Interest in 3D video has surged in recent years. However, efforts to improve the quality of compression and transmission schemes are severely hampered by a lack of effective quality evaluation metrics. This is a particularly severe problem for researchers trying to improve the robustness of video transmission to packet loss. Subjective tests for evaluating error robustness present huge requirements in terms of time and resources. To solve this problem, this paper presents a quality metric for 3D video, and evaluates its effectiveness for the measurement of quality in the presence of packet loss. A key feature of the work is the use of depth planes to enable the metric to better model how the Human Visual System (HVS) perceives 3D video. The quality metric results are compared with subjective test results. The correlation between the proposed quality metric and the subjective test results is shown to be stronger than standard quality metrics, such as Video Quality Metric (VQM).","PeriodicalId":360702,"journal":{"name":"2011 17th International Conference on Digital Signal Processing (DSP)","volume":"62 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122751384","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Improved Tomlinson-Harashima precoding with interference optimization","authors":"C. Masouros, M. Sellathurai, T. Ratnarajah","doi":"10.1109/ICDSP.2011.6004953","DOIUrl":"https://doi.org/10.1109/ICDSP.2011.6004953","url":null,"abstract":"This paper investigates a novel strategy for reducing the power loss in Tomlinson-Harashima precoding (THP) based on optimising the interference to be canceled. The proposed scheme is motivated by the fact that, since both the desired and interfering signals originate from the base station (BS) of the downlink system, the resulting interference can be influenced to reduce the transmission power required to cancel it, without altering the information content of the downlink message. This optimization aims at bringing the interference closer to the replica of the desired symbol in the modulo-extended constellation. By doing so, the quantized distance between the useful signal and interference is reduced and therefore the power required to pre-subtract interference is decreased. Here, the above concept is applied to a practical THP design for multiple input multiple output systems (MIMO-THP). The results show a considerable transmit power reduction compared to conventional THP.","PeriodicalId":360702,"journal":{"name":"2011 17th International Conference on Digital Signal Processing (DSP)","volume":"59 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121877578","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Identification of roofs perimeter from aerial and satellite images","authors":"E. Angiati, S. Dellepiane","doi":"10.1109/ICDSP.2011.6005016","DOIUrl":"https://doi.org/10.1109/ICDSP.2011.6005016","url":null,"abstract":"The computer-driven extraction of roofs perimeter is a very vimportant step in 3D reconstruction of buildings as it allows vto reduce human interactions. In this context, very high vresolution optical images, from airborne surveys or from vsatellites, were used in this paper with the aim of extracting vsuch information.","PeriodicalId":360702,"journal":{"name":"2011 17th International Conference on Digital Signal Processing (DSP)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132734617","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
M. Hildebrandt, R. Merkel, M. Leich, Stefan Kiltz, J. Dittmann, C. Vielhauer
{"title":"Benchmarking contact-less surface measurement devices for fingerprint acquisition in forensic investigations: Results for a differential scan approach with a chromatic white light sensor","authors":"M. Hildebrandt, R. Merkel, M. Leich, Stefan Kiltz, J. Dittmann, C. Vielhauer","doi":"10.1109/ICDSP.2011.6004969","DOIUrl":"https://doi.org/10.1109/ICDSP.2011.6004969","url":null,"abstract":"With the advent of new contact-less sensors for forensic investigations of latent fingerprint traces, the authors see the need for a benchmarking framework to evaluate existing devices and promising combinations of data acquisition and signal processing techniques. This paper extends the existing benchmarking framework from [1] by categorizing it into properties from a forensic point-of-view (end-user) and a technical point-of-view (scientific-user) and applies a known differential image technique for the subjective evaluation of which traces are visible. We show exemplary results for a chromatic white light (CWL) sensor for the surface quality assessment, using and comparing the experimental setup of 10 surfaces from [1] and additional 10 surfaces, including real-world objects, to determine its potential for detecting latent fingerprints. Using a differential image approach, the particular influence of sensor noise signals is analyzed, showing that this differential approach cannot always be considered as an ideal filter for fingerprint pattern detection.","PeriodicalId":360702,"journal":{"name":"2011 17th International Conference on Digital Signal Processing (DSP)","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134459035","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}