{"title":"MPEG-7 Audio Spectrum Basis as a signature of violin sound","authors":"Aleksander Kaminiarz, E. Lukasik","doi":"10.5281/ZENODO.40517","DOIUrl":"https://doi.org/10.5281/ZENODO.40517","url":null,"abstract":"The goal of the paper is to examine how robust MPEG-7 Audio Spectrum Basis features are as signatures of instruments from the same group. Instruments analyzed are contemporary concert violins competing in the international violinmaker competition. They have been recorded for research purposes, thus the set of sounds for each instrument and recording conditions are the same - 30 s long musical excerpts and a set of individual sounds. Audio Spectrum Basis captures the statistically most regular features of the sound feature space thus it has been expected to well characterize instruments. The results confirmed the expectations. Since violinmakers follow the same ideal model of instrument construction and use similar material for their creation, differences of their sound are tiny, Audio Spectrum Basis enabled discrimination of several instruments as more dissimilar then the others. However these outliers have been placed by jury musicians during competition on both boundaries of the ranking.","PeriodicalId":176384,"journal":{"name":"2007 15th European Signal Processing Conference","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117121848","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Signal-dependent noise removal in pointwise shape-adaptive DCT domain with locally adaptive variance","authors":"A. Foi, V. Katkovnik, K. Egiazarian","doi":"10.5281/ZENODO.40647","DOIUrl":"https://doi.org/10.5281/ZENODO.40647","url":null,"abstract":"This paper presents a novel effective method for denoising of images corrupted by signal-dependent noise. Denoising is performed by coefficient shrinkage in the shape-adaptive DCT (SA-DCT) transform-domain. The Anisotropic Local Polynomial Approximation (LPA) - Intersection of Confidence Intervals (ICI) technique is used to define the shape of the transform's support in a pointwise adaptive manner. The use of such an adaptive transform support enables both a simpler modelling of the noise in the transform domain and a sparser decomposition of the signal. Consequently, coefficient shrinkage is very effective and the reconstructed estimate's quality is high, in terms of both numerical error-criteria and visual appearance, with sharp detail preservation and clean edges. Simulation experiments demonstrate the superior performance of the proposed algorithm for a wide class of noise models with a signal-dependent variance, including Poissonian (photon-limited imaging), film-grain, and speckle noise.","PeriodicalId":176384,"journal":{"name":"2007 15th European Signal Processing Conference","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129687788","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
R. Gil-Pita, M. Rosa-Zurera, R. Vicen-Bueno, F. López-Ferreras
{"title":"A new algorithm for fast search of the k nearest patterns","authors":"R. Gil-Pita, M. Rosa-Zurera, R. Vicen-Bueno, F. López-Ferreras","doi":"10.5281/ZENODO.40590","DOIUrl":"https://doi.org/10.5281/ZENODO.40590","url":null,"abstract":"The computational cost associated to the k-nearest neighbor classifier depends on the amount of available patterns, which makes this method impractical in many real-time applications. This fact makes interesting the study of fast algorithms for finding the k-nearest patterns, like, for example, the kLAESA algorithm. In this paper we propose a novel algorithm for finding the k-nearest patterns, denominated k-tuned approximating and eliminating search algorithm (kTAESA). The algorithm is used to implement kNN classifiers, which are applied to three databases from the UCI machine learning benchmark repository. Results are compared with those achieved by the exhaustive search, the kAESA and the kLAESA algorithms, in terms of number of distances to evaluate, number of simple operations (sums, comparisons and products) needed to classify each pattern, and amount of required memory. Results demonstrate the best performance of the proposal, mainly when the number of operations is considered.","PeriodicalId":176384,"journal":{"name":"2007 15th European Signal Processing Conference","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127236594","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Olivier Crave, C. Guillemot, B. Pesquet-Popescu, C. Tillier
{"title":"Robust video transmission based on distributed multiple description coding","authors":"Olivier Crave, C. Guillemot, B. Pesquet-Popescu, C. Tillier","doi":"10.5281/ZENODO.40495","DOIUrl":"https://doi.org/10.5281/ZENODO.40495","url":null,"abstract":"This paper proposes systematic lossy description coding for robust video transmission over error-prone channels. The problem of error propagation is addressed here by first structuring the data to be encoded into two descriptions. In a first approach, the two descriptions are constructed by splitting odd from even frames. Each description is separated into two sub-sequences, one being conventionally-encoded, the other one being coded with a Wyner-Ziv encoder. This amounts to having a systematic lossy Wyner-Ziv coding of every other frame of each description. This error control system can be used as an alternative to Automatic Repeat reQuest (ARQ) or Forward Error Correction (FEC), i.e. the additional bitstream can be systematically sent to the decoder or can be requested, similarly to an ARQ request. The amount of redundancy is mostly controlled by the quantization of the Wyner-Ziv data. This first approach leads to satisfactory lateral rate-distortion performance, however suffers from high redundancy which penalizes the central description. To cope with this problem, the approach is then extended to the use of motion-compensated temporal filtering (MCTF) for the Wyner-Ziv frames, in which case only the low-frequency subbands are WZ-coded and sent in the descriptions.","PeriodicalId":176384,"journal":{"name":"2007 15th European Signal Processing Conference","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132458243","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Performance evaluation of mobile video quality estimators","authors":"M. Ries, O. Nemethova, M. Rupp","doi":"10.5281/ZENODO.40236","DOIUrl":"https://doi.org/10.5281/ZENODO.40236","url":null,"abstract":"Provisioning of mobile video streaming is hitting toward to limitations in channel quality and capacity as well as in terminal processing power. These known limitations, network settings, and video content influence the end user quality. In this article we investigate the estimation of perceived video quality for mobile streaming scenarios. Firstly, we analyze streaming content and usage scenarios. Secondly, we define objective video parameters which reflect the sequence motion character and its content. Finally, video quality estimation methods based on these parameters are developed and compared with common methods. The presented results show that the proposed approach provides powerful solutions for automatic subjective video quality estimation.","PeriodicalId":176384,"journal":{"name":"2007 15th European Signal Processing Conference","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126861943","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Particle swarm optimization for time-difference-of-arrival based localization","authors":"K. Lui, Jun Zheng, H. So","doi":"10.5281/ZENODO.40289","DOIUrl":"https://doi.org/10.5281/ZENODO.40289","url":null,"abstract":"Time-difference-of-arrival (TDOA) based source localization has been intensively studied and broadly applied in many fields. In this paper, particle swarm optimization (PSO) is employed for positioning with TDOA measurements in the circumstances of known and unknown propagation speed. The optimization criterion is first developed and the PSO technique is then employed to search the global minimum of the cost function. For sufficiently small noise conditions, simulation results show that the PSO approach provides accurate source location estimation for both known and unknown propagation speed, and also gives an efficient speed estimate in the later case.","PeriodicalId":176384,"journal":{"name":"2007 15th European Signal Processing Conference","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126873145","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An improved stochastic model of the NLMS algorithm for correlated input data","authors":"J. Kolodziej, O. J. Tobias, R. Seara","doi":"10.5281/ZENODO.40278","DOIUrl":"https://doi.org/10.5281/ZENODO.40278","url":null,"abstract":"This paper proposes an improved stochastic model for the normalized least-mean-square (NLMS) algorithm considering correlated input signals obtained from a spherically invariant random process (SIRP). A SIRP describes both Gaussian and a wide class of non-Gaussian processes, including the ones with Laplacian, K0, and Gamma marginal density functions. Hence an approximate procedure for computing high-order hyperelliptic integrals arisen from the modeling process is developed. The resulting model outperforms other existing models discussed in the open literature. Through numerical simulations the accuracy of the proposed model is verified.","PeriodicalId":176384,"journal":{"name":"2007 15th European Signal Processing Conference","volume":"211 1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114146160","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Efficient implementation of the HMARM model identification and its application in spectral analysis","authors":"Chunjian Li, S. Andersen","doi":"10.5281/ZENODO.40364","DOIUrl":"https://doi.org/10.5281/ZENODO.40364","url":null,"abstract":"The Hidden Markov Auto-Regressive model (HMARM) has recently been proposed to model non-Gaussian AutoRegressive signals with hidden Markov-type driving noise. This model has been shown to be suitable to many signals, including voiced speech and digitally modulated signals received through ISI channels. The HMARM facilitates a blind system identification algorithm that has a good computational efficiency and data efficiency. In this paper, we solve an implementation issue of the HMARM identification, which can otherwise degrade the efficiency of the model and hinder extensive evaluations of the algorithm. Then we study in more detail the properties associated with the autoregressive (AR) spectral analysis for signals of interest.","PeriodicalId":176384,"journal":{"name":"2007 15th European Signal Processing Conference","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116257715","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
N. Ince, Fikri Goksu, A. Tewfik, I. Onaran, A. Cetin
{"title":"Subset selection with structured dictionaries in classification","authors":"N. Ince, Fikri Goksu, A. Tewfik, I. Onaran, A. Cetin","doi":"10.5281/ZENODO.40588","DOIUrl":"https://doi.org/10.5281/ZENODO.40588","url":null,"abstract":"This paper describes a new approach for the selection of discriminant time-frequency features for classification. Unlike previous approaches that use the individual discrimination power of expansion coefficients, the proposed approach selects a subset of features by implementing a classifier directed pruning of an initial redundant set of candidate features. The candidate features are calculated from a structured redundant time-frequency analysis of the signal, such as an undecimated wavelet transform. We show that the proposed approach has a performance that is as good as or better than traditional classification approaches while using a much smaller number of features. In particular, we provide experimental results to demonstrate the superior performance of the algorithm in the area of impact acoustic classification for food kernel inspection. The proposed algorithm achieved 91.8% and 98.5% classification accuracies in separating open shell from closed shell pistachio nuts and discriminating between empty and full hazelnuts respectively. Traditional methods used in this area resulted in 82% and 97% classification accuracies respectively.","PeriodicalId":176384,"journal":{"name":"2007 15th European Signal Processing Conference","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115976147","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Coverage and density of a low power, low data rate, spread spectrum wireless sensor network for agricultural monitoring","authors":"L. Crockett, E. Pfann, R. Stewart","doi":"10.5281/ZENODO.40620","DOIUrl":"https://doi.org/10.5281/ZENODO.40620","url":null,"abstract":"A physical layer specification for a low power, low complexity, low data rate sensor network suitable for agricultural monitoring is investigated. Code division multiple access (CDMA) with high processing gain is used to facilitate transmission powers which comply with the Ultra Wide Band (UWB) spectral mask, and this permits physically small nodes with limited energy storage capacity. The interference arising from each node is calculated, and it is shown that for the investigated scenario and specification, an aggregate data rate of 2 bytes per minute and a node population of approximately 1000 can be supported at distances up to a few kilometres from the central node, with less than 0.2% chance of failure due to multiple access interference.","PeriodicalId":176384,"journal":{"name":"2007 15th European Signal Processing Conference","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116615488","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}