{"title":"Nonlinear Speech Coding Using Backward Adaptive Variable-Length Quadratic Filters","authors":"G. Alipoor, M. Savoji","doi":"10.1109/ISPA.2007.4383687","DOIUrl":"https://doi.org/10.1109/ISPA.2007.4383687","url":null,"abstract":"The ADTCM coding technique with nonlinear prediction based on quadratic Volterra filters is examined using backward prediction schemes based on LMS and RLS algorithms. Utilizing backward adaptive quadratic filters in ADTCM based speech coding, by itself, does not result in an overall improvement in the quality of reconstructed signal in comparison with a linear scheme using the same bit rate. However, it is shown that a scheme can be developed in which, for each frame of constant length, a set of quadratic filters with different memory sizes is examined and the nonlinear filter resulting in best improved quality is decided on. The identifying code of the selected filter is sent to the decoder along with the quantized residual signals. The simulation results show that the proposed scheme results in a good improvement (up to 2 dB) in the overall quality of the reconstructed speech signal. This improvement is achieved at the cost of a slight increase in the bit rate and a small delay.","PeriodicalId":112420,"journal":{"name":"2007 5th International Symposium on Image and Signal Processing and Analysis","volume":"51 2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128941004","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Non-Linear Variable Selection in a Regression Context","authors":"S. I. Hill","doi":"10.1109/ISPA.2007.4383734","DOIUrl":"https://doi.org/10.1109/ISPA.2007.4383734","url":null,"abstract":"A Bayesian approach to variable selection in a regression context is presented. This aims to find which of a large number of input variables are the important ones in that they contribute to the given regression output. This approach is unlike many in the literature which focus more on features, and do not explicitly seek to include prior belief that many of the input variables do not contribute any information. The EM methodology presented enables this to be done in a nonlinear regression framework, in particular that of kernel regression. An initial experiment on a biscuit dough problem is presented.","PeriodicalId":112420,"journal":{"name":"2007 5th International Symposium on Image and Signal Processing and Analysis","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122370866","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Directional Spatial Color Descriptor in a Perceptual Model: Proximity Grids","authors":"S. Kiranyaz, M. Birinci, M. Gabbouj","doi":"10.1109/ISPA.2007.4383675","DOIUrl":"https://doi.org/10.1109/ISPA.2007.4383675","url":null,"abstract":"Most of the color features widely used in content-based image retrieval (CBIR) present severe limitations and drawbacks due to their inefficiency of modeling human visual system on color perception. Accordingly, they are not capable of characterizing both spatial and global properties of the color composition in visual scenery. In this paper, we present a perceptual color feature, which describes the global properties of the prominent colors along with a directional spatial descriptor, called as Proximity Grids. In color domain the dominant colors are extracted along with their global properties and quad-tree decomposition partitions the image so as to characterize the spatial color distribution (SCD). This approach is in accordance with the well-known Gestalt law, i.e. utilizing a top-down approach in order to model (see) the whole color composition before its parts and in this way we can avoid the problems of pixel-based approaches. The proximity grids, which cumulate the spatial co-occurrence of colors in a 2D grid, can successfully model the SCD of the prominent colors with respect to inter-color proximities and directions. Fusing both global and spatial properties forms the final descriptor, which is neither biased nor become noisy from the presence of such color elements, the so-called outliers that are not visible for humans in both spatial and color domains. Finally a penalty-trio model cumulates the differences among the color properties in a similarity distance computation during retrieval. Comparative evaluations against well-known global and spatial descriptors demonstrate the superiority of the proposed descriptor.","PeriodicalId":112420,"journal":{"name":"2007 5th International Symposium on Image and Signal Processing and Analysis","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128433143","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Optimal Neighborhood Sequences on the Hexagonal Grid","authors":"B. Nagy","doi":"10.1109/ISPA.2007.4383711","DOIUrl":"https://doi.org/10.1109/ISPA.2007.4383711","url":null,"abstract":"Neighborhood sequences play a very important role in digital image processing. In this paper we give some new results from this area on the hexagonal grid. Digital distances are used to approximate the Euclidean one. The approximation can be done through digital discs (circles). We obtain optimal neighborhood sequences defining the digital circles closest to the Euclidean circle. It is known that there are non-metrical distances defined by neighborhood sequences; moreover, there is a neighborhood relation which is useless with respect to the digital Jordan property of curves. Optimal neighborhood sequences and digital circles are presented with metric properties and/or with only those types of neighborhood relations which play a role for Jordan curves.","PeriodicalId":112420,"journal":{"name":"2007 5th International Symposium on Image and Signal Processing and Analysis","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126101631","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Improved Non-parametric Subtraction for Detection of Wafer Defect","authors":"Hye Won Kim, S. Yoo","doi":"10.1109/ISPA.2007.4383738","DOIUrl":"https://doi.org/10.1109/ISPA.2007.4383738","url":null,"abstract":"Automated defect inspection for wafers has been developed since the 1990s to replace defect detection by human eye, for low cost and high quality. In wafer defect inspection, defects are detected by comparing an inspected die with a reference die. Referential methods compare against a reference image by computing the pixel-by-pixel intensity difference between a reference image and an inspected image, or by measuring the similarity between the two images using normalized cross correlation or eigenvalues. These methods are problematic for defect detection due to illumination change, noise and alignment error. To reduce the sensitivity to illumination change and noise, a new image subtraction called non-parametric subtraction was proposed. Non-parametric subtraction can solve the problems of illumination change and noise, but sensitivity to alignment remains unsolved. This paper introduces a new approach, less sensitive to alignment, using non-parametric subtraction for wafer defect inspection.","PeriodicalId":112420,"journal":{"name":"2007 5th International Symposium on Image and Signal Processing and Analysis","volume":"59 3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133150945","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Minimal Cost-Path for Path-Based Distances","authors":"R. Strand, F. Malmberg, S. Svensson","doi":"10.1109/ISPA.2007.4383723","DOIUrl":"https://doi.org/10.1109/ISPA.2007.4383723","url":null,"abstract":"Distance functions defined by the minimal cost-path using weights and neighbourhood sequences (n.s.) are considered for the constrained distance transform (CDT). The CDT is then used to find one minimal cost-path between two points. The behaviour of some path-based distance functions is analyzed and a new error function is introduced. It is concluded that the weighted n.s.-distance with two weights (3 × 3 neighbourhood) and the weighted distance with three weights (5 × 5 neighbourhood) have similar properties in terms of minimal cost-path computation, while the former is more efficient to compute.","PeriodicalId":112420,"journal":{"name":"2007 5th International Symposium on Image and Signal Processing and Analysis","volume":"236 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132031773","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"SIFT-CCH: Increasing the SIFT distinctness by Color Co-occurrence Histograms","authors":"Cosmin Ancuti, P. Bekaert","doi":"10.1109/ISPA.2007.4383677","DOIUrl":"https://doi.org/10.1109/ISPA.2007.4383677","url":null,"abstract":"Describing regions in a distinctive way, in order to find correct correspondences in images of two separated views, represents a complex and essential task of computer vision. Until now, SIFT (Scale Invariant Feature Transform) has been proven to be the most reliable descriptor among the others. One of the main drawbacks of SIFT is its vulnerability to color images, being designed mainly for the gray images. To overcome this problem and also to increase the overall distinctness of the SIFT in this paper we introduce a new descriptor that combines the SIFT approach with the color co-occurrence histograms (CCH), a concept used extensively in color texture retrieval and object recognition applications. We evaluate the new descriptor in the context of image matching. The experimental results show that our descriptor outperforms the original version, detecting an important number of additional correct matched feature points while the mismatch ratio remains constant.","PeriodicalId":112420,"journal":{"name":"2007 5th International Symposium on Image and Signal Processing and Analysis","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132631206","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An Accuracy Optimization of a Dialog ASR System Utilizing Evolutional Strategies","authors":"J. Kacur, J. Korosi","doi":"10.1109/ISPA.2007.4383686","DOIUrl":"https://doi.org/10.1109/ISPA.2007.4383686","url":null,"abstract":"This article deals with the accuracy optimization of a dialog ASR system based on the ATK tool, where the optimization is performed by means of evolutional strategies. First, an introductory overview of the basic capabilities of the ATK system is given, followed by the founding principles and variations of the evolutional strategies applied to nonlinear optimization problems. Then the process of ASR optimization is presented and supported by a series of experiments. By applying stochastic optimization methods to the mutual adjustment of Viterbi decoder, speech detection, and grammar related settings, we were able to achieve improved overall performance (WER 18.72%) compared to the manual settings chosen by tests and gained \"experience\" (generally WER above 23%).","PeriodicalId":112420,"journal":{"name":"2007 5th International Symposium on Image and Signal Processing and Analysis","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127856186","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Reference Point Detection Algorithm for Top-View Finger Image Recognition","authors":"P. Chaikan, M. Karnjanadecha","doi":"10.1109/ISPA.2007.4383717","DOIUrl":"https://doi.org/10.1109/ISPA.2007.4383717","url":null,"abstract":"This paper describes an algorithm for automatic reference point detection in a top-view finger image recognition system. In tests of 700 finger images, only 6 images were rejected by our algorithm. A reference point location error correction technique was developed to improve the recognition accuracy. When using the proposed algorithm, the accuracy of the top-view finger image identification system was only reduced to 93.80% compared to 96.57% when using a manually defined reference point. This shows the feasibility of using top-view finger images to increase the recognition accuracy of fingerprint identification.","PeriodicalId":112420,"journal":{"name":"2007 5th International Symposium on Image and Signal Processing and Analysis","volume":"41 4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130656050","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An Assessment of Tikhonov Regularizations to Evoked Brain Signals","authors":"S. Aydin","doi":"10.1109/ISPA.2007.4383699","DOIUrl":"https://doi.org/10.1109/ISPA.2007.4383699","url":null,"abstract":"The current work addresses two types of Tikhonov regularization to extract a template evoked potential (EP) signal from a small number of noisy records. The subspace regularization technique (SRT) has been used in the literature toward the same goal, but without comparison against the standard-form Tikhonov regularization technique (STRT). Both methods are tested in experimental studies and simulations. The signal-to-noise ratio (SNR) improvement is used as the error criterion. No superiority of the SRT was observed in the results. In addition, the STRT is found to be computationally less complex. The STRT method is optimum for smooth solutions whereas the SRT allows sharp variations in the solutions. Thus, we propose the use of the STRT instead of the SRT for template auditory EP estimation.","PeriodicalId":112420,"journal":{"name":"2007 5th International Symposium on Image and Signal Processing and Analysis","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134445966","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}