Hideki Kawahara, M. Morise, Toru Takahashi, T. Irino, Hideki Banno, O. Fujimura
{"title":"Group delay for acoustic event representation and its application for speech aperiodicity analysis","authors":"Hideki Kawahara, M. Morise, Toru Takahashi, T. Irino, Hideki Banno, O. Fujimura","doi":"10.5281/ZENODO.40659","DOIUrl":"https://doi.org/10.5281/ZENODO.40659","url":null,"abstract":"A new framework is proposed for representing acoustic events based on bandwise durations derived from a group delay function and bandwise aperiodicity indices. The goal is to provide an efficient and detailed source information for a high-quality speech manipulation system, STRAIGHT. The proposed representation enables event based processing of speech parameters and provides means to fill the gap between waveform based methods and VOCODERs in a perceptually relevant manner. Simulations using a pulse plus noise source and a time varying filter demonstrated that the proposed method provides accurate estimates of the source aperiodicity. Application of the proposed method to STRAIGHT illustrated that it enables significant reduction in storage size and improves reproduced sound quality.","PeriodicalId":176384,"journal":{"name":"2007 15th European Signal Processing Conference","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121664656","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Modified leaky delayed LMS algorithm for imperfect estimate system delay","authors":"Juan R. V. Lopez, O. J. Tobias, R. Seara","doi":"10.5281/ZENODO.40279","DOIUrl":"https://doi.org/10.5281/ZENODO.40279","url":null,"abstract":"This paper proposes a modified leaky delayed least-mean-square (MLDLMS) algorithm, aiming to circumvent algorithm instability problems under imperfect system delay estimates. In addition, a model for the first and second moments of the algorithm is proposed. Such a model is obtained without invoking the independence theory and considering a slow adaptation condition. Numerical simulations corroborate the very good agreement between the results obtained with the Monte Carlo method and those from the proposed model for colored Gaussian inputs.","PeriodicalId":176384,"journal":{"name":"2007 15th European Signal Processing Conference","volume":"72 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121711581","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Low complexity covariance-based DOA estimation algorithm","authors":"T. Ferreira, S. L. Netto, P. Diniz","doi":"10.5281/ZENODO.40224","DOIUrl":"https://doi.org/10.5281/ZENODO.40224","url":null,"abstract":"The aim of this work is to present an alternative method for estimating the direction-of-arrival (DoA), that is, the incoming angle, of a signal impinging on an antenna array. The proposed method is similar to ESPRIT (estimation of signal parameters via rotational invariance techniques) algorithm, which is the most widely used technique for this application. The new algorithm exploits the structural similarities between ESPRIT and the Tong-Xu-Kailath method for blind channel equalization. The result is an ESPRIT-like algorithm for DoA estimation with substantially reduced computational complexity. Simulation results are included to verify the properties and performance of the new covariance-based DoA algorithm, in comparison to ESPRIT and to the theoretical Cramer-Rao lower bound.","PeriodicalId":176384,"journal":{"name":"2007 15th European Signal Processing Conference","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123818552","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Determination of synthetic covariance matrices — An application to GPS monitoring measurements","authors":"V. Schwieger","doi":"10.5281/ZENODO.40440","DOIUrl":"https://doi.org/10.5281/ZENODO.40440","url":null,"abstract":"This paper deals with a method to model variances and correlations for measurement quantities. The model of elementary errors is the base for this approach that leads to synthetic covariance matrices. Frequently measurements are correlated and these correlations influence the following processing and the respective results. The modelling and the influence on the results are presented on the base of precise measurements using the Global Positioning System (GPS). Due to the fact that monitoring measurements should lead to the detection of displacements and deformations within the level of cm up to mm the correct modelling of the correlations is important. The standard deviation of the displacement vector using GPS measurements for the detection of tectonic movements in Romania may be smaller up to 69 %. This allows the significant detection of smaller displacements.","PeriodicalId":176384,"journal":{"name":"2007 15th European Signal Processing Conference","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123821833","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Optimized path planning for UAVS with AOA/scan based sensors","authors":"K. Doğançay","doi":"10.5281/ZENODO.40600","DOIUrl":"https://doi.org/10.5281/ZENODO.40600","url":null,"abstract":"In emitter localization by unmanned aerial vehicles (UAVs) the objective of path planning is to determine the best UAV trajectories so as to maximize the instantaneous localization performance subject to various constraints. In this paper we propose gradient based waypoint update algorithms for UAVs equipped with angle-of-arrival (AOA) and scan based sensors. The optimization criterion used by the waypoint update algorithms is to maximize the determinant of the approximate Fisher information matrix. The effectiveness of the path planning algorithms is illustrated with several computer simulations.","PeriodicalId":176384,"journal":{"name":"2007 15th European Signal Processing Conference","volume":"428 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122869099","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Hypercomplex analytic signals : Extension of the analytic signal concept to complex signals","authors":"S. Sangwine, N. L. Bihan","doi":"10.5281/ZENODO.40330","DOIUrl":"https://doi.org/10.5281/ZENODO.40330","url":null,"abstract":"The analytic signal is a complex signal derived from a real signal such that its real part is identical to the original real signal, and its imaginary part is in quadrature (orthogonal) to the original signal. The analytic signal permits the envelope of the original signal to be computed, and it also admits the definition of an instantaneous frequency and phase. In this paper we present some initial results on extending this idea to the case of a complex signal using a hypercomplex analytic signal. We show that using the hypercomplex analytic signal it is possible to calculate a complex envelope of the original complex signal and that the modulus of this complex envelope is the envelope of the modulus of the original signal.","PeriodicalId":176384,"journal":{"name":"2007 15th European Signal Processing Conference","volume":"46 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127772625","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
P. Salmela, Ruirui Gu, S. Bhattacharyya, J. Takala
{"title":"Efficient parallel memory organization for turbo decoders","authors":"P. Salmela, Ruirui Gu, S. Bhattacharyya, J. Takala","doi":"10.5281/ZENODO.40373","DOIUrl":"https://doi.org/10.5281/ZENODO.40373","url":null,"abstract":"An efficient turbo decoder must access memory in parallel and with two different access patterns. It is shown that the problem of accessing memory both with sequential and interleaved access patterns is analogous to the graph coloring problem. The derivation proves that the obtained graph is bipartite and, therefore, only two memory banks are required in theory. For practical implementations, a system with four memory modules and a buffer is proposed. It is shown that modest buffer length is sufficient for 3GPP standard interleavers. There is no performance degradation in the proposed system and the address generation and memory interfaces are of modest complexity.","PeriodicalId":176384,"journal":{"name":"2007 15th European Signal Processing Conference","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132340832","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Extracting distorted grid points for compensation of lens radial nonlinearities","authors":"A. Nowakowski, W. Skarbek","doi":"10.5281/ZENODO.40229","DOIUrl":"https://doi.org/10.5281/ZENODO.40229","url":null,"abstract":"A novel method for extracting distorted grid points for compensation of lens radial nonlinearities is presented. It is based on identification of homographic transformation using single image of dense planar chessboard pattern. Undistorted grid image is determined from the central part of the grid and used to find the radial distortion model by linear least square method (LSM). The model is used for dense compensation by bilinear interpolation.","PeriodicalId":176384,"journal":{"name":"2007 15th European Signal Processing Conference","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134523645","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Stereoscopic images quality assessment","authors":"P. Campisi, P. Callet, Enrico Marini","doi":"10.5281/ZENODO.40637","DOIUrl":"https://doi.org/10.5281/ZENODO.40637","url":null,"abstract":"Although several metrics have been proposed in literature to assess the perceptual quality of bidimensional images, no similar effort has been devoted to quality assessment of stereoscopic images. Therefore, in this paper, we propose a methodology for subjective assessment of stereo images. Moreover, in the process of defining an objective metric specifically designed for stereoscopic images, we evaluate whether 2-D image quality objective metrics are also suited for quality assessment of stereo images. Specifically, distortions deriving from both coding and blurring are taken into account and the quality degradation of the stereo pair is estimated.","PeriodicalId":176384,"journal":{"name":"2007 15th European Signal Processing Conference","volume":" 18","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114087805","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A study of temporal structure of glottal flow derivative estimates obtained via inverse filtering","authors":"E. Turajlić, S. Vaseghi","doi":"10.5281/ZENODO.40625","DOIUrl":"https://doi.org/10.5281/ZENODO.40625","url":null,"abstract":"This paper presents a comparative study of the temporal structure of the glottal flow derivative estimates in relation to an idealized view of voice source realizations as defined by Liljencrants-Fant's model. Specifically, we endeavor to ascertain the extent by which Liljencrants-Fant's model can be used to represent the glottal flow derivative estimates obtained via closed-phase pitch synchronous inverse filtering of recorded speech. The study includes several phonation types and two examples of voice pathology. The study has established the following. Due to the limited degrees of freedom, Liljencrants-Fant's model is only capable of adequately representing the “coarse” glottal pulse structure. The “fine” structural elements can constitute a considerable part of a glottal flow derivative realization, and we have presented evidence that they contain information related to voice individuality. In addition, we have shown that LF-parameters do not always accurately portray significant events in the vocal fold dynamics.","PeriodicalId":176384,"journal":{"name":"2007 15th European Signal Processing Conference","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114528925","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}