{"title":"Accurate speech decomposition into periodic and aperiodic components based on Discrete Harmonic Transform","authors":"P. Zubrycki, A. Petrovsky","doi":"10.5281/ZENODO.40682","DOIUrl":"https://doi.org/10.5281/ZENODO.40682","url":null,"abstract":"This paper presents a new method for the speech signal decomposition into periodic and aperiodic components. Proposed method is based on the Discrete Harmonic Transform (DHT). This transformation is able to analyze the signal spectrum in the harmonic domain. Another feature of the DHT is its ability to synchronize the transformation kernel with the time-varying pitch frequency. The system works without a priori knowledge about the pitch track. Unlike the most applications proposed method estimates the fundamental frequency changes within a frame before estimating fundamental the frequency itself. Periodic component is modelled as a sum of harmonically related sinusoids and for accurate estimation of the amplitudes and initial phases DHT is used. Aperiodic component is defined as a difference between the original speech and the estimated periodic component.","PeriodicalId":176384,"journal":{"name":"2007 15th European Signal Processing Conference","volume":"124 3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129605517","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"ASPROC: Adaptive smith predictor-based rate oscillation cancellation","authors":"A. Szwabe","doi":"10.5281/ZENODO.40322","DOIUrl":"https://doi.org/10.5281/ZENODO.40322","url":null,"abstract":"The paper presents a control-theoretic approach to the design of a rate oscillation cancellation module. The main objective was to develop an effective method applicable to an audiovisual streaming system operating in an IP network. The proposed discrete-time nonlinear model represents functions of a real stream adaptation system based on the widely used RTP/RTCP protocol set. The paper demonstrates how the model had been transformed into a linear time-invariant (LTI) form and used in analysis of the system's dynamic characteristics. The criteria for the optimal system configuration refer to representation of the transfer function's roots on the Z-transform plane. It has been demonstrated how the proposed optimization method enables development of a stable rate control module which effectively cancels oscillations in the most demanding test scenario.","PeriodicalId":176384,"journal":{"name":"2007 15th European Signal Processing Conference","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123493976","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Multimodal medical image registration using a novel implementation of the ICP algorithm","authors":"A. Almhdie, C. Léger, Mohamed Deriche, R. Lédée","doi":"10.5281/ZENODO.40670","DOIUrl":"https://doi.org/10.5281/ZENODO.40670","url":null,"abstract":"Image registration is a valuable technique for medical diagnosis and treatment. In this paper, we present an enhanced implementation of the popular iterative closest point (ICP) algorithm developed for the registration of 3D free-form closed surfaces. The main step of the ICP consists of finding the closest points between data sets which are then used to estimate the parameters of the global rigid transformation. We propose a new technique based on the use of a look up matrix for finding the best correspondence pairs. The algorithm, called Comprehensive ICP (CICP) algorithm, is then successfully applied for the registration of 3D data of the left ventricle of the heart, acquired using two different medical imaging modalities.","PeriodicalId":176384,"journal":{"name":"2007 15th European Signal Processing Conference","volume":"203 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123731309","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Data hiding in H.264 video for lossless reconstruction of region of interest","authors":"Peter Meuel, M. Chaumont, W. Puech","doi":"10.5281/ZENODO.40675","DOIUrl":"https://doi.org/10.5281/ZENODO.40675","url":null,"abstract":"In this paper, we propose a method to protect faces in video-surveillance scenes. Our method deletes any visible information of faces in a video and uses a data-hiding technique to embed information in the video that allows further reconstruction of the faces if needed. When the entire information needed for reconstruction is embedded, we obtain infinite PSNR for the face regions.","PeriodicalId":176384,"journal":{"name":"2007 15th European Signal Processing Conference","volume":"55 9","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120893254","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An efficient detection prototype based on a mixed architecture for GAIA","authors":"S. Mignot, P. Laporte, F. Rigaud","doi":"10.5281/ZENODO.40546","DOIUrl":"https://doi.org/10.5281/ZENODO.40546","url":null,"abstract":"We present a high throughput prototype detection framework designed to meet the stringent scientific requirements of the Gaia mission and satisfy real-time and processing resource constraints on board the satellite. A mixed architecture, whose feasibility has been confirmed as part of phase A, is proposed which manages pixel level operations synchronously with their acquisition through programmable logic (FPGA) and answers the need for more flexibility and higher level object-wise processing by the use of software.","PeriodicalId":176384,"journal":{"name":"2007 15th European Signal Processing Conference","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124515619","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
P. D. Kukharchik, D. Martynov, I. Kheidorov, O. Kotov
{"title":"Vocal fold pathology detection using modified wavelet-like features and support vector machines","authors":"P. D. Kukharchik, D. Martynov, I. Kheidorov, O. Kotov","doi":"10.5281/ZENODO.40658","DOIUrl":"https://doi.org/10.5281/ZENODO.40658","url":null,"abstract":"Acoustic analysis is a perspective vocal pathology diagnostic method that can complement (and in some cases replace) other methods, based on direct vocal fold observation. There are different approaches and algorithms for feature extraction from acoustic speech signal and for making decision on their basis. While the second stage implies a choice of a variety of machine learning methods (SVMs, neural networks, etc), the first stage plays crucial part in performance and accuracy of the classification system, providing much more creativity in development of different feature extraction methods. In this paper we present initial study of feature extraction based on wavelets and pseudo-wavelets in the task of vocal pathology diagnostic. A new type of feature vector, based on continuous wavelet and wavelet-like transform of input audio data is proposed. Support vector machine was used as a classifier for testing the feature extraction procedure. The results of our experimental study are shown.","PeriodicalId":176384,"journal":{"name":"2007 15th European Signal Processing Conference","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128061236","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
M. J. Canet, V. Almenar, S. J. Flores, J. Valls-Coquillat
{"title":"Improvement of a time synchronization algorithm for IEEE 802.11a/g WLAN standard","authors":"M. J. Canet, V. Almenar, S. J. Flores, J. Valls-Coquillat","doi":"10.5281/ZENODO.40319","DOIUrl":"https://doi.org/10.5281/ZENODO.40319","url":null,"abstract":"In this paper a time synchronization algorithm for IEEE 802.11a/g OFDM-WLAN standard is evaluated and some modifications are proposed to improve its performance. The original synchronization algorithm utilizes coarse and fine estimation. In this paper fine time estimation is done using a cross-correlation as the original algorithm does, but different solutions are evaluated to cope with the problems of the coarse estimation in the original algorithm. The performances of these alternatives are tested by simulation in multi-path channels, at low signal to noise ratio and with carrier frequency offset. Also, the computational cost is evaluated.","PeriodicalId":176384,"journal":{"name":"2007 15th European Signal Processing Conference","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125246707","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Time-reversal music imaging using a recursive approach","authors":"A. Baussard, T. Boutin","doi":"10.5281/ZENODO.40288","DOIUrl":"https://doi.org/10.5281/ZENODO.40288","url":null,"abstract":"The time-reversal imaging with multiple signal classification (MUSIC) method for the location of point targets was first proposed by Devaney et al. In this contribution, a recursive time-reversal MUSIC algorithm is proposed in order to improve the detection and the location of close targets. The considered approach is based on the recursively applied and projected (RAP) MUSIC method which was first introduced for magnetoencephalographic (MEG) data processing.","PeriodicalId":176384,"journal":{"name":"2007 15th European Signal Processing Conference","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130085122","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Reduction of correlation computation in the permutation of the frequency domain ICA by selecting DOAs estimated in subarrays","authors":"Hao Yuan, H. Kawai, T. Horiuchi","doi":"10.5281/ZENODO.40290","DOIUrl":"https://doi.org/10.5281/ZENODO.40290","url":null,"abstract":"This paper addresses the permutation problem in blind source separation by frequency domain independent component analysis. We propose a method to reduce the correlation calculation in aligning permutation by selecting the DOAs (direction of arrival) estimated in several subarrays. When several subarrays are available, we have more choices to select a reliable DOA to determine the permutation. More reliable DOAs, more permutations can be determined by DOAs instead of the correlation of the separated signals between neighbouring frequency bins, which is much more complex than DOA estimation. To obtain accurate DOA in each subarray, we also propose an improved DOA estimation method by averaging the phase differences in each subarray. Experimental results show that about 58% of the correlation computation was reduced when compared with the case of a single array, and a more accurate DOA was estimated by the proposed method.","PeriodicalId":176384,"journal":{"name":"2007 15th European Signal Processing Conference","volume":"140 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133246228","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Affective sports highlight detection","authors":"Reede Ren, J. Jose, He Yin","doi":"10.5281/ZENODO.40352","DOIUrl":"https://doi.org/10.5281/ZENODO.40352","url":null,"abstract":"This paper explores a psychological attention approach for sports highlight detection. A multiresolution autoregressive algorithm is proposed to fuse misaligned audio-visual time sequences and estimate an unified attention curve. Game highlights are found by ranking attention intensity; content-based events are filtered out by allocating local attention peaks. The test bed includes six complete football games from World Cup 2002, 2006 and Champion League 2006, and two content suppliers, BBC and ITV. Two evaluations are presented, the comparison on average attention and event attention, and the ranking of goal events. Experiments show this fusion framework is robust on different data collections.","PeriodicalId":176384,"journal":{"name":"2007 15th European Signal Processing Conference","volume":"54 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116350926","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}