2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO): Latest Articles, Page 2

Individual authentication through hand posture recognition using Multi-Hilbert Scanning Distance

2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO) Pub Date : 2012-10-18 DOI: 10.5281/ZENODO.52184

J. Ryu, S. Kamata

Citations: 1

Object-based stereo up-mixer for wave field synthesis based on spatial information clustering

2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO) Pub Date : 2012-10-18 DOI: 10.5281/ZENODO.43120

N. Kamado, Masayuki Hirata, H. Saruwatari, K. Shikano

Citations: 6

A novel method for 3D prostate MR-histology registration using anatomical landmarks

2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO) Pub Date : 2012-10-18 DOI: 10.5281/ZENODO.52199

C. Hughes, O. Rouvière, F. Mege-Lechevallier, R. Souchon, R. Prost

Citations: 3

Speaker verification using m-vector extracted from MLLR super-vector

2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO) Pub Date : 2012-10-18 DOI: 10.5281/ZENODO.42813

A. K. Sarkar, J. Bonastre, D. Matrouf

Abstract: We propose a speaker verification system, called the m-vector system, in which speakers are represented by uniform segments of their Maximum Likelihood Linear Regression (MLLR) super-vectors, denoted m-vectors. The MLLR super-vectors are extracted with respect to a Universal Background Model (UBM) by MLLR adaptation on the speaker's data. Two criteria are used to segment the MLLR super-vector: disjoint segmentation and overlapped windows. The m-vectors are then conditioned by our recently proposed session variability compensation algorithm [1] before scores are computed in the test phase. The proposed method is not based on any total variability space concept; it uses a simple MLLR transformation to extract m-vectors and requires no transcription of the speech segment. The proposed system shows promising performance compared to the conventional i-vector system, which indicates that session variability compensation plays an important role in speaker verification. Speakers can thus be represented in a simpler way than by generating i-vectors as in the conventional system, while achieving performance comparable to the i-vector based system. Experimental results are reported on the NIST 2008 SRE core condition.
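The two segmentation criteria named in the abstract above (disjoint segments and overlapped windows) can be illustrated with a minimal sketch. The function name `segment_supervector`, the `window` and `hop` parameters, and the choice to drop a trailing partial window are all assumptions made for illustration; the abstract does not give the segment length, the overlap, or how the end of the super-vector is handled.

```python
import numpy as np


def segment_supervector(supervector, window, hop=None):
    """Split a 1-D super-vector into fixed-length segments.

    hop == window (the default) gives disjoint segments;
    hop < window gives overlapped windows.
    Trailing elements that do not fill a whole window are dropped
    (an assumption, not something stated in the abstract).
    """
    v = np.asarray(supervector, dtype=float)
    if hop is None:
        hop = window
    if window <= 0 or hop <= 0:
        raise ValueError("window and hop must be positive")
    if v.size < window:
        # Not enough elements for a single segment.
        return np.empty((0, window))
    starts = range(0, v.size - window + 1, hop)
    # One row per segment (one candidate "m-vector" per row).
    return np.stack([v[s:s + window] for s in starts])
```

For a hypothetical 10-element vector, `segment_supervector(v, window=4)` yields two disjoint segments, while `segment_supervector(v, window=4, hop=2)` yields four windows that each share half of their elements with the next one.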

Citations: 7

CoSaMP and SP for the cosparse analysis model

2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO) Pub Date : 2012-10-18 DOI: 10.5281/ZENODO.42838

R. Giryes, Michael Elad

Citations: 18

Fast blind channel shortening using a prediction-error filter aided by Autocorrelation Minimization

2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO) Pub Date : 2012-10-18 DOI: 10.5281/ZENODO.42756

William G. Dalzell, C. Cowan

Citations: 0

Score-informed transcription for automatic piano tutoring

2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO) Pub Date : 2012-10-18 DOI: 10.5281/ZENODO.52547

Emmanouil Benetos, Anssi Klapuri, S. Dixon

Citations: 32

Local stereo matching using motion cue and modified census in video disparity estimation

2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO) Pub Date : 2012-10-18 DOI: 10.5281/ZENODO.42823

Zucheul Lee, Ramsin Khoshabeh, Jason Juang, Truong Q. Nguyen

Citations: 11

PP-RIDER: A Rotation-Invariant degraded partial palmprint recognition technique

2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO) Pub Date : 2012-10-18 DOI: 10.5281/ZENODO.43271

Sanchit Singh, M. Ramalho, P. Correia, Luís Ducla Soares

Abstract: Matching degraded partial palmprint images against full palmprints is a challenging problem, since these images may be arbitrarily rotated, incomplete and often noisy. Such partial palmprints can be recovered from palm impressions left on a surface (latent partial palmprints) or can be generated, for testing purposes, by cropping full palmprints into different regions or segments (synthetic or pseudo-latent partial palmprints). This paper proposes a new technique for recognizing degraded partial palmprints, PP-RIDER (Partial Palmprint Rotation-Invariant and DEgraded Recognition), which combines the Fourier-Mellin Transform (FMT) with the Modified Phase-Only Correlation (MPOC) technique. FMT is used to correct the arbitrary rotation of partial palmprints. The concept behind MPOC is then used to match the degraded, but aligned, partial palmprint to a full palmprint registered in a database. Experimental results use the THUMPALMLAB high-resolution palmprint database, from which partial palmprints were cropped, randomly rotated and further degraded by adding additive white Gaussian noise and motion blur; they show an improvement over the original MPOC technique.
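To make the matching step in the abstract above more concrete, here is a minimal sketch of plain phase-only correlation (POC) between two equally sized images. It is not the Modified POC used in the paper, and it omits the Fourier-Mellin rotation correction entirely; the function names, the `eps` regularizer and the convention that the shift is reported as "how far `f` is displaced relative to `g`" are assumptions made for illustration.

```python
import numpy as np


def phase_only_correlation(f, g, eps=1e-12):
    """Return the phase-only correlation surface of two same-shape 2-D arrays."""
    F = np.fft.fft2(f)
    G = np.fft.fft2(g)
    cross = F * np.conj(G)
    # Keep only the phase of the cross-power spectrum.
    cross /= np.abs(cross) + eps
    return np.real(np.fft.ifft2(cross))


def estimate_shift(f, g):
    """Estimate the circular shift d such that f is approximately np.roll(g, d).

    Returns (shift, peak), where peak is the height of the correlation
    maximum and can serve as a similarity score.
    """
    poc = phase_only_correlation(f, g)
    peak_idx = np.unravel_index(np.argmax(poc), poc.shape)
    # Map indices in the upper half of each axis to negative shifts.
    shift = tuple(
        int(p) if p <= n // 2 else int(p) - n
        for p, n in zip(peak_idx, poc.shape)
    )
    return shift, float(poc[peak_idx])
```

For two images that differ only by a circular translation, the correlation surface has a single sharp peak close to 1 at the translation offset; noise, blur or a mismatch between the images lowers and spreads that peak, which is why the peak height is commonly used as a matching score.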

Citations: 6

Unsupervised clustering of syllables for language identification

2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO) Pub Date : 2012-10-18 DOI: 10.5281/ZENODO.52577

S. Dey, H. Murthy

Abstract: Automatic language recognition makes extensive use of phonotactics to identify a language. The accuracy of phonotactic information depends on the amount of data available for training. State-of-the-art approaches capture the phonotactics in terms of cross-lingual GMM tokens, and the accuracy of such tokenisers depends crucially on the availability of specific corpora. In this paper, we suggest an alternative to GMM tokens, namely syllable-based tokens. Syllables implicitly capture the phonotactics across phonemes in a language. Unsupervised syllable tokenisation for language identification requires (a) segmentation of speech into syllable-like units, and (b) unsupervised modeling of the syllable tokens by Hidden Markov Models. The first issue is addressed by segmenting the waveform into syllable-like units using a well-established group-delay-based segmentation algorithm. To address the second issue, two solutions are proposed: (i) a top-down clustering approach, which is robust and does not require significant parameter tuning, and (ii) a universal syllable approach, in which syllable models for every language are obtained from adapted universal syllable models. Experimental results on the OGI 1992 multilingual corpus and the NIST 2003 LRE corpus show that the proposed approaches do not require significant parameter tuning and that their performance is comparable to that of a well-tuned baseline syllable tokenisation system.

Citations: 3