2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO)最新文献

筛选
英文 中文
Individual authentication through hand posture recognition using Multi-Hilbert Scanning Distance 使用多希尔伯特扫描距离通过手部姿势识别的个人身份验证
J. Ryu, S. Kamata
{"title":"Individual authentication through hand posture recognition using Multi-Hilbert Scanning Distance","authors":"J. Ryu, S. Kamata","doi":"10.5281/ZENODO.52184","DOIUrl":"https://doi.org/10.5281/ZENODO.52184","url":null,"abstract":"In this paper, we propose a novel Hand Posture Recognition (HPR) for biometrics. This study uses the three dimensional point clouds for robust hand posture recognition at the rotation and scale. Multi-Hilbert Scanning Distance (MHSD) are also introduced for mathematical approaches of shape matching. HPR framework is divided into five parts: detecting hand region, removing the wrist, aligning the hand pose, extracting feature descriptor, and matching. Based on the experimental results, this framework showed superior results for hand posture recognition rate.","PeriodicalId":201182,"journal":{"name":"2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO)","volume":"88 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128709707","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Object-based stereo up-mixer for wave field synthesis based on spatial information clustering 基于空间信息聚类的波场合成对象立体上混频器
N. Kamado, Masayuki Hirata, H. Saruwatari, K. Shikano
{"title":"Object-based stereo up-mixer for wave field synthesis based on spatial information clustering","authors":"N. Kamado, Masayuki Hirata, H. Saruwatari, K. Shikano","doi":"10.5281/ZENODO.43120","DOIUrl":"https://doi.org/10.5281/ZENODO.43120","url":null,"abstract":"To build an acoustic system that can maintain the localization of sound images included in stereo mixed signals, we propose a new object-based up-mixer that performs sound source separation and sound location estimation. First, in a preliminary experiment, we show the effectiveness of sound location estimation using the proposed up-mixer via objective tests. Next, we evaluate the perception accuracy of sound localization by wave field synthesis using the proposed up-mixer via subjective tests. The results show that the proposed up-mixer provides a good localization of sound images included in stereo mixed signals at several listening positions.","PeriodicalId":201182,"journal":{"name":"2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO)","volume":"54 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129271546","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
A novel method for 3D prostate MR-histology registration using anatomical landmarks 一种利用解剖标志进行前列腺三维磁共振组织登记的新方法
C. Hughes, O. Rouvière, F. Mege-Lechevallier, R. Souchon, R. Prost
{"title":"A novel method for 3D prostate MR-histology registration using anatomical landmarks","authors":"C. Hughes, O. Rouvière, F. Mege-Lechevallier, R. Souchon, R. Prost","doi":"10.5281/ZENODO.52199","DOIUrl":"https://doi.org/10.5281/ZENODO.52199","url":null,"abstract":"No current imaging technique is capable of detecting with precision tumours within the prostate. To evaluate each technique, the histology data must be registered to the imaged data. As the histology slices cannot be assumed to be cut along the same plane as the imaged data was acquired, the registration must be considered as a 3D problem. We propose a novel 3D registration method which uses the ejaculatory ducts, an anatomical landmark present in every prostate and visible in both MR and histology. The method has been tested on 3 prostate specimens. The aligned histology slices are first shear corrected, with an average angular error after correction of 2.83±1.46°. The MR-histology registration accuracy, evaluated operator-independently, is on average 1.50 ± 0.74 mm.","PeriodicalId":201182,"journal":{"name":"2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO)","volume":"88 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124027525","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Speaker verification using m-vector extracted from MLLR super-vector 从MLLR超向量中提取m向量的说话人验证
A. K. Sarkar, J. Bonastre, D. Matrouf
{"title":"Speaker verification using m-vector extracted from MLLR super-vector","authors":"A. K. Sarkar, J. Bonastre, D. Matrouf","doi":"10.5281/ZENODO.42813","DOIUrl":"https://doi.org/10.5281/ZENODO.42813","url":null,"abstract":"In this paper, we propose a speaker verification system called m-vector system, where speakers are represented by uniform segmentation of their Maximum Likelihood Linear Regression (MLLR) super-vectors, denoted m-vectors. The MLLR super-vectors are extracted with respect to Universal Background Model (UBM) with MLLR adaptation using the speakers data. Two criterion are followed to segment the MLLR super-vector: one is disjoint segmentation technique and other one is overlapped windows. Afterward, m-vectors are conditioned by our recently proposed [1] session variability compensation algorithm before calculating score during test phase. However, the proposed method is not based on any total variability space concept and uses simple MLLR transformation for extracting m-vector without considering any transcription of the speech segment. The proposed system shows promising performance compared to the conventional i-vector system. This indicates that session variability compensation plays an important role in speaker verification. Speakers can be represented by simpler way instead of generating i-vector in conventional system and able to achieve performance comparable to the i-vector based system. Experiment results are shown on NIST 2008 SRE core condition.","PeriodicalId":201182,"journal":{"name":"2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO)","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121496212","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Cosamp and SP for the cosparse analysis model Cosamp和SP用于协稀疏分析模型
R. Giryes, Michael Elad
{"title":"Cosamp and SP for the cosparse analysis model","authors":"R. Giryes, Michael Elad","doi":"10.5281/ZENODO.42838","DOIUrl":"https://doi.org/10.5281/ZENODO.42838","url":null,"abstract":"CoSaMP and Subspace-Pursuit (SP) are two recovery algorithms that find the sparsest representation for a given signal under a given dictionary in the presence of noise. These two methods were conceived in the context of the synthesis sparse representation modeling. The cosparse analysis model is a recent construction that stands as an interesting alternative to the synthesis approach. This new model characterizes signals by the space they are orthogonal to. Despite the similarity between the two, the cosparse analysis model is markedly different from the synthesis one. In this paper we propose analysis versions of the CoSaMP and the SP algorithms, and demonstrate their performance for the compressed sensing problem.","PeriodicalId":201182,"journal":{"name":"2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130174033","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
Fast blind channel shortening using a prediction-error filter aided by Autocorrelation Minimization 使用自相关最小化辅助的预测误差滤波器快速盲信道缩短
William G. Dalzell, C. Cowan
{"title":"Fast blind channel shortening using a prediction-error filter aided by Autocorrelation Minimization","authors":"William G. Dalzell, C. Cowan","doi":"10.5281/ZENODO.42756","DOIUrl":"https://doi.org/10.5281/ZENODO.42756","url":null,"abstract":"A hybrid algorithm for blind adaptive channel-shortening of ADSL communication channels is here proposed. The prediction-error filter is a well-known technique that can equalize minimum-phase channels for Multi-Carrier Modulation (MCM) modulated signals. Another well-known algorithm, Sum-Squared Autocorrelation Minimization (SAM), also suited to blind adaptive channel-shortening of MCM signals, is used to aid the prediction-error filter. SAM exhibits fast convergence, but has high computational cost and an unstable behaviour. The objectives of the hybrid algorithm are fast convergence and stable steady-state behaviour for modelled ADSL channels from one channel-shortening algorithm; we show the performance of the hybrid fulfils the objectives.","PeriodicalId":201182,"journal":{"name":"2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO)","volume":"124 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131626662","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Score-informed transcription for automatic piano tutoring 分数通知转录自动钢琴辅导
Emmanouil Benetos, Anssi Klapuri, S. Dixon
{"title":"Score-informed transcription for automatic piano tutoring","authors":"Emmanouil Benetos, Anssi Klapuri, S. Dixon","doi":"10.5281/ZENODO.52547","DOIUrl":"https://doi.org/10.5281/ZENODO.52547","url":null,"abstract":"In this paper, a score-informed transcription method for automatic piano tutoring is proposed. The method takes as input a recording made by a student which may contain mistakes, along with a reference score. The recording and the aligned synthesized score are automatically transcribed using the non-negative matrix factorization algorithm for multi-pitch estimation and hidden Markov models for note tracking. By comparing the two transcribed recordings, common errors occurring in transcription algorithms such as extra octave notes can be suppressed. The result is a piano-roll description which shows the mistakes made by the student along with the correctly played notes. Evaluation was performed on six pieces recorded using a Disklavier piano, using both manually-aligned and automatically-aligned scores as an input. Results comparing the system output with ground-truth annotation of the original recording reach a weighted F-measure of 93%, indicating that the proposed method can successfully analyze the student's performance.","PeriodicalId":201182,"journal":{"name":"2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130892189","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 32
Local stereo matching using motion cue and modified census in video disparity estimation 基于运动线索和修正普查的局部立体匹配视频视差估计
Zucheul Lee, Ramsin Khoshabeh, Jason Juang, Truong Q. Nguyen
{"title":"Local stereo matching using motion cue and modified census in video disparity estimation","authors":"Zucheul Lee, Ramsin Khoshabeh, Jason Juang, Truong Q. Nguyen","doi":"10.5281/ZENODO.42823","DOIUrl":"https://doi.org/10.5281/ZENODO.42823","url":null,"abstract":"In the human visual system, proximity, similarity, and motion are fundamental attributes that group visual objects together locally. The objects grouped by these attributes are most likely to have the same depth. In previous works, proximity and similarity have been considered in the computation of image disparity maps. However, they are insufficient for video disparity estimation because motion cues are very important for accurate depth estimation near edges of moving objects. We incorporate motion flow to compute each pixel's support weight, a measure directly affecting the accuracy of disparity maps in local methods. For robustness to image noise in flat areas, we propose a modified census transform with a noise buffer. The experimental results show that the proposed method produces more accurate disparity maps than current state-of-the-art methods, both on edges and in flat areas according to subjective and objective measures.","PeriodicalId":201182,"journal":{"name":"2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132687433","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
PP-RIDER: A Rotation-Invariant degraded partial palmprint recognition technique PP-RIDER:一种旋转不变性退化部分掌纹识别技术
Sanchit Singh, M. Ramalho, P. Correia, Luís Ducla Soares
{"title":"PP-RIDER: A Rotation-Invariant degraded partial palmprint recognition technique","authors":"Sanchit Singh, M. Ramalho, P. Correia, Luís Ducla Soares","doi":"10.5281/ZENODO.43271","DOIUrl":"https://doi.org/10.5281/ZENODO.43271","url":null,"abstract":"Matching degraded partial palmprint images against full palmprints is a challenging problem, since these images may be arbitrarily rotated, incomplete and often noisy. Such partial palmprints can be recovered from the palm impressions left on some surface (called latent partial palmprints) or can be generated, for testing purposes, by cropping full palmprints into different regions/segments (called synthetic/pseudo latent partial palmprints). This paper proposes a new technique, PP-RIDER - Partial Palmprint Rotation-Invariant and DEgraded Recognition, for recognizing degraded partial palmprints, which combines the Fourier-Mellin Transform (FMT) with the Modified Phase-Only Correlation (MPOC) technique. FMT is used to correct the arbitrary rotation of partial palmprints. Then, the concept behind MPOC is used for matching the degraded, but aligned, partial palmprint to a full palmprint registered in a database. Experimental results, using the THUMPALMLAB high resolution palmprint database, from which partial palmprints were cropped, randomly rotated and further degraded by adding white additive Gaussian noise and motion blur, show an improvement in comparison to the original MPOC technique.","PeriodicalId":201182,"journal":{"name":"2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO)","volume":"69 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124311659","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Unsupervised clustering of syllables for language identification 用于语言识别的无监督音节聚类
S. Dey, H. Murthy
{"title":"Unsupervised clustering of syllables for language identification","authors":"S. Dey, H. Murthy","doi":"10.5281/ZENODO.52577","DOIUrl":"https://doi.org/10.5281/ZENODO.52577","url":null,"abstract":"Automatic Language Recognition makes extensive use of phonotactics for identifying a language. The accuracy of phonotactic information depends upon the amount of data available for training. The state of the art approaches capture the phonotactics in terms of cross-lingual GMM tokens. The accuracy of such tokenisers crucially depends upon the availability of specific corpora. In this paper, we suggest an alternative to GMM tokens, namely, syllable based tokens. Syllables implicitly capture the phonotactics across phonemes in a language. Unsupervised Syllable tokenisation for language identification requires a) segmentation of speech into syllable-like units syllable level, and b) unsupervised modeling of the syllable tokens by Hidden Markov Models. The first issue is addressed by segmenting the wavform into syllable-like units using a well-established group delay based segmentation algorithm. To address the second issue, two different solutions are proposed, namely, (i) a top down clustering approach, which does not require significant parameter tuning, and is also robust, and (ii) a universal syllable approach. In this syllable models for every language are obtained from adapted universal syllable models. Experimental results on the OGI 1992 multilingual corpus and NIST 2003 LRE corpus show that the proposed approaches donot require significant tuning of parameters and the performance is comparable to that of a well-tuned baseline syllable tokenisation system.","PeriodicalId":201182,"journal":{"name":"2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO)","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114564264","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信