2012 IEEE International Workshop on Machine Learning for Signal Processing最新文献

筛选
英文 中文
Efficient high dynamic range imaging via matrix completion 通过矩阵完成高效的高动态范围成像
2012 IEEE International Workshop on Machine Learning for Signal Processing Pub Date : 2012-11-12 DOI: 10.1109/MLSP.2012.6349802
Grigorios Tsagkatakis, P. Tsakalides
{"title":"Efficient high dynamic range imaging via matrix completion","authors":"Grigorios Tsagkatakis, P. Tsakalides","doi":"10.1109/MLSP.2012.6349802","DOIUrl":"https://doi.org/10.1109/MLSP.2012.6349802","url":null,"abstract":"Typical digital cameras exhibit a limitation regarding the dynamic range of the scene radiance they can capture. High Dynamic Range (HDR) imaging refers to methods and systems that aim to generate images that exhibit higher dynamic range between the lightest and the darkest parts of the an image. A typical approach for generating HDR images is exposure bracketing where multiple frames, each one with a different exposure setting, are captured and combined to a HDR image of the scene. The large number of images that exposure bracketing requires often leads to motion artefacts that limit the visual quality of the resulting HDR image. In this work, we propose a novel approach in HDR imaging that significantly reduces the necessary number of images. In our proposed system, we employ the notion of random exposure where each pixel of a single frame collects light for a random amount of time. By collecting a small number of such images, the full sequence of low dynamic range images can be reconstructed and subsequently used for HDR generation. The problem is solved by casting the reconstruction of the sequence as a nuclear norm minimization problem following the premises of the recently proposed theory of Matrix Completion. Experimental results suggest that the proposed method is able to reconstruct the sequence from as low as 20% of the images that traditional techniques require with minimal reduction in image quality.","PeriodicalId":262601,"journal":{"name":"2012 IEEE International Workshop on Machine Learning for Signal Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129565495","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Stochastic triplet embedding 随机三联体嵌入
2012 IEEE International Workshop on Machine Learning for Signal Processing Pub Date : 2012-11-12 DOI: 10.1109/MLSP.2012.6349720
L. Maaten, Kilian Q. Weinberger
{"title":"Stochastic triplet embedding","authors":"L. Maaten, Kilian Q. Weinberger","doi":"10.1109/MLSP.2012.6349720","DOIUrl":"https://doi.org/10.1109/MLSP.2012.6349720","url":null,"abstract":"This paper considers the problem of learning an embedding of data based on similarity triplets of the form “A is more similar to B than to C”. This learning setting is of relevance to scenarios in which we wish to model human judgements on the similarity of objects. We argue that in order to obtain a truthful embedding of the underlying data, it is insufficient for the embedding to satisfy the constraints encoded by the similarity triplets. In particular, we introduce a new technique called t-Distributed Stochastic Triplet Embedding (t-STE) that collapses similar points and repels dissimilar points in the embedding - even when all triplet constraints are satisfied. Our experimental evaluation on three data sets shows that as a result, t-STE is much better than existing techniques at revealing the underlying data structure.","PeriodicalId":262601,"journal":{"name":"2012 IEEE International Workshop on Machine Learning for Signal Processing","volume":"156 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126594354","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 205
Optimal cost function and magnitude power for NMF-based speech separation and music interpolation 基于nmf的语音分离和音乐插值的最优代价函数和幅度功率
2012 IEEE International Workshop on Machine Learning for Signal Processing Pub Date : 2012-11-12 DOI: 10.1109/MLSP.2012.6349726
Brian King, C. Févotte, P. Smaragdis
{"title":"Optimal cost function and magnitude power for NMF-based speech separation and music interpolation","authors":"Brian King, C. Févotte, P. Smaragdis","doi":"10.1109/MLSP.2012.6349726","DOIUrl":"https://doi.org/10.1109/MLSP.2012.6349726","url":null,"abstract":"There has been a significant amount of research in new algorithms and applications for nonnegative matrix factorization, but relatively little has been published on practical considerations for real-world applications, such as choosing optimal parameters for a particular application. In this paper, we will look at two applications, single-channel source separation of speech and interpolating missing music data. We will present the optimal parameters found for the experiments as well as discuss how parameters affect performance.","PeriodicalId":262601,"journal":{"name":"2012 IEEE International Workshop on Machine Learning for Signal Processing","volume":"84 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124568061","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 45
Markov Chain Monte Carlo inference for probabilistic latent tensor factorization 概率潜张量分解的马尔可夫链蒙特卡罗推理
2012 IEEE International Workshop on Machine Learning for Signal Processing Pub Date : 2012-11-12 DOI: 10.1109/MLSP.2012.6349799
Umut Simsekli, A. Cemgil
{"title":"Markov Chain Monte Carlo inference for probabilistic latent tensor factorization","authors":"Umut Simsekli, A. Cemgil","doi":"10.1109/MLSP.2012.6349799","DOIUrl":"https://doi.org/10.1109/MLSP.2012.6349799","url":null,"abstract":"Probabilistic Latent Tensor Factorization (PLTF) is a recently proposed probabilistic framework for modeling multi-way data. Not only the popular tensor factorization models but also any arbitrary tensor factorization structure can be realized by the PLTF framework. This paper presents Markov Chain Monte Carlo procedures (namely the Gibbs sampler) for making inference on the PLTF framework. We provide the abstract algorithms that are derived for the general case and the overall procedure is illustrated on both synthetic and real data.","PeriodicalId":262601,"journal":{"name":"2012 IEEE International Workshop on Machine Learning for Signal Processing","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130751272","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Single-sided objective speech intelligibility assessment based on Sparse signal representation 基于稀疏信号表示的单面客观语音可理解度评估
2012 IEEE International Workshop on Machine Learning for Signal Processing Pub Date : 2012-11-12 DOI: 10.1109/MLSP.2012.6349776
G. Costantini, M. Todisco, R. Perfetti, A. Paoloni, G. Saggio
{"title":"Single-sided objective speech intelligibility assessment based on Sparse signal representation","authors":"G. Costantini, M. Todisco, R. Perfetti, A. Paoloni, G. Saggio","doi":"10.1109/MLSP.2012.6349776","DOIUrl":"https://doi.org/10.1109/MLSP.2012.6349776","url":null,"abstract":"Transcription of speech signals, originating from a lawful interception, is particularly important in the forensic phonetics framework. These signals are often degraded and the transcript may not replicate what was actually pronounced. In the absence of the clean signal, the only way to estimate the level of accuracy that can be obtained in the transcription is to develop an objective methodology for intelligibility measurements. In this paper a method based on the Normalized Spectrum Envelope (NSE) and Sparse Non-negative Matrix Factorization (SNMF) is proposed to evaluate the signal intelligibility. The approaches are tested with three different noise types and the results are compared with the speech intelligibility scores measured by subjective tests. The results of the experiments show a high correlation between objective measurements and subjective evaluations. Therefore, the proposed methodology can be successfully used in order to establish whether a given intercepted signal can be transcribed with sufficient reliability.","PeriodicalId":262601,"journal":{"name":"2012 IEEE International Workshop on Machine Learning for Signal Processing","volume":"16 17","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114087992","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Neural correlates of visual perception in rapid serial visual presentation paradigms 快速序列视觉呈现范式中视觉知觉的神经关联
2012 IEEE International Workshop on Machine Learning for Signal Processing Pub Date : 2012-11-12 DOI: 10.1109/MLSP.2012.6349766
Yonghong Huang, K. Hild, M. Pavel, S. Mathan, Deniz Erdoğmuş
{"title":"Neural correlates of visual perception in rapid serial visual presentation paradigms","authors":"Yonghong Huang, K. Hild, M. Pavel, S. Mathan, Deniz Erdoğmuş","doi":"10.1109/MLSP.2012.6349766","DOIUrl":"https://doi.org/10.1109/MLSP.2012.6349766","url":null,"abstract":"Human brain signals associated with visual perceptual processes have been used for image recognition. This paper presents several insights on the neural correlates of human visual perception by analyzing the neural correlates that result when humans view realistic images using a rapid serial visual presentation (RSVP) image display paradigm. We propose an image information extraction model and examine the relationship between the brain evoked response - using event related potential (ERP) characteristics - and the level of difficulty for humans to detect targets as a function of both visual stimulus complexity and task difficulty. We develop a computational model to quantify subject performance and the difficulty of realistic stimuli. Our results show that: (1) more difficult trials produce less prominent ERP patterns, thus reducing the performance of machine-based ERP detection; (2) on average for the same behavioral performance level, a pair of ERP's extracted from two easy trials are more similar than a pair of ERP's from two hard trials; and (3) both stimulus and task difficulty are correlated with neural activity. Our findings indicate that, for dynamic tasks involved in visual information processing, the brain may allocate additional cognitive resources, such as attention, to a given visual stimulus, as the task and/or stimulus difficulty increases.","PeriodicalId":262601,"journal":{"name":"2012 IEEE International Workshop on Machine Learning for Signal Processing","volume":"60 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114690248","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Complex-valued analysis and visualization of fMRI data for event-related and block-design paradigms 事件相关和块设计范例的fMRI数据的复杂值分析和可视化
2012 IEEE International Workshop on Machine Learning for Signal Processing Pub Date : 2012-11-12 DOI: 10.1109/MLSP.2012.6349790
Pedro A. Rodriguez, T. Adalı, V. Calhoun
{"title":"Complex-valued analysis and visualization of fMRI data for event-related and block-design paradigms","authors":"Pedro A. Rodriguez, T. Adalı, V. Calhoun","doi":"10.1109/MLSP.2012.6349790","DOIUrl":"https://doi.org/10.1109/MLSP.2012.6349790","url":null,"abstract":"Independent Component Analysis (ICA) has been noted to be promising for the study of functional magnetic resonance imaging (fMRI) data also in its native complex-valued form. In this paper, we demonstrate the first successful application of group ICA to complex-valued fMRI data of an event-related paradigm. We show that networks associated with event-related responses as well as intrinsic fluctuations of hemodymamic activity can be extracted for data collected during an auditory oddball paradigm. The intrinsic networks are of particular interest due to their potential to study cognitive function and mental illness, including schizophrenia. More importantly, we show that analysis of fMRI data in its complex form can increase the sensitivity and specificity in the detection of activated brain regions both for event-related and block design paradigms when compared to magnitude-only applications. In addition, we introduce a novel fMRI phase-based visualization (FPV) technique to identify activated voxels such that the complex nature of the data is fully taken into account.","PeriodicalId":262601,"journal":{"name":"2012 IEEE International Workshop on Machine Learning for Signal Processing","volume":"264 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121873424","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Voice analysis of patients with neurological disorders using acoustical and nonlinear tools 使用声学和非线性工具分析神经系统疾病患者的声音
2012 IEEE International Workshop on Machine Learning for Signal Processing Pub Date : 2012-11-12 DOI: 10.1109/MLSP.2012.6349803
M. E. Dajer, P. Scalassara, J. Marrara, J.C. Pereira
{"title":"Voice analysis of patients with neurological disorders using acoustical and nonlinear tools","authors":"M. E. Dajer, P. Scalassara, J. Marrara, J.C. Pereira","doi":"10.1109/MLSP.2012.6349803","DOIUrl":"https://doi.org/10.1109/MLSP.2012.6349803","url":null,"abstract":"In this paper, we analyze voice signals recorded from patients with neurological disorders of different etiologies. The study was based on three samples of each patient: one before any ingestion, one after the swallowing of a liquid solution, and one after the swallowing of a pasty solution. We used three approaches: first, acoustical analysis, specifically fundamental frequency, jitter and shimmer; second, a proposed analysis method of vocal dynamic visual patterns, which are based on phase space reconstruction of the signals; and third, relative entropy analysis between the groups of signals. We show that the acoustical measures were not able to differentiate the study cases, relative entropy was only partially able to perform this task, but the visual patterns analysis was successful.","PeriodicalId":262601,"journal":{"name":"2012 IEEE International Workshop on Machine Learning for Signal Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129480218","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Recursive outlier-robust filtering and smoothing for nonlinear systems using the multivariate student-t distribution 基于多元student-t分布的非线性系统递归离群鲁棒滤波与平滑
2012 IEEE International Workshop on Machine Learning for Signal Processing Pub Date : 2012-11-12 DOI: 10.1109/MLSP.2012.6349794
R. Piché, S. Särkkä, Jouni Hartikainen
{"title":"Recursive outlier-robust filtering and smoothing for nonlinear systems using the multivariate student-t distribution","authors":"R. Piché, S. Särkkä, Jouni Hartikainen","doi":"10.1109/MLSP.2012.6349794","DOIUrl":"https://doi.org/10.1109/MLSP.2012.6349794","url":null,"abstract":"Nonlinear Kalman filter and Rauch-Tung-Striebel smoother type recursive estimators for nonlinear discrete-time state space models with multivariate Student's t-distributed measurement noise are presented. The methods approximate the posterior state at each time step using the variational Bayes method. The nonlinearities in the dynamic and measurement models are handled using the nonlinear Gaussian filtering and smoothing approach, which encompasses many known nonlinear Kalman-type filters. The method is compared to alternative methods in a computer simulation.","PeriodicalId":262601,"journal":{"name":"2012 IEEE International Workshop on Machine Learning for Signal Processing","volume":"59 ","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120872752","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 147
Accelerometer based gesture recognition system using distance metric learning for nearest neighbour classification 基于加速度计的手势识别系统,使用距离度量学习进行最近邻分类
2012 IEEE International Workshop on Machine Learning for Signal Processing Pub Date : 2012-11-12 DOI: 10.1109/MLSP.2012.6349717
T. Marasovic, V. Papić
{"title":"Accelerometer based gesture recognition system using distance metric learning for nearest neighbour classification","authors":"T. Marasovic, V. Papić","doi":"10.1109/MLSP.2012.6349717","DOIUrl":"https://doi.org/10.1109/MLSP.2012.6349717","url":null,"abstract":"The need to improve communication between humans and computers has been motivation for defining new communication models, and accordingly, new ways of interacting with machines. In many applications today, user interaction is moving away from traditional keyboards and mouses and is becoming much more physical, pervasive and intuitive. This paper examines hand gestures as an alternative or supplementary input modality for mobile devices. A new gesture recognition system based on the use of acceleration sensor, that is nowadays being featured in a growing number of consumer electronic devices, is presented. Accelerometer sensor readings can be used for detection of hand movements and their classification into previously trained gestures. The proposed system utilizes Mahalanobis distance metric learning to improve the accuracy of nearest neighbour classification. In the approach we adopted, the objective function for metric learning is convex and, therefore, the required optimization can be cast as an instance of semidefinite programming. The experiments, carried out to evaluate system performance, demonstrate its efficacy.","PeriodicalId":262601,"journal":{"name":"2012 IEEE International Workshop on Machine Learning for Signal Processing","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116754416","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信