2017 IEEE 27th International Workshop on Machine Learning for Signal Processing (MLSP)最新文献

筛选
英文 中文
Object classification with convolution neural network based on the time-frequency representation of their echo 基于回声时频表示的卷积神经网络目标分类
Mariia Dmitrieva, Matias Valdenegro-Toro, K. Brown, G. Heald, D. Lane
{"title":"Object classification with convolution neural network based on the time-frequency representation of their echo","authors":"Mariia Dmitrieva, Matias Valdenegro-Toro, K. Brown, G. Heald, D. Lane","doi":"10.1109/MLSP.2017.8168134","DOIUrl":"https://doi.org/10.1109/MLSP.2017.8168134","url":null,"abstract":"This paper presents classification of spherical objects with different physical properties. The classification is based on the energy distribution in wideband pulses that have been scattered from objects. The echo is represented in Time-Frequency Domain (TFD), using Short Time Fourier Transform (STFT) with different window lengths, and is fed into a Convolution Neural Network (CNN) for classification. The results for different window lengths are analysed to study the influence of time and frequency resolution in classification. The CNN performs the best results with accuracy of (98.44 ± 0.8)% over 5 object classes trained on grayscale TFD images with 0.1 ms window length of STFT. The CNN is compared with a Multilayer Perceptron classifier, Support Vector Machine, and Gradient Boosting.","PeriodicalId":6542,"journal":{"name":"2017 IEEE 27th International Workshop on Machine Learning for Signal Processing (MLSP)","volume":"2 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76796677","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Speech recognition features based on deep latent Gaussian models 基于深隐高斯模型的语音识别特征
Andros Tjandra, S. Sakti, Satoshi Nakamura
{"title":"Speech recognition features based on deep latent Gaussian models","authors":"Andros Tjandra, S. Sakti, Satoshi Nakamura","doi":"10.1109/MLSP.2017.8168174","DOIUrl":"https://doi.org/10.1109/MLSP.2017.8168174","url":null,"abstract":"This paper constructs speech features based on a generative model using a deep latent Gaussian model (DLGM), which is trained using stochastic gradient variational Bayes (SGVB) algorithm and performs efficient approximate inference and learning with a directed probabilistic graphical model. The trained DLGM then generate latent variables based on Gaussian distribution, which is used as new features for a deep neural network (DNN) acoustic model. Here we compare our results with and without features transformed by DLGM and also observe the benefits of combining both the proposed and original features into a single DNN. Our experimental results show that the proposed features using DLGM improved the ASR performance. Furthermore, the DNN acoustic model, which combined the proposed and original features, gave the best performances.","PeriodicalId":6542,"journal":{"name":"2017 IEEE 27th International Workshop on Machine Learning for Signal Processing (MLSP)","volume":"54 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75690678","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A linear stochastic state space model for electrocardiograms 心电图的线性随机状态空间模型
Kimmo Suotsalo, S. Särkkä
{"title":"A linear stochastic state space model for electrocardiograms","authors":"Kimmo Suotsalo, S. Särkkä","doi":"10.1109/MLSP.2017.8168126","DOIUrl":"https://doi.org/10.1109/MLSP.2017.8168126","url":null,"abstract":"This paper proposes a linear stochastic state space model for electrocardiogram signal processing and analysis. The model is obtained as a discretized version of Wiener process acceleration model. The model is combined with a fixed-lag Rauch-Tung-Striebel smoother to perform on-line signal denoising, feature extraction, and beat classification. The results indicate that the proposed approach outperforms a conventional FIR filter in terms of improved signal-to-noise ratio, and that the approach can be used for highly accurate online classification of normal beats and premature ventricular contractions. The benefits of the model include the possibility to use closed-form solutions to the optimal filtering and smoothing problems, quick adaptation to sudden changes in beat morphology and heart rate, simple and fast initialization, preprocessing-free operation, intuitive interpretation of the system state, and more.","PeriodicalId":6542,"journal":{"name":"2017 IEEE 27th International Workshop on Machine Learning for Signal Processing (MLSP)","volume":"180 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74373316","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Learning embeddings for speaker clustering based on voice equality 基于语音平等的说话人聚类学习嵌入
Y. X. Lukic, Carlo Vogt, Oliver Durr, Thilo Stadelmann
{"title":"Learning embeddings for speaker clustering based on voice equality","authors":"Y. X. Lukic, Carlo Vogt, Oliver Durr, Thilo Stadelmann","doi":"10.1109/MLSP.2017.8168166","DOIUrl":"https://doi.org/10.1109/MLSP.2017.8168166","url":null,"abstract":"Recent work has shown that convolutional neural networks (CNNs) trained in a supervised fashion for speaker identification are able to extract features from spectrograms which can be used for speaker clustering. These features are represented by the activations of a certain hidden layer and are called embeddings. However, previous approaches require plenty of additional speaker data to learn the embedding, and although the clustering results are then on par with more traditional approaches using MFCC features etc., room for improvements stems from the fact that these embeddings are trained with a surrogate task that is rather far away from segregating unknown voices — namely, identifying few specific speakers. We address both problems by training a CNN to extract embeddings that are similar for equal speakers (regardless of their specific identity) using weakly labeled data. We demonstrate our approach on the well-known TIMIT dataset that has often been used for speaker clustering experiments in the past. We exceed the clustering performance of all previous approaches, but require just 100 instead of 590 unrelated speakers to learn an embedding suited for clustering.","PeriodicalId":6542,"journal":{"name":"2017 IEEE 27th International Workshop on Machine Learning for Signal Processing (MLSP)","volume":"347 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77784383","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Neonatal seizure detection using convolutional neural networks 基于卷积神经网络的新生儿癫痫检测
Alison O'Shea, G. Lightbody, G. Boylan, A. Temko
{"title":"Neonatal seizure detection using convolutional neural networks","authors":"Alison O'Shea, G. Lightbody, G. Boylan, A. Temko","doi":"10.1109/MLSP.2017.8168193","DOIUrl":"https://doi.org/10.1109/MLSP.2017.8168193","url":null,"abstract":"This study presents a novel end-to-end architecture that learns hierarchical representations from raw EEG data using fully convolutional deep neural networks for the task of neonatal seizure detection. The deep neural network acts as both feature extractor and classifier, allowing for end-to-end optimization of the seizure detector. The designed system is evaluated on a large dataset of continuous unedited multichannel neonatal EEG totaling 835 hours and comprising of 1389 seizures. The proposed deep architecture, with sample-level filters, achieves an accuracy that is comparable to the state-of-the-art SVM-based neonatal seizure detector, which operates on a set of carefully designed hand-crafted features. The fully convolutional architecture allows for the localization of EEG waveforms and patterns that result in high seizure probabilities for further clinical examination.","PeriodicalId":6542,"journal":{"name":"2017 IEEE 27th International Workshop on Machine Learning for Signal Processing (MLSP)","volume":"27 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87000595","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 40
Detecting malignant ventricular arrhythmias in electrocardiograms by Gaussian process classification 用高斯过程分类检测心电图中的恶性室性心律失常
Kimmo Suotsalo, S. Särkkä
{"title":"Detecting malignant ventricular arrhythmias in electrocardiograms by Gaussian process classification","authors":"Kimmo Suotsalo, S. Särkkä","doi":"10.1109/MLSP.2017.8168160","DOIUrl":"https://doi.org/10.1109/MLSP.2017.8168160","url":null,"abstract":"Ventricular tachycardia, ventricular flutter, and ventricular fibrillation are malignant forms of cardiac arrhythmias, whose occurrence may be a life-threatening event. Several methods exist for detecting these arrhythmias in the electrocardiogram. However, the use of Gaussian process classifiers in this context has not been reported in the current literature. In comparison to the popular support vector machines, Gaussian processes have the advantage of being fully probabilistic, they can be re-casted in Bayesian filtering compatible state-space form, and they can be flexibly combined with first-principles physical models. In this paper we use Gaussian process classification to detect malignant ventricular arrhythmias in the electrocardiogram. We describe how Gaussian process classifiers can be used to solve the detection problem, and show that the proposed classifiers achieve a performance that is comparable to that of the state-of-the-art methods henceforth laying down promising foundations for more general electrocardiogram-based arrhythmia detection framework.","PeriodicalId":6542,"journal":{"name":"2017 IEEE 27th International Workshop on Machine Learning for Signal Processing (MLSP)","volume":"36 1","pages":"1-5"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91515808","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Predicting individualized intelligence quotient scores using brainnetome-atlas based functional connectivity 使用基于脑网络图谱的功能连通性预测个性化智商分数
R. Jiang, S. Qi, Yuhui Du, Weizheng Yan, V. Calhoun, T. Jiang, J. Sui
{"title":"Predicting individualized intelligence quotient scores using brainnetome-atlas based functional connectivity","authors":"R. Jiang, S. Qi, Yuhui Du, Weizheng Yan, V. Calhoun, T. Jiang, J. Sui","doi":"10.1109/MLSP.2017.8168150","DOIUrl":"https://doi.org/10.1109/MLSP.2017.8168150","url":null,"abstract":"Variation in several brain regions and neural parameters is associated with intelligence. In this study, we adopted functional connectivity (FC) based on Brainnetome-atlas to predict the intelligence quotient (IQ) scores quantitatively with a prediction framework incorporating advanced feature selection and regression methods. We compared prediction performance of five regression models and evaluated the effectiveness of feature selection. The best prediction performance was achieved by ReliefF+LASSO, by which correlations of r=0.72 and r=0.46 between prediction and true values were obtained for 174 female and 186 male subjects respectively in a leave-one-out-cross-validation, suggesting that for female subjects, a better prediction of IQ scores can be achieved using precise FCs. Further, weight analysis revealed the most predictive FCs and the relevant regions. Results support the hypothesis that intelligence is characterized by interaction between multiple brain regions, especially the parieto-frontal integration theory implicated areas. This study facilitates our understanding of the biological basis of intelligence by individualized prediction.","PeriodicalId":6542,"journal":{"name":"2017 IEEE 27th International Workshop on Machine Learning for Signal Processing (MLSP)","volume":"60 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86376970","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Discriminating bipolar disorder from major depression based on kernel SVM using functional independent components 基于功能独立分量的核支持向量机判别双相情感障碍与重度抑郁症
Shuang Gao, E. Osuch, M. Wammes, J. Théberge, T. Jiang, V. Calhoun, J. Sui
{"title":"Discriminating bipolar disorder from major depression based on kernel SVM using functional independent components","authors":"Shuang Gao, E. Osuch, M. Wammes, J. Théberge, T. Jiang, V. Calhoun, J. Sui","doi":"10.1109/MLSP.2017.8168110","DOIUrl":"https://doi.org/10.1109/MLSP.2017.8168110","url":null,"abstract":"Bipolar disorder (BD) and major depressive disorder (MDD) both share depressive symptoms, so how to discriminate them in early depressive episodes is a major clinical challenge. Independent components (ICs) extracted from fMRI data have been proved to carry distinguishing information and can be used for classification. Here we extend a previous method that makes use of multiple fMRI ICs to build linear subspaces for each individual, which is further used as input for classifiers. The similarity matrix between different subjects is first calculated using distance metric of principal angle, which is then projected into kernel space for support vector machine (SVM) classification among 37 BDs and 36 MDDs. In practice, we adopt forward selection technique on 20 ICs and nested 10-fold cross validation to select the most discriminative IC combinations of fMRI and determine the final diagnosis by majority voting mechanism. The results on human data demonstrate that the proposed method achieves much better performance than its initial version [8] (93% vs. 75%), and identifies 5 discriminative fMRI components for distinguishing BD and MDD patients, which are mainly located in prefrontal cortex, default mode network and thalamus etc. This work provides a new framework for helping diagnose the new patients with overlapped symptoms between BD and MDD, which not only adds to our understanding of functional deficits in mood disorders, but also may serve as potential biomarkers for their differential diagnosis.","PeriodicalId":6542,"journal":{"name":"2017 IEEE 27th International Workshop on Machine Learning for Signal Processing (MLSP)","volume":"9 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87577059","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
Gaussian density guided deep neural network for single-channel speech enhancement 基于高斯密度的深度神经网络单通道语音增强
Li Chai, Jun Du, Yannan Wang
{"title":"Gaussian density guided deep neural network for single-channel speech enhancement","authors":"Li Chai, Jun Du, Yannan Wang","doi":"10.1109/MLSP.2017.8168116","DOIUrl":"https://doi.org/10.1109/MLSP.2017.8168116","url":null,"abstract":"Recently, the minimum mean squared error (MMSE) has been a benchmark of optimization criterion for deep neural network (DNN) based speech enhancement. In this study, a probabilistic learning framework to estimate the DNN parameters for single-channel speech enhancement is proposed. First, the statistical analysis shows that the prediction error vector at the DNN output well follows a unimodal density for each log-power spectral component. Accordingly, we present a maximum likelihood (ML) approach to DNN parameter learning by charactering the prediction error vector as a multivariate Gaussian density with a zero mean vector and an unknown covariance matrix. It is demonstrated that the proposed learning approach can achieve a better generalization capability than MMSE-based DNN learning for unseen noise types, which can significantly reduce the speech distortions in low SNR environments.","PeriodicalId":6542,"journal":{"name":"2017 IEEE 27th International Workshop on Machine Learning for Signal Processing (MLSP)","volume":"24 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82227098","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
An accelerated newton's method for projections onto the ℓ1-ball 用加速牛顿法求球面上的投影
P. Rodríguez
{"title":"An accelerated newton's method for projections onto the ℓ1-ball","authors":"P. Rodríguez","doi":"10.1109/MLSP.2017.8168161","DOIUrl":"https://doi.org/10.1109/MLSP.2017.8168161","url":null,"abstract":"We present a simple and computationally efficient algorithm, based on the accelerated Newton's method, to solve the root finding problem associated with the projection onto the ℓ1-ball problem. Considering an interpretation of the Michelot's algorithm as Newton method, our algorithm can be understood as an accelerated version of the Michelot's algorithm, that needs significantly less major iterations to converge to the solution. Although the worst-case performance of the propose algorithm is O(n2), it exhibits in practice an O(n) performance and it is empirically demonstrated that it is competitive or faster than existing methods.","PeriodicalId":6542,"journal":{"name":"2017 IEEE 27th International Workshop on Machine Learning for Signal Processing (MLSP)","volume":"14 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83513320","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信