2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)最新文献_第10页

Compressive graph clustering from random sketches 随机草图的压缩图聚类

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2015-04-19 DOI: 10.1109/ICASSP.2015.7179016

Yuejie Chi

引用次数: 4

Accurate kernel-based spectrum sensing for Gaussian and non-Gaussian noise models 高斯和非高斯噪声模型的精确核频谱感知

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2015-04-19 DOI: 10.1109/ICASSP.2015.7178552

Argin Margoosian, J. Abouei, K. Plataniotis

引用次数: 10

Efficient construction of dictionaries for kernel adaptive filtering in a dynamic environment 动态环境下核自适应滤波字典的高效构造

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2015-04-19 DOI: 10.1109/ICASSP.2015.7178629

Taichi Ishida, Toshihisa Tanaka

引用次数: 6

Cognitive biases in Bayesian updating and optimal information sequencing 贝叶斯更新与最优信息排序中的认知偏差

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2015-04-19 DOI: 10.1109/ICASSP.2015.7178741

Sara Mourad, A. Tewfik

引用次数: 2

Closed-form Cramer-Rao lower bounds for DOA estimation from turbo-coded square-QAM-modulated transmissions 涡轮编码方形qam调制传输的DOA估计的闭形式Cramer-Rao下界

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2015-04-19 DOI: 10.1109/ICASSP.2015.7178620

F. Bellili, Chaima Elguet, Souheib Ben Amor, S. Affes, A. Stephenne

引用次数: 4

Integrating Gaussian mixtures into deep neural networks: Softmax layer with hidden variables 在深度神经网络中集成高斯混合:带隐变量的Softmax层

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2015-04-19 DOI: 10.1109/ICASSP.2015.7178779

Zoltán Tüske, Muhammad Ali Tahir, R. Schlüter, H. Ney

{"title":"Integrating Gaussian mixtures into deep neural networks: Softmax layer with hidden variables","authors":"Zoltán Tüske, Muhammad Ali Tahir, R. Schlüter, H. Ney","doi":"10.1109/ICASSP.2015.7178779","DOIUrl":"https://doi.org/10.1109/ICASSP.2015.7178779","url":null,"abstract":"In the hybrid approach, neural network output directly serves as hidden Markov model (HMM) state posterior probability estimates. In contrast to this, in the tandem approach neural network output is used as input features to improve classic Gaussian mixture model (GMM) based emission probability estimates. This paper shows that GMM can be easily integrated into the deep neural network framework. By exploiting its equivalence with the log-linear mixture model (LMM), GMM can be transformed to a large softmax layer followed by a summation pooling layer. Theoretical and experimental results indicate that the jointly trained and optimally chosen GMM and bottleneck tandem features cannot perform worse than a hybrid model. Thus, the question “hybrid vs. tandem” simplifies to optimizing the output layer of a neural network. Speech recognition experiments are carried out on a broadcast news and conversations task using up to 12 feed-forward hidden layers with sigmoid and rectified linear unit activation functions. The evaluation of the LMM layer shows recognition gains over the classic softmax output.","PeriodicalId":117666,"journal":{"name":"2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"84 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134092128","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 40

Neighborhood regression for edge-preserving image super-resolution 边缘保持图像超分辨率的邻域回归

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2015-04-19 DOI: 10.1109/ICASSP.2015.7178160

Yanghao Li, Jiaying Liu, Wenhan Yang, Zongming Guo

引用次数: 14

Estimation of multipath propagation delays and interaural time differences from 3-D head scans 三维头部扫描的多径传播延迟和间隔时间差估计

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2015-04-19 DOI: 10.1109/ICASSP.2015.7178019

H. Gamper, Mark R. P. Thomas, I. Tashev

引用次数: 7

Ocean acoustic waveguide invariant parameter estimation using tonal noise sources 基于调性噪声源的海声波导不变参数估计

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2015-04-19 DOI: 10.1109/ICASSP.2015.7178722

Andrew Harms, J. L. Odom, J. Krolik

引用次数: 7

JFA modeling with left-to-right structure and a new backend for text-dependent speaker recognition 具有从左到右结构的JFA建模和用于依赖文本的说话人识别的新后端

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2015-04-19 DOI: 10.1109/ICASSP.2015.7178860

P. Kenny, Themos Stafylakis, Md. Jahangir Alam, M. Kockmann

引用次数: 11