2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)最新文献_第4页

Exploiting Convolutional Neural Networks for Phonotactic Based Dialect Identification 利用卷积神经网络进行语音法方言识别

2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2018-09-10 DOI: 10.1109/ICASSP.2018.8461486

M. Najafian, Sameer Khurana, Suwon Shon, Ahmed Ali, James R. Glass

引用次数: 34

A Stem Reu Site on the Integrated Design of Sensor Devices and Signal Processing Algorithms 传感器件集成设计与信号处理算法研究

2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2018-09-10 DOI: 10.1109/ICASSP.2018.8462483

A. Spanias, J. Christen

引用次数: 10

An End-to-End Language-Tracking Speech Recognizer for Mixed-Language Speech 混合语言语音的端到端语言跟踪语音识别器

2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2018-09-10 DOI: 10.1109/ICASSP.2018.8462180

Hiroshi Seki, Shinji Watanabe, Takaaki Hori, Jonathan Le Roux, J. Hershey

{"title":"An End-to-End Language-Tracking Speech Recognizer for Mixed-Language Speech","authors":"Hiroshi Seki, Shinji Watanabe, Takaaki Hori, Jonathan Le Roux, J. Hershey","doi":"10.1109/ICASSP.2018.8462180","DOIUrl":"https://doi.org/10.1109/ICASSP.2018.8462180","url":null,"abstract":"End-to-end automatic speech recognition (ASR) can significantly reduce the burden of developing ASR systems for new languages, by eliminating the need for linguistic information such as pronunciation dictionaries. This also creates an opportunity to build a monolithic multilingual ASR system with a language-independent neural network architecture. In our previous work, we proposed a monolithic neural network architecture that can recognize multiple languages, and showed its effectiveness compared with conventional language-dependent models. However, the model is not guaranteed to properly handle switches in language within an utterance, thus lacking the flexibility to recognize mixed-language speech such as code-switching. In this paper, we extend our model to enable dynamic tracking of the language within an utterance, and propose a training procedure that takes advantage of a newly created mixed-language speech corpus. Experimental results show that the extended model outperforms both language-dependent models and our previous model without suffering from performance degradation that could be associated with language switching.","PeriodicalId":6638,"journal":{"name":"2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"54 1","pages":"4919-4923"},"PeriodicalIF":0.0,"publicationDate":"2018-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80791050","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 58

Parametric Approximation of Piano Sound Based on Kautz Model with Sparse Linear Prediction 基于Kautz模型和稀疏线性预测的钢琴声音参数逼近

2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2018-09-10 DOI: 10.1109/ICASSP.2018.8461547

Kenji Kobayashi, Daiki Takeuchi, Mio Iwamoto, K. Yatabe, Yasuhiro Oikawa

引用次数: 5

Variational Deep Learning for Low-Dose Computed Tomography 低剂量计算机断层扫描的变分深度学习

2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2018-09-10 DOI: 10.1109/ICASSP.2018.8462312

Erich Kobler, Matthew Muckley, Baiyu Chen, F. Knoll, K. Hammernik, T. Pock, D. Sodickson, R. Otazo

引用次数: 9

Envelope Estimation by Tangentially Constrained Spline 切线约束样条包络估计

2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2018-09-10 DOI: 10.1109/ICASSP.2018.8462203

Tsubasa Kusano, K. Yatabe, Yasuhiro Oikawa

引用次数: 4

Automatic Music Transcription Leveraging Generalized Cepstral Features and Deep Learning 利用广义倒谱特征和深度学习的自动音乐转录

2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2018-09-10 DOI: 10.1109/ICASSP.2018.8462079

Yu-Te Wu, Berlin Chen, Li Su

引用次数: 8

Joint Probabilistic Forecasts of Temperature and Solar Irradiance 温度和太阳辐照度的联合概率预报

2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2018-09-10 DOI: 10.1109/ICASSP.2018.8462496

Raksha Ramakrishna, A. Bernstein, E. Dall’Anese, A. Scaglione

引用次数: 3

Adaptive Bayesian Channel Gain Cartography 自适应贝叶斯信道增益制图

2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2018-09-10 DOI: 10.1109/ICASSP.2018.8461412

Donghoon Lee, Dimitris Berberidis, G. Giannakis

引用次数: 5

Modal Decomposition of Musical Instrument Sound Via Alternating Direction Method of Multipliers 乘法器交替方向法的乐器声音模态分解

2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2018-09-10 DOI: 10.1109/ICASSP.2018.8462350

Yoshiki Masuyama, Tsubasa Kusano, K. Yatabe, Yasuhiro Oikawa

引用次数: 4