2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP): Latest Publications

Reduced Dimension Minimum BER PSK Precoding for Constrained Transmit Signals in Massive MIMO

2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2018-11-30 DOI: 10.1109/ICASSP.2018.8461642

A. L. Swindlehurst, H. Jedda, I. Fijalkow

Citations: 11

Low Complexity Joint RDO of Prediction Units Couples for HEVC Intra Coding

2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2018-10-08 DOI: 10.1109/ICASSP.2018.8462489

Maxime Bichon, J. L. Tanou, M. Ropert, W. Hamidouche, L. Morin, Lu Zhang

Citations: 5

Non-Native Children Speech Recognition Through Transfer Learning

2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2018-09-25 DOI: 10.1109/ICASSP.2018.8462059

M. Matassoni, R. Gretter, D. Falavigna, D. Giuliani

Abstract: This work deals with non-native children's speech and investigates both multi-task and transfer learning approaches to adapt a multi-language Deep Neural Network (DNN) to speakers, specifically children, learning a foreign language. The application scenario is characterized by young students learning English and German and reading sentences in these second languages, as well as in their mother language. The paper analyzes and discusses techniques for training effective DNN-based acoustic models starting from children's native speech and performing adaptation with limited non-native audio material. A multilingual model is adopted as baseline, where a common phonetic lexicon, defined in terms of the units of the International Phonetic Alphabet (IPA), is shared across the three languages at hand (Italian, German and English); DNN adaptation methods based on transfer learning are evaluated on significant non-native evaluation sets. Results show that the resulting non-native models allow a significant improvement with respect to a monolingual system adapted to speakers of the target language.

Pages: 6229-6233

Citations: 36

Ranking Using Transition Probabilities Learned from Multi-Attribute Data

2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2018-09-13 DOI: 10.1109/ICASSP.2018.8462132

Sigurd Løkse, R. Jenssen

Citations: 0

Synthesis of Images by Two-Stage Generative Adversarial Networks

2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2018-09-13 DOI: 10.1109/ICASSP.2018.8461984

Qiang Huang, P. Jackson, Mark D. Plumbley, Wenwu Wang

Citations: 2

Pulse-Stream Models in Time-of-Flight Imaging

2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2018-09-13 DOI: 10.1109/ICASSP.2018.8461767

Adrien Besson, Dimitris Perdios, Y. Wiaux, J. Thiran

Citations: 2

EMG Acquisition and Hand Pose Classification for Bionic Hands from Randomly-Placed Sensors

2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2018-09-13 DOI: 10.1109/ICASSP.2018.8462409

Sumit A. Raurale, J. McAllister, J. M. D. Rincón

Citations: 17

Statistical T+2D Subband Modelling for Crowd Counting

2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2018-09-13 DOI: 10.1109/ICASSP.2018.8462345

Deepayan Bhowmik, A. Wallace

Citations: 1

Globally Optimal Energy Efficiency Maximization for Capacity-Limited Fronthaul C-RANs with Dynamic Power Amplifiers' Efficiency

2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2018-09-13 DOI: 10.1109/ICASSP.2018.8461308

K. Nguyen, Quang-Doanh Vu, Le-Nam Tran, M. Juntti

Citations: 1

Inexact Proximal Operators for $\ell_{p}$-Quasinorm Minimization

2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2018-09-13 DOI: 10.1109/ICASSP.2018.8462524

Cian O'Brien, Mark D. Plumbley

Citations: 0