ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)最新文献_第8页

Group Sparsity Based Target Localization for Distributed Sensor Array Networks 基于组稀疏度的分布式传感器阵列网络目标定位

ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2019-05-12 DOI: 10.1109/ICASSP.2019.8683867

Qing Shen, Wei Liu, Li Wang, Yin Liu

引用次数: 3

Estimation of Network Processes via Blind Graph Multi-filter Identification 基于盲图多滤波器辨识的网络过程估计

ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2019-05-12 DOI: 10.1109/ICASSP.2019.8683844

Yu Zhu, F. J. Garcia, A. Marques, Santiago Segarra

引用次数: 4

End-to-end Change Detection Using a Symmetric Fully Convolutional Network for Landslide Mapping 基于对称全卷积网络的滑坡映射端到端变化检测

ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2019-05-12 DOI: 10.1109/ICASSP.2019.8682802

Tao Lei, Qi Zhang, Dinghua Xue, Tao Chen, H. Meng, A. Nandi

引用次数: 20

Self-attention Based Prosodic Boundary Prediction for Chinese Speech Synthesis 基于自注意的汉语语音合成韵律边界预测

ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2019-05-12 DOI: 10.1109/ICASSP.2019.8682770

Chunhui Lu, Pengyuan Zhang, Yonghong Yan

引用次数: 24

Convex Combination of Constraint Vectors for Set-membership Affine Projection Algorithms 集隶属仿射投影算法约束向量的凸组合

ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2019-05-12 DOI: 10.1109/ICASSP.2019.8682305

T. Ferreira, W. Martins, Markus V. S. Lima, P. Diniz

引用次数: 3

Non-local Self-attention Structure for Function Approximation in Deep Reinforcement Learning 深度强化学习中函数逼近的非局部自注意结构

ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2019-05-12 DOI: 10.1109/ICASSP.2019.8682832

Z. Wang, Xi Xiao, Guangwu Hu, Yao Yao, Dianyan Zhang, Zhendong Peng, Qing Li, Shutao Xia

引用次数: 0

A Spiking Neural Network Approach to Auditory Source Lateralisation 听觉源侧化的脉冲神经网络方法

ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2019-05-12 DOI: 10.1109/ICASSP.2019.8683767

R. Luke, D. McAlpine

引用次数: 3

Information Theoretic Lower Bound of Restricted Isometry Property Constant 受限等距性质常数的信息论下界

ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2019-05-12 DOI: 10.1109/ICASSP.2019.8683742

Gen Li, Jingkai Yan, Yuantao Gu

引用次数: 1

A Variational Adaptive Population Importance Sampler 变分适应种群重要性采样器

ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2019-05-12 DOI: 10.1109/ICASSP.2019.8683152

Yousef El-Laham, P. Djurić, M. Bugallo

引用次数: 6

Towards End-to-end Speech-to-text Translation with Two-pass Decoding 基于双通道解码的端到端语音到文本翻译

ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2019-05-12 DOI: 10.1109/ICASSP.2019.8682801

Tzu-Wei Sung, Jun-You Liu, Hung-yi Lee, Lin-Shan Lee

{"title":"Towards End-to-end Speech-to-text Translation with Two-pass Decoding","authors":"Tzu-Wei Sung, Jun-You Liu, Hung-yi Lee, Lin-Shan Lee","doi":"10.1109/ICASSP.2019.8682801","DOIUrl":"https://doi.org/10.1109/ICASSP.2019.8682801","url":null,"abstract":"Speech-to-text translation (ST) refers to transforming the audio in source language to the text in target language. Mainstream solutions for such tasks are to cascade automatic speech recognition with machine translation, for which the transcriptions of the source language are needed in training. End-to-end approaches for ST tasks have been investigated because of not only technical interests such as to achieve globally optimized solution, but the need for ST tasks for the many source languages worldwide which do not have written form. In this paper, we propose a new end-to-end ST framework with two decoders to handle the relatively deeper relationships between the source language audio and target language text. The first-pass decoder generates some useful latent representations, and the second-pass decoder then integrates the output of both the encoder and the first-pass decoder to generate the text translation in target language. Only paired source language audio and target language text are used in training. Preliminary experiments on several language pairs showed improved performance, and offered some initial analysis.","PeriodicalId":13203,"journal":{"name":"ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"39 1","pages":"7175-7179"},"PeriodicalIF":0.0,"publicationDate":"2019-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88029031","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 25