ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)最新文献_第4页

Preconditioned Ghost Imaging Via Sparsity Constraint 通过稀疏性约束的预条件鬼影成像

ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2020-05-01 DOI: 10.1109/ICASSP40776.2020.9053414

Zhishen Tong, Jian Wang, Shensheng Han

引用次数: 2

End-end Speech-to-Text Translation with Modality Agnostic Meta-Learning 基于情态不可知元学习的端到端语音到文本翻译

ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2020-05-01 DOI: 10.1109/ICASSP40776.2020.9054759

S. Indurthi, HyoJung Han, Nikhil Kumar Lakumarapu, Beomseok Lee, Insoo Chung, Sangha Kim, Chanwoo Kim

{"title":"End-end Speech-to-Text Translation with Modality Agnostic Meta-Learning","authors":"S. Indurthi, HyoJung Han, Nikhil Kumar Lakumarapu, Beomseok Lee, Insoo Chung, Sangha Kim, Chanwoo Kim","doi":"10.1109/ICASSP40776.2020.9054759","DOIUrl":"https://doi.org/10.1109/ICASSP40776.2020.9054759","url":null,"abstract":"Collecting large amounts of data to train end-to-end Speech Translation (ST) models is more difficult compared to the ASR and MT tasks. Previous studies have proposed the use of transfer learning approaches to overcome the above difficulty. These approaches benefit from weakly supervised training data, such as ASR speech-to-transcript or MT text-to-text translation pairs. However, the parameters in these models are updated independently of each task, which may lead to sub-optimal solutions. In this work, we adopt a meta-learning algorithm to train a modality agnostic multi-task model that transfers knowledge from source tasks=ASR+MT to target task=ST where the ST task severely lacks data. In the meta-learning phase, parameters are updated in such a way that they act as a good ini-tialization point for the target ST task. We evaluate the proposed meta-learning approach for ST tasks on English-German (En-De) and English-French (En-Fr) language pairs from the Multilingual Speech Translation Corpus (MuST-C). Our method outperforms the previous transfer learning approaches and sets new state-of-the-art results for En-De and En-Fr ST tasks by obtaining 9.18, and 11.76 BLEU point improvements, respectively.","PeriodicalId":13127,"journal":{"name":"ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"145 6 1","pages":"7904-7908"},"PeriodicalIF":0.0,"publicationDate":"2020-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79392323","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 31

Indoor Altitude Estimation of Unmanned Aerial Vehicles Using a Bank of Kalman Filters 基于卡尔曼滤波器的无人机室内高度估计

ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2020-05-01 DOI: 10.1109/ICASSP40776.2020.9054203

Liu Yang, Hechuan Wang, Yousef El-Laham, J. Fonte, David Trillo Pérez, M. Bugallo

引用次数: 3

On the Effect of Reflectance on Phasor Field Non-Line-of-Sight Imaging 反射率对相场非视距成像的影响

ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2020-05-01 DOI: 10.1109/ICASSP40776.2020.9052985

Ibón Guillén, Xiaochun Liu, A. Velten, D. Gutierrez, A. Jarabo

{"title":"On the Effect of Reflectance on Phasor Field Non-Line-of-Sight Imaging","authors":"Ibón Guillén, Xiaochun Liu, A. Velten, D. Gutierrez, A. Jarabo","doi":"10.1109/ICASSP40776.2020.9052985","DOIUrl":"https://doi.org/10.1109/ICASSP40776.2020.9052985","url":null,"abstract":"Non-line-of-sight (NLOS) imaging aims to visualize occluded scenes by exploiting indirect reflections on visible surfaces. Previous methods approach this problem by inverting the light transport on the hidden scene, but are limited to isolated, diffuse objects. The recently introduced phasor fields framework computationally poses NLOS reconstruction as a virtual line-of-sight (LOS) problem, lifting most assumptions about the hidden scene. In this work we complement recent theoretical analysis of phasor field-based reconstruction, by empirically analyzing the effect of reflectance of the hidden scenes on reconstruction. We experimentally study the reconstruction of hidden scenes composed of objects with increasingly specular materials. Then, we evaluate the effect of the virtual aperture size on the reconstruction, and establish connections between the effect of these two different dimensions on the results. We hope our analysis helps to characterize the imaging capabilities of this promising new framework, and foster new NLOS imaging modalities.","PeriodicalId":13127,"journal":{"name":"ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"10 1","pages":"9269-9273"},"PeriodicalIF":0.0,"publicationDate":"2020-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84888628","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 5

DGAN: Disentangled Representation Learning for Anisotropic BRDF Reconstruction 各向异性BRDF重建的解纠缠表示学习

ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2020-05-01 DOI: 10.1109/ICASSP40776.2020.9054095

Zhongyun Hu, Xue Wang, Qing Wang

引用次数: 0

Robust Global Optimized Affine Registration Method for Microscopic Images of Biological Tissue 生物组织显微图像的鲁棒全局优化仿射配准方法

ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2020-05-01 DOI: 10.1109/ICASSP40776.2020.9054568

Yanan Lv, Xi Chen, Chang Shu, Hua Han

引用次数: 3

Time-Frequency Loss for CNN Based Speech Super-Resolution 基于CNN的语音超分辨率时频损失

ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2020-05-01 DOI: 10.1109/ICASSP40776.2020.9053712

Heming Wang, Deliang Wang

引用次数: 17

Generalized Spatial Modulation for Wireless Terabits Systems Under Sub-THZ Channel With RF Impairments 具有射频损伤的亚太赫兹信道下无线太比特系统的广义空间调制

ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2020-05-01 DOI: 10.1109/ICASSP40776.2020.9053208

Majed Saad, F. Bader, A. Ghouwayel, Hussein Hijazi, Nizar Bouhel, J. Palicot

{"title":"Generalized Spatial Modulation for Wireless Terabits Systems Under Sub-THZ Channel With RF Impairments","authors":"Majed Saad, F. Bader, A. Ghouwayel, Hussein Hijazi, Nizar Bouhel, J. Palicot","doi":"10.1109/ICASSP40776.2020.9053208","DOIUrl":"https://doi.org/10.1109/ICASSP40776.2020.9053208","url":null,"abstract":"Multiple-Input Multiple-Output (MIMO) technique with Index Modulation (IM) over sub-TeraHertz (sub-THz) bands represent a promising solution to design new wireless ultrahigh data rate systems. However, the system design over sub-THz bands suffers from many technological limitations and severe RF-impairments such as low output power, limited resolution of high-speed low-power Analog-to-Digital Converters and important Phase Noise (PN) introduced by the Local Oscillator (LO). In this paper, different modulations schemes with Generalized Spatial Modulation (GSM) are compared from different perspectives while considering the sub-THz impairments. The effect of PN has been investigated for these modulation schemes in sub-THz channels using uniform linear and rectangular antenna arrays. The obtained results reveal that QPSK-GSM system is the best combination compared to GSM systems with any other Mary modulation scheme (e.g. PSK, DPSK, QAM, PAM). Compared to DQPSK-GSM and 4PAM-GSM at 12bpcu, same number of receive and activated transmit antennas, the QPSK-GSM system offers a gain ranging from 3.4 dB up to 5 dB. The results reveals that low to medium residual PN in distributed oscillator architecture can be tolerated when using GSM-QPSK without phase noise mitigation. Thus, enforcing the GSM to be a promising candidate for ultra-high wireless data rate communication in sub-THz bands.","PeriodicalId":13127,"journal":{"name":"ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"17 1","pages":"5135-5139"},"PeriodicalIF":0.0,"publicationDate":"2020-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85198100","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 10

Gaussian Lpcnet for Multisample Speech Synthesis 多样本语音合成的高斯Lpcnet

ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2020-05-01 DOI: 10.1109/ICASSP40776.2020.9053337

Vadim Popov, M. Kudinov, T. Sadekova

引用次数: 11

Robust Hybrid Beamforming for Satellite-Terrestrial Integrated Networks 星地融合网络的鲁棒混合波束形成

ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2020-05-01 DOI: 10.1109/ICASSP40776.2020.9053756

Zhi Lin, Min Lin, B. Champagne, Wei-Ping Zhu, N. Al-Dhahir

引用次数: 4