ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)最新文献

Vocal Tract Articulatory Contour Detection in Real-Time Magnetic Resonance Images Using Spatio-Temporal Context 基于时空背景的实时磁共振图像声道发音轮廓检测

ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2020-05-01 DOI: 10.1109/ICASSP40776.2020.9053111

Ashwin Hebbar, Rahul Sharma, Krishna Somandepalli, Asterios Toutios, Shrikanth S. Narayanan

{"title":"Vocal Tract Articulatory Contour Detection in Real-Time Magnetic Resonance Images Using Spatio-Temporal Context","authors":"Ashwin Hebbar, Rahul Sharma, Krishna Somandepalli, Asterios Toutios, Shrikanth S. Narayanan","doi":"10.1109/ICASSP40776.2020.9053111","DOIUrl":"https://doi.org/10.1109/ICASSP40776.2020.9053111","url":null,"abstract":"Due to its ability to visualize and measure the dynamics of vocal tract shaping during speech production, real-time magnetic resonance imaging (rtMRI) has emerged as one of the prominent research tools. The ability to track different articulators such as the tongue, lips, velum, and the pharynx is a crucial step toward automating further scientific and clinical analysis. Recently, various researchers have addressed the problem of detecting articulatory boundaries, but those are primarily limited to static-image based methods. In this work, we propose to use information from temporal dynamics together with the spatial structure to detect the articulatory boundaries in rtMRI videos. We train a convolutional LSTM network to detect and label the articulatory contours. We compare the produced contours against reference labels generated by iteratively fitting a manually created subject-specific template. We observe that the proposed method outperforms solely image-based methods, especially for the difficult-to-track articulators involved in airway constriction formation during speech.","PeriodicalId":13127,"journal":{"name":"ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"104 1","pages":"7354-7358"},"PeriodicalIF":0.0,"publicationDate":"2020-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73471078","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 5

Interpretability-Guided Convolutional Neural Networks for Seismic Fault Segmentation 基于可解释性的卷积神经网络地震断层分割

ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2020-05-01 DOI: 10.1109/ICASSP40776.2020.9053472

Zhining Liu, Cheng Zhou, Guangmin Hu, Chengyun Song

{"title":"Interpretability-Guided Convolutional Neural Networks for Seismic Fault Segmentation","authors":"Zhining Liu, Cheng Zhou, Guangmin Hu, Chengyun Song","doi":"10.1109/ICASSP40776.2020.9053472","DOIUrl":"https://doi.org/10.1109/ICASSP40776.2020.9053472","url":null,"abstract":"Delineating the seismic fault, which is an important type of geologic structures in seismic images, is a key step for seismic interpretation. Comparing with conventional methods that design a number of hand-crafted features based on the observed characteristics of the seismic fault, convolutional neural networks (CNNs) have proven to be more powerful for automatically learning effective representations. However, the CNN usually serves as a black box in the process of training and inference, which would lead to trust issues. The inability of humans to understand the CNN would be more problematic, especially in critical areas like seismic exploration, medicine and financial markets. To include domain knowledge to improve the interpretability of the CNN, we propose to jointly optimize the prediction accuracy and consistency between explanations of the neural network and domain knowledge. Taking the seismic fault segmentation as an example, we show that the proposed method not only gives reasonable explanations for its predictions, but also more accurately predicts faults than the baseline model.","PeriodicalId":13127,"journal":{"name":"ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"22 1","pages":"4312-4316"},"PeriodicalIF":0.0,"publicationDate":"2020-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73474098","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Improving Proper Noun Recognition in End-To-End Asr by Customization of the Mwer Loss Criterion 自定义词频损耗准则改进端到端自动识别中的专有名词识别

ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2020-05-01 DOI: 10.1109/ICASSP40776.2020.9054235

Cal Peyser, Tara N. Sainath, G. Pundak

引用次数: 11

Coded Illumination and Multiplexing for Lensless Imaging 无透镜成像的编码照明和多路复用

ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2020-05-01 DOI: 10.1109/ICASSP40776.2020.9052955

Yucheng Zheng, Rongjia Zhang, M. Salman Asif

引用次数: 2

Exploiting Channel Locality for Adaptive Massive MIMO Signal Detection 利用信道局部性实现自适应海量MIMO信号检测

ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2020-05-01 DOI: 10.1109/ICASSP40776.2020.9052971

Mehrdad Khani Shirkoohi, Mohammad Alizadeh, J. Hoydis, Phil Fleming

引用次数: 1

Hybrid Active Contour Driven by Double-Weighted Signed Pressure Force for Image Segmentation 双加权签名压力驱动的混合主动轮廓图像分割

ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2020-05-01 DOI: 10.1109/ICASSP40776.2020.9054627

Xingyu Fu, Bin Fang, Mingliang Zhou, Jiajun Li

引用次数: 1

Enhanced Action Tubelet Detector for Spatio-Temporal Video Action Detection 用于时空视频动作检测的增强型动作小管检测器

ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2020-05-01 DOI: 10.1109/ICASSP40776.2020.9054394

Yutang Wu, Hanli Wang, Shuheng Wang, Qinyu Li

引用次数: 1

Rate-Invariant Autoencoding of Time-Series 时间序列的率不变自编码

ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2020-05-01 DOI: 10.1109/ICASSP40776.2020.9053983

K. Koneripalli, Suhas Lohit, Rushil Anirudh, P. Turaga

引用次数: 11

SED-MDD: Towards Sentence Dependent End-To-End Mispronunciation Detection and Diagnosis 基于句子的端到端发音错误检测与诊断

ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2020-05-01 DOI: 10.1109/ICASSP40776.2020.9052975

Yiqing Feng, Guanyu Fu, Qingcai Chen, Kai Chen

引用次数: 40

Theoretical Analysis of Multi-Carrier Agile Phased Array Radar 多载波敏捷相控阵雷达的理论分析

ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2020-05-01 DOI: 10.1109/ICASSP40776.2020.9054035

Tianyao Huang, Nir Shlezinger, Xingyu Xu, Dingyou Ma, Yimin Liu, Yonina C. Eldar

引用次数: 1