ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP): Latest Publications

Full-Duplex Multifunction Transceiver with Joint Constant Envelope Transmission and Wideband Reception
Jaakko Marin, Micael Bernhardt, T. Riihonen
DOI: 10.1109/ICASSP39728.2021.9413725
Abstract: This paper introduces and justifies a novel system concept that consists of full-duplex transceivers and uses a multifunction signal for simultaneous two-way communication, jamming, and sensing tasks. The proposed device structure and waveform enable simple-yet-effective interference suppression at the cost of being limited to constant-envelope transmission. This is a weakness only for the communication functionality, which becomes limited to frequency-shift keying (FSK), while frequency-modulated continuous-wave (FMCW) waveforms are effective for jamming and sensing purposes. We show how the transmission and reception, as well as different interference and distortion compensation procedures, are implemented in such multifunction transceivers. The system could also be applied to simultaneous spectrum monitoring alongside the above functions. Finally, we showcase the expected performance of such a system through numerical results.
Citations: 4
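The constant-envelope property that the abstract trades communication flexibility for is easy to see in a waveform sketch. Below is a minimal, illustrative construction (not the authors' exact design) of an FMCW chirp whose instantaneous frequency also carries FSK data; all parameter values (fs, T, B, f_dev) are assumptions for illustration.

```python
import numpy as np

# Illustrative sketch, not the paper's exact waveform: a constant-envelope
# FMCW chirp whose instantaneous frequency is offset by an FSK symbol stream,
# so one transmission carries sensing/jamming (chirp) and data (FSK).
fs = 10e6          # sample rate [Hz] (assumed)
T = 1e-3           # chirp duration [s] (assumed)
B = 2e6            # sweep bandwidth [Hz] (assumed)
f_dev = 50e3       # FSK frequency deviation [Hz] (assumed)
bits_per_chirp = 10

t = np.arange(int(fs * T)) / fs
bits = np.random.randint(0, 2, bits_per_chirp)
# Hold each bit for an equal share of the chirp; map {0,1} -> {-f_dev,+f_dev}.
fsk_offset = (2 * bits[np.floor(bits_per_chirp * t / T).astype(int)] - 1) * f_dev

# Instantaneous frequency: linear sweep plus the data-dependent offset.
inst_freq = (B / T) * t + fsk_offset
phase = 2 * np.pi * np.cumsum(inst_freq) / fs
s = np.exp(1j * phase)            # |s| == 1: constant envelope by construction

assert np.allclose(np.abs(s), 1.0)
```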
UTDN: An Unsupervised Two-Stream Dirichlet-Net for Hyperspectral Unmixing
Qiwen Jin, Yong Ma, Xiaoguang Mei, Hao Li, Jiayi Ma
DOI: 10.1109/ICASSP39728.2021.9414810
Abstract: Recently, learning-based methods have received much attention in unsupervised hyperspectral unmixing, yet their ability to extract physically meaningful endmembers remains limited and their performance has not been satisfactory. In this paper, we propose a novel two-stream Dirichlet-net, termed UTDN, to address these problems. The weight-sharing architecture makes it possible to transfer the intrinsic properties of the endmembers during the unmixing process, which helps steer the network toward a more accurate and interpretable unmixing solution. Besides, the stick-breaking process is adopted to encourage the latent representation to follow a Dirichlet distribution, so that the physical properties of the estimated abundances can be naturally incorporated. Extensive experiments on both synthetic and real hyperspectral data demonstrate that the proposed UTDN outperforms other state-of-the-art approaches.
Citations: 1
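The stick-breaking construction mentioned in the abstract maps unconstrained encoder outputs onto the probability simplex, which is what makes the latent abundances nonnegative and sum to one. A minimal sketch of that transform follows (generic stick-breaking, not UTDN's specific encoder; shapes are illustrative).

```python
import torch

def stick_breaking(v: torch.Tensor) -> torch.Tensor:
    """Map fractions v in (0,1), shape (batch, K-1), to abundance vectors on
    the (K-1)-simplex: pi_k = v_k * prod_{j<k} (1 - v_j), remainder to pi_K.
    A generic sketch of the stick-breaking process the abstract mentions;
    UTDN's actual encoder parameterization is not specified here."""
    one = torch.ones(v.shape[0], 1, dtype=v.dtype)
    # Cumulative product of the "remaining stick" lengths, shifted right.
    remaining = torch.cumprod(1.0 - v, dim=1)
    pi = torch.cat([v, one], dim=1) * torch.cat([one, remaining], dim=1)
    return pi  # rows sum to 1, all entries nonnegative

v = torch.sigmoid(torch.randn(4, 5))   # e.g. encoder outputs for K=6 endmembers
abundances = stick_breaking(v)
assert torch.allclose(abundances.sum(dim=1), torch.ones(4))
```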
Detecting Alzheimer’s Disease from Speech Using Neural Networks with Bottleneck Features and Data Augmentation
Zhaoci Liu, Zhiqiang Guo, Zhenhua Ling, Yunxia Li
DOI: 10.1109/ICASSP39728.2021.9413566
Abstract: This paper presents a method for detecting Alzheimer’s disease (AD) from the spontaneous speech of subjects in a picture description task using neural networks. The method does not rely on manual transcriptions or annotations of a subject’s speech, but utilizes bottleneck features extracted from audio using an ASR model. The neural network contains convolutional neural network (CNN) layers for local context modeling, bidirectional long short-term memory (BiLSTM) layers for global context modeling, and an attention pooling layer for classification. Furthermore, a masking-based data augmentation method is designed to deal with the data scarcity problem. Experiments on the DementiaBank dataset show that the detection accuracy of the proposed method is 82.59%, better than a baseline based on manually designed acoustic features and support vector machines (SVM), and state-of-the-art for detecting AD from audio data alone on this dataset.
Citations: 11
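The abstract does not spell out the masking-based augmentation; a common realization is SpecAugment-style masking applied directly to the bottleneck-feature matrix, sketched below under that assumption (the function name, mask counts, and mask sizes are all illustrative).

```python
import numpy as np

def mask_augment(feats, num_time_masks=2, max_time=20,
                 num_feat_masks=2, max_feat=8, rng=None):
    """SpecAugment-style masking on a (frames, dims) bottleneck-feature matrix.
    The paper's exact masking scheme is not specified; this is one common choice."""
    rng = rng or np.random.default_rng()
    out = feats.copy()
    T, F = out.shape
    for _ in range(num_time_masks):              # zero out random frame spans
        w = rng.integers(1, max_time + 1)
        t0 = rng.integers(0, max(T - w, 1))
        out[t0:t0 + w, :] = 0.0
    for _ in range(num_feat_masks):              # zero out random feature bands
        w = rng.integers(1, max_feat + 1)
        f0 = rng.integers(0, max(F - w, 1))
        out[:, f0:f0 + w] = 0.0
    return out

augmented = mask_augment(np.random.randn(300, 40))  # e.g. 300 frames, 40 dims
```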
Decomposing Textures using Exponential Analysis
Yuan Hou, A. Cuyt, Wen-shin Lee, Deepayan Bhowmik
DOI: 10.1109/ICASSP39728.2021.9413909
Abstract: Decomposition is integral to most image processing algorithms and is often required in texture analysis. We present a new approach using a recent 2-dimensional exponential analysis technique. Exponential analysis offers the advantage of sparsity in the model and continuity in the parameters, resulting in a much more compact representation of textures compared to traditional Fourier or wavelet transform techniques. Our experiments include synthetic as well as real texture images from standard benchmark datasets. The results outperform the FFT in representing texture patterns with significantly fewer terms while retaining comparable RMSE values after reconstruction. The underlying periodic complex exponential model works best for texture patterns that are homogeneous. We demonstrate the usefulness of the method in two common vision processing applications, namely texture classification and defect detection.
Citations: 1
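Exponential analysis models a signal as a short sum of complex exponentials, f[n] ≈ Σ_j a_j z_j^n, which is why it can be far sparser than an FFT on quasi-periodic textures. A minimal 1D Prony-style sketch follows; the paper's 2-dimensional technique is more involved, so this only illustrates the underlying model.

```python
import numpy as np

def prony(f, M):
    """Recover a_j, z_j with f[n] ~= sum_j a_j * z_j**n from the samples f.
    Classic 1D Prony's method; the paper uses a 2-D exponential analysis
    variant, which this sketch only hints at."""
    N = len(f)
    # Linear prediction: f[n] = sum_{k=1..M} c_k f[n-k], solved least-squares.
    A = np.column_stack([f[M - k:N - k] for k in range(1, M + 1)])
    c = np.linalg.lstsq(A, f[M:N], rcond=None)[0]
    z = np.roots(np.concatenate(([1.0], -c)))          # exponential bases
    V = np.vander(z, N, increasing=True).T             # Vandermonde system
    a = np.linalg.lstsq(V, f, rcond=None)[0]           # amplitudes
    return a, z

# Two-term example: the recovered sparse model matches the samples.
n = np.arange(40)
f = 2.0 * 0.9 ** n + 1.0 * np.exp(1j * 0.5 * n)
a, z = prony(f, 2)
approx = np.vander(z, len(n), increasing=True).T @ a
assert np.allclose(approx, f, atol=1e-6)
```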
Periodic Signal Denoising: An Analysis-Synthesis Framework Based on Ramanujan Filter Banks and Dictionaries
Pranav Kulkarni, P. Vaidyanathan
DOI: 10.1109/ICASSP39728.2021.9413689
Abstract: Ramanujan filter banks (RFB) have in the past been used to identify periodicities in data. These are analysis filter banks with no synthesis counterpart for perfect reconstruction of the original signal, so they have not been useful for denoising periodic signals. This paper proposes a hybrid analysis-synthesis framework for denoising discrete-time periodic signals. The synthesis occurs via a pruned dictionary designed based on the output energies of the RFB analysis filters. A unique property of the framework is that the denoised output signal is guaranteed to be periodic, unlike with any of the other methods. For a large range of input noise levels, the proposed approach achieves a stable and high SNR gain, outperforming many traditional denoising techniques.
Citations: 3
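The RFB analysis filters are built from Ramanujan sums c_q(n). The sketch below computes c_q and compares output energies of a small bank to flag the dominant period; it is a toy version of the analysis side only (the pruned-dictionary synthesis step of the paper is not shown), with a simple matched-filter normalization chosen here for illustration.

```python
import numpy as np
from math import gcd

def ramanujan_sum(q, n):
    """Ramanujan sum c_q(n) = sum over k coprime to q of cos(2*pi*k*n/q).
    These sequences are the building blocks of Ramanujan filter banks."""
    k = np.array([k for k in range(1, q + 1) if gcd(k, q) == 1])
    return np.cos(2 * np.pi * np.outer(n, k) / q).sum(axis=1)

# Minimal analysis step (a sketch, not the paper's full RFB): convolve the
# signal with two periods of each c_q and compare normalized output energies.
n = np.arange(200)
x = np.sin(2 * np.pi * n / 7) + 0.3 * np.random.randn(200)   # period-7 signal
energies = {}
for q in range(2, 11):
    h = ramanujan_sum(q, np.arange(2 * q))       # filter: two periods of c_q
    y = np.convolve(x, h, mode="valid")
    energies[q] = float(np.sum(y ** 2) / np.sum(h ** 2))
print(max(energies, key=energies.get))           # expected to peak at q = 7
```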
A Large-Scale Chinese Long-Text Extractive Summarization Corpus
Kai Chen, Guanyu Fu, Qingcai Chen, Baotian Hu
DOI: 10.1109/ICASSP39728.2021.9414946
Abstract: Recently, large-scale datasets have vastly facilitated development in nearly all domains of Natural Language Processing. However, the lack of a large-scale Chinese corpus is still a critical bottleneck for further research on deep text summarization methods. In this paper, we publish a large-scale Chinese Long-text Extractive Summarization corpus named CLES. The corpus contains about 104K pairs, originally collected from Sina Weibo. To verify the quality of the corpus, we also manually tagged the relevance scores of 5,000 pairs. Our benchmark models on the proposed corpus include conventional deep-learning-based extractive models and several pre-trained BERT-based algorithms. Their performances are reported and briefly analyzed to facilitate further research on the corpus, which we will release publicly.
Citations: 3
Drawing Order Recovery from Trajectory Components
Minghao Yang, Xukang Zhou, Yangchang Sun, Jinglong Chen, Baohua Qiang
DOI: 10.1109/ICASSP39728.2021.9413542
Abstract: Although widely discussed, drawing order recovery (DOR) from static images remains a challenging task. Based on the idea that drawing trajectories can be recovered by connecting their trajectory components in the correct order, this work proposes a novel DOR method for static images. The method contains two steps. First, we adopt a convolutional neural network (CNN) to predict the next possible drawing components, which converts the components in an image into reasonable sequences; we denote this architecture Im2Seq-CNN. Second, since the reasonable sequences generated by the first step may contain errors, we construct a sequence-to-order structure (Seq2Order) to adjust the sequences into the correct orders. The main contributions are: (1) the Im2Seq-CNN step performs DOR on components instead of tracing trajectories pixel by pixel, converting static images into component sequences; (2) the Seq2Order step adopts image position codes instead of traditional point coordinates in its encoder-decoder gated recurrent neural network (GRU-RNN). The proposed method is evaluated on two well-known open handwriting databases and yields robust and competitive results on handwriting DOR tasks compared to the state of the art.
Citations: 1
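As a rough picture of the Seq2Order step, the sketch below wires up a toy encoder-decoder GRU over discrete "position codes"; the layer sizes, code vocabulary, and teacher-forced training setup are all assumptions for illustration, not the authors' configuration.

```python
import torch
import torch.nn as nn

class Seq2Order(nn.Module):
    """Toy encoder-decoder GRU in the spirit of the Seq2Order step: it reads a
    sequence of component codes (learned embeddings of grid positions, standing
    in for the paper's "image position codes") and emits them in a predicted
    drawing order. Dimensions and vocabulary are illustrative assumptions."""
    def __init__(self, n_positions=64, d=128):
        super().__init__()
        self.embed = nn.Embedding(n_positions, d)
        self.encoder = nn.GRU(d, d, batch_first=True)
        self.decoder = nn.GRU(d, d, batch_first=True)
        self.out = nn.Linear(d, n_positions)

    def forward(self, src_codes, tgt_codes):
        # src_codes: (B, S) position codes in arbitrary (CNN-proposed) order;
        # tgt_codes: (B, S) teacher-forced decoder inputs.
        _, h = self.encoder(self.embed(src_codes))
        dec_out, _ = self.decoder(self.embed(tgt_codes), h)
        return self.out(dec_out)     # (B, S, n_positions) next-code logits

model = Seq2Order()
src = torch.randint(0, 64, (2, 10))
logits = model(src, src)             # shape check only; training loop omitted
loss = nn.functional.cross_entropy(logits.reshape(-1, 64), src.reshape(-1))
```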
A Structure-Guided and Sparse-Representation-Based 3d Seismic Inversion Method
B. She, Yaojun Wang, Guang Hu
DOI: 10.1109/ICASSP39728.2021.9415071
Abstract: Existing seismic inversion methods are usually 1D, mainly focusing on improving the vertical resolution of inversion results. The few 2D or 3D inversion techniques are either too simple, lacking consideration of stratigraphic structures, or too complicated, requiring dip information to be extracted and a complex constrained optimization problem to be solved. In this work, with the help of the gradient structure tensor (GST) and dictionary learning and sparse representation (DLSR), we propose a 3D inversion approach (GST-DLSR) that considers both vertical and horizontal structural constraints. In the vertical direction, we learn the structural features of subsurface models from well-log data by DLSR. In the horizontal direction, we obtain stratigraphic structural features from a 3D seismic image by GST. We then apply the acquired structural features to constrain the entire inversion procedure. Experiments show that GST-DLSR takes advantage of both techniques, producing inversion results with high resolution, good lateral continuity, and enhanced structural features.
Citations: 0
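The gradient structure tensor the horizontal constraint relies on is standard: smooth the outer products of image gradients, then read orientation and coherence off the 2x2 eigenstructure. A sketch under those standard definitions follows; how GST-DLSR injects the result into the inversion is not shown here.

```python
import numpy as np
from scipy.ndimage import gaussian_filter

def structure_tensor_orientation(img, sigma_g=1.0, sigma_t=3.0):
    """Estimate local stratigraphic orientation from a 2-D seismic slice via
    the gradient structure tensor (GST). Standard GST construction; sigma
    values are illustrative assumptions."""
    # Gaussian-derivative gradients, then smoothed outer products.
    gx = gaussian_filter(img, sigma_g, order=(0, 1))
    gy = gaussian_filter(img, sigma_g, order=(1, 0))
    Jxx = gaussian_filter(gx * gx, sigma_t)
    Jxy = gaussian_filter(gx * gy, sigma_t)
    Jyy = gaussian_filter(gy * gy, sigma_t)
    # Orientation of the dominant eigenvector of [[Jxx, Jxy], [Jxy, Jyy]].
    theta = 0.5 * np.arctan2(2.0 * Jxy, Jxx - Jyy)
    # Coherence in [0, 1]: close to 1 where layering is strongly oriented.
    lam = np.sqrt((Jxx - Jyy) ** 2 + 4.0 * Jxy ** 2)
    coherence = lam / (Jxx + Jyy + 1e-12)
    return theta, coherence

theta, coh = structure_tensor_orientation(np.random.randn(128, 128))
```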
Evolving Quantized Neural Networks for Image Classification Using A Multi-Objective Genetic Algorithm
Yong Wang, Xiaojing Wang, Xiaoyu He
DOI: 10.1109/ICASSP39728.2021.9413519
Abstract: Recently, many model quantization approaches have been investigated to reduce model size and improve the inference speed of convolutional neural networks (CNNs). However, these approaches usually lead to a decrease in classification accuracy. To address this problem, this paper proposes a mixed-precision quantization method combined with channel expansion of CNNs using a multi-objective genetic algorithm, called MOGAQNN. In MOGAQNN, each individual in the population encodes a mixed-precision quantization policy and a channel expansion policy. During the evolution process, the two policies are optimized simultaneously by the non-dominated sorting genetic algorithm II (NSGA-II). Finally, we choose the best individual in the last population and evaluate it on the test set to obtain the final performance. Experimental results with five popular CNNs on two benchmark datasets demonstrate that MOGAQNN can greatly reduce model size and improve classification accuracy at the same time.
Citations: 0
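As a concrete picture of what one evolved individual might encode, the sketch below pairs a symmetric uniform quantizer with a per-layer genome of bit-widths and expansion ratios. The quantizer form, gene alphabets, and layer count are assumptions for illustration, not MOGAQNN's specification.

```python
import numpy as np

def quantize_uniform(w, bits):
    """Symmetric uniform quantization of a weight tensor to `bits` bits:
    the basic per-layer operation a mixed-precision policy selects.
    MOGAQNN's exact quantizer and search operators are not specified here."""
    qmax = 2 ** (bits - 1) - 1
    w_abs = np.max(np.abs(w))
    scale = w_abs / qmax if w_abs > 0 else 1.0
    return np.clip(np.round(w / scale), -qmax, qmax) * scale

# One candidate "individual": per-layer bit-widths plus channel-expansion
# ratios, the two policies the genetic algorithm evolves jointly.
n_layers = 5
individual = {
    "bits": np.random.choice([2, 4, 8], size=n_layers),
    "expand": np.random.choice([1.0, 1.25, 1.5], size=n_layers),
}
layer_w = np.random.randn(64, 3, 3, 3)                    # e.g. a conv kernel
w_q = quantize_uniform(layer_w, int(individual["bits"][0]))
# NSGA-II would then rank individuals on (model size, accuracy) and evolve.
```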
An Investigation of Using Hybrid Modeling Units for Improving End-to-End Speech Recognition System
Shunfei Chen, Xinhui Hu, Sheng Li, Xinkang Xu
DOI: 10.1109/ICASSP39728.2021.9414598
Abstract: The acoustic modeling unit is crucial for an end-to-end speech recognition system, especially for Mandarin. Until now, most studies on Mandarin speech recognition have focused on individual units, and few have paid attention to combinations of these units. This paper uses a hybrid of syllable, Chinese character, and subword units for an end-to-end speech recognition system based on CTC/attention multi-task learning. In this approach, the character-subword unit is used to train the Transformer model in the main task, while the syllable unit enhances the Transformer's shared encoder in the auxiliary task through the Connectionist Temporal Classification (CTC) loss function. Recognition experiments were conducted on AISHELL-1 and on an open 1200-hour Mandarin speech corpus collected from OpenSLR. The results demonstrate that the syllable-char-subword hybrid modeling unit achieves better performance than the conventional char-subword units, with a 6.6% relative CER reduction on the 1200-hour data. The substitution error rate is also considerably reduced.
Citations: 7
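The joint objective described here is the usual hybrid CTC/attention weighting, with the twist that the two branches see different unit inventories. A shape-level sketch under assumed dimensions and an assumed weight lambda:

```python
import torch
import torch.nn as nn

# Sketch of the hybrid-unit multi-task objective: a CTC loss over syllable
# targets regularizes the shared encoder, while the attention decoder is
# trained on char-subword targets. All shapes and lambda are assumptions.
B, T_enc, D = 4, 50, 256
n_syllables, n_subwords = 400, 5000
lam = 0.3                                    # CTC weight (assumed value)

enc_out = torch.randn(T_enc, B, D)           # shared encoder output (T, B, D)
ctc_head = nn.Linear(D, n_syllables + 1)     # +1 for the CTC blank symbol
log_probs = ctc_head(enc_out).log_softmax(-1)

syl_targets = torch.randint(1, n_syllables + 1, (B, 12))   # 0 reserved: blank
input_lens = torch.full((B,), T_enc, dtype=torch.long)
target_lens = torch.full((B,), 12, dtype=torch.long)
ctc = nn.CTCLoss(blank=0)(log_probs, syl_targets, input_lens, target_lens)

dec_logits = torch.randn(B, 20, n_subwords)  # stand-in for decoder outputs
subword_targets = torch.randint(0, n_subwords, (B, 20))
att = nn.functional.cross_entropy(
    dec_logits.reshape(-1, n_subwords), subword_targets.reshape(-1))

loss = lam * ctc + (1.0 - lam) * att         # joint CTC/attention objective
```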