ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP): Latest Publications

A Unified Two-Stage Model for Separating Superimposed Images
Huiyu Duan, Xiongkuo Min, Wei Shen, Guangtao Zhai
DOI: https://doi.org/10.1109/icassp43922.2022.9746606 | Published: 2022-05-23
Abstract: A single superimposed image containing two image views causes visual confusion for both human vision and computer vision. Human vision needs a "develop-then-rival" process to decompose the superimposed image into two individual images, which effectively suppresses visual confusion. In this paper, we propose a human vision-inspired framework for separating superimposed images. We first propose a network to simulate the development stage, which tries to understand and distinguish the semantic information of the two layers of a single superimposed image. To further simulate the rivalry activation/suppression process in human brains, we carefully design a rivalry stage, which combines the original mixed input (the superimposed image) with the activated visual information (the outputs of the development stage) and then rivals to obtain images without ambiguity. Experimental results show that our framework effectively separates superimposed images and significantly outperforms state-of-the-art methods in output quality.
Citations: 3
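The abstract describes a two-stage, develop-then-rival data flow: a development network first predicts both layers from the mixture, and a rivalry stage refines them given the mixture together with the initial estimates. The PyTorch sketch below only illustrates that data flow; the layer widths, depths, and losses are our own assumptions, not the authors' architecture.

```python
import torch
import torch.nn as nn

class TwoStageSeparator(nn.Module):
    """Minimal develop-then-rival sketch (assumed architecture, not the paper's)."""
    def __init__(self, ch=3, feat=32):
        super().__init__()
        # development stage: coarse estimates of both layers from the mixture
        self.develop = nn.Sequential(
            nn.Conv2d(ch, feat, 3, padding=1), nn.ReLU(),
            nn.Conv2d(feat, 2 * ch, 3, padding=1))
        # rivalry stage: refine using the mixture plus both coarse estimates
        self.rival = nn.Sequential(
            nn.Conv2d(3 * ch, feat, 3, padding=1), nn.ReLU(),
            nn.Conv2d(feat, 2 * ch, 3, padding=1))

    def forward(self, mixed):                      # mixed: (B, ch, H, W)
        coarse = self.develop(mixed)               # (B, 2*ch, H, W)
        refined = self.rival(torch.cat([mixed, coarse], dim=1))
        return refined.chunk(2, dim=1)             # two separated images
```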
Attention Guided Invariance Selection for Local Feature Descriptors
Jiapeng Li, Ge Li, Thomas H. Li
DOI: https://doi.org/10.1109/icassp43922.2022.9746419 | Published: 2022-05-23
Abstract: To cope with the extreme variations of illumination and rotation in the real world, popular descriptors have recently captured more invariance, but more invariance makes descriptors less informative. This paper therefore designs an attention guided framework (named AISLFD) to select appropriate invariance for local feature descriptors, which boosts descriptor performance even in scenes with extreme changes. Specifically, we first explore an efficient multi-scale feature extraction module that provides our local descriptors with more useful information. In addition, we propose a novel parallel self-attention module to obtain meta descriptors with a global receptive field, which guides the invariance selection more accurately. Extensive experiments show that our method achieves performance competitive with state-of-the-art methods.
Citations: 3
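The core idea of selecting among descriptor variants with different degrees of invariance can be sketched as an attention-weighted soft selection. The module below is a toy illustration under our own assumptions (precomputed descriptor variants, a single linear scoring layer); it is not the paper's AISLFD or its parallel self-attention module.

```python
import torch
import torch.nn as nn

class InvarianceSelector(nn.Module):
    """Toy soft selection among descriptor variants guided by a meta descriptor
    (our own illustration, not the paper's module)."""
    def __init__(self, dim, n_variants):
        super().__init__()
        self.score = nn.Linear(dim, n_variants)

    def forward(self, meta_desc, variants):
        # meta_desc: (N, dim); variants: (N, n_variants, dim), e.g. rotation-
        # invariant vs. rotation-variant descriptors for the same keypoints
        w = torch.softmax(self.score(meta_desc), dim=-1)   # selection weights
        return (w.unsqueeze(-1) * variants).sum(dim=1)     # blended descriptor
```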
New Improved Criterion for Model Selection in Sparse High-Dimensional Linear Regression Models
P. B. Gohain, M. Jansson
DOI: https://doi.org/10.1109/icassp43922.2022.9746867 | Published: 2022-05-23
Abstract: The extended Bayesian information criterion (EBIC) and the extended Fisher information criterion (EFIC) are two popular criteria for model selection in sparse high-dimensional linear regression models. However, EBIC is inconsistent in scenarios where the signal-to-noise ratio (SNR) is high but the sample size is small, and EFIC is not invariant to data scaling, which affects its performance under different signal and noise statistics. In this paper, we present a refined criterion called EBICR, where the 'R' stands for robust. EBICR is an improved version of EBIC and EFIC. It is scale-invariant and a consistent estimator of the true model as the sample size grows large and/or the SNR tends to infinity. The performance of EBICR is compared to existing methods such as EBIC, EFIC and the multi-beta-test (MBT). Simulation results indicate that EBICR identifies the true model on par with or better than the other considered methods.
Citations: 4
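The abstract does not give the EBICR formula, so the sketch below shows only the baseline extended BIC (Chen and Chen, 2008) that this family of criteria refines, for a Gaussian linear model with k of p regressors selected. The function and parameter names are ours; rss is the residual sum of squares, n the sample size, and gamma the usual EBIC tuning parameter.

```python
from math import lgamma, log

def log_binom(p, k):
    """log of the binomial coefficient C(p, k), computed via log-gamma."""
    return lgamma(p + 1) - lgamma(k + 1) - lgamma(p - k + 1)

def ebic(rss, n, k, p, gamma=1.0):
    """Baseline extended BIC for a Gaussian linear model: residual fit term,
    the usual BIC complexity penalty, and the extra 2*gamma*log C(p, k) term
    that accounts for the size of the candidate model space."""
    return n * log(rss / n) + k * log(n) + 2 * gamma * log_binom(p, k)
```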
HOQRI: Higher-Order QR Iteration for Scalable Tucker Decomposition
Yuchen Sun, Kejun Huang
DOI: https://doi.org/10.1109/icassp43922.2022.9746726 | Published: 2022-05-23
Abstract: We propose a new algorithm called higher-order QR iteration (HOQRI) for computing the Tucker decomposition of large and sparse tensors. Compared to the celebrated higher-order orthogonal iterations (HOOI), HOQRI relies on a simple orthogonalization step in each iteration rather than the more sophisticated singular value decomposition step used in HOOI. More importantly, when dealing with extremely large and sparse data tensors, HOQRI completely eliminates the intermediate memory explosion by defining a new sparse tensor operation called TTMcTC. Furthermore, HOQRI is shown to monotonically improve the objective function, thus enjoying the same convergence guarantee as HOOI. Numerical experiments on synthetic and real data showcase the effectiveness of HOQRI.
Citations: 2
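As a rough illustration of trading HOOI's truncated SVD for a QR-based orthogonalization, the dense NumPy sweep below replaces the leading-singular-vector update with one orthogonal (subspace) iteration step. This is our own reconstruction for small dense 3-way tensors only; the paper's sparse TTMcTC operation and the exact HOQRI update are not reproduced here.

```python
import numpy as np

def unfold(T, mode):
    """Mode-n unfolding of a 3-way tensor into a matrix."""
    return np.moveaxis(T, mode, 0).reshape(T.shape[mode], -1)

def qr_sweep(X, U):
    """One HOOI-style sweep where each factor update uses a QR-based
    orthogonal-iteration step instead of a truncated SVD (dense sketch,
    assumes each rank is at most the corresponding tensor dimension)."""
    for n in range(3):
        Y = X
        for m in range(3):
            if m != n:
                # multiply mode m by U[m]^T: contract dimension I_m down to R_m
                Y = np.moveaxis(np.tensordot(Y, U[m], axes=(m, 0)), -1, m)
        Yn = unfold(Y, n)                          # (I_n, product of other ranks)
        Q, _ = np.linalg.qr(Yn @ (Yn.T @ U[n]))    # orthogonalize, no SVD
        U[n] = Q
    return U
```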
Signal Recovery from Inconsistent Nonlinear Observations
P. L. Combettes, Zev Woodstock
DOI: https://doi.org/10.1109/icassp43922.2022.9746145 | Published: 2022-05-23
Abstract: We show that many nonlinear observation models in signal recovery can be represented using firmly nonexpansive operators. To address problems with inaccurate measurements, we propose solving a variational inequality relaxation which is guaranteed to possess solutions under mild conditions and which coincides with the original problem if it happens to be consistent. We then present an efficient algorithm for its solution, as well as numerical applications in signal and image recovery, including an experimental operator-theoretic method of promoting sparsity.
Citations: 0
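For context on the key property the abstract relies on: an operator T is firmly nonexpansive when ||Tx - Ty||^2 <= <x - y, Tx - Ty> for all x, y, and projections onto convex sets (such as the clipping that models a saturated sensor) satisfy it. The snippet below is only a numerical sanity check of that inequality on random pairs, not part of the paper's algorithm.

```python
import numpy as np

def looks_firmly_nonexpansive(T, dim=8, trials=1000, tol=1e-9, seed=0):
    """Check ||Tx - Ty||^2 <= <x - y, Tx - Ty> on random pairs
    (numerical evidence only; not a proof)."""
    rng = np.random.default_rng(seed)
    for _ in range(trials):
        x, y = rng.standard_normal((2, dim))
        d = T(x) - T(y)
        if d @ d > (x - y) @ d + tol:
            return False
    return True

# Clipping to [-1, 1] is the projection onto a box, hence firmly nonexpansive.
print(looks_firmly_nonexpansive(lambda v: np.clip(v, -1.0, 1.0)))  # True
```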
Generation of Personal Sound Fields in Reverberant Environments Using Interframe Correlation
Liming Shi, Guoli Ping, Xiaoxiang Shen, M. G. Christensen
DOI: https://doi.org/10.1109/icassp43922.2022.9747574 | Published: 2022-05-23
Abstract: Personal sound field control techniques aim to reproduce different sound contents in different regions of an acoustic space without interference. The limitations of state-of-the-art methods for sound field control include high latency and computational complexity, especially when the reverberation time is long and the number of loudspeakers is large. In this paper, we propose a personal sound field control approach that exploits interframe correlation. By taking past frames into account, the proposed method can accommodate long reverberation times with low latency. To find the optimal parameters for the physically meaningful constraints, subspace decomposition and Newton's method are applied. Furthermore, a sound-field-distortion-oriented subspace construction method is proposed to reduce the subspace dimension. Simulation results with measured room impulse responses show that, compared with traditional methods, the proposed algorithm obtains a good trade-off between acoustic contrast and reproduction error at low latency.
Citations: 1
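For reference, the classic acoustic contrast control baseline that personal sound field methods are typically measured against chooses loudspeaker weights that maximize bright-zone energy relative to dark-zone energy, which reduces to a generalized eigenvalue problem. The sketch below shows only that baseline, with assumed plant matrices G_bright and G_dark (microphones x loudspeakers); it is not the paper's interframe-correlation method.

```python
import numpy as np
from scipy.linalg import eigh

def acoustic_contrast_weights(G_bright, G_dark, reg=1e-6):
    """Classic acoustic contrast control: maximize w^H R_b w / w^H R_d w.
    The maximizer is the principal generalized eigenvector of (R_b, R_d)."""
    R_b = G_bright.conj().T @ G_bright
    R_d = G_dark.conj().T @ G_dark + reg * np.eye(G_dark.shape[1])
    vals, vecs = eigh(R_b, R_d)       # ascending generalized eigenvalues
    return vecs[:, -1]                # weights giving the largest contrast
```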
Learning Task-Specific Representation for Video Anomaly Detection with Spatial-Temporal Attention
Y. Liu, Jing Liu, Xiaoguang Zhu, Donglai Wei, Xiaohong Huang, Liang Song
DOI: https://doi.org/10.1109/icassp43922.2022.9746822 | Published: 2022-05-23
Abstract: The automatic detection of abnormal events in surveillance videos with weak supervision has been formulated as a multiple instance learning task, which aims to temporally localize the clips containing abnormal events using only video-level labels. However, most existing methods rely on features extracted by pre-trained action recognition models, which are not discriminative enough for video anomaly detection. In this work, we propose a spatial-temporal attention mechanism to learn inter- and intra-correlations of video clips, and the boosted features are encouraged to be task-specific via a mutual cosine embedding loss. Experimental results on standard benchmarks demonstrate the effectiveness of the spatial-temporal attention, and our method achieves performance superior to state-of-the-art methods.
Citations: 22
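The weakly supervised formulation referred to in the abstract is commonly trained with a multiple-instance ranking objective: the highest-scoring clip of an abnormal video should outrank the highest-scoring clip of a normal video by a margin. The snippet below sketches that generic baseline loss only, not the paper's full objective (which also uses the mutual cosine embedding loss).

```python
import torch

def mil_ranking_loss(abnormal_scores, normal_scores, margin=1.0):
    """Generic MIL ranking loss for weakly supervised video anomaly detection.
    Inputs are per-clip anomaly scores of one abnormal and one normal video;
    only video-level labels are needed to form the pair."""
    return torch.relu(margin - abnormal_scores.max() + normal_scores.max())
```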
Discourse-Level Prosody Modeling with a Variational Autoencoder for Non-Autoregressive Expressive Speech Synthesis
Ning Wu, Zhaoci Liu, Zhenhua Ling
DOI: https://doi.org/10.1109/icassp43922.2022.9746238 | Published: 2022-05-23
Abstract: To address the issue of one-to-many mapping from phoneme sequences to acoustic features in expressive speech synthesis, this paper proposes a method of discourse-level prosody modeling with a variational autoencoder (VAE) based on the non-autoregressive architecture of FastSpeech. In this method, phone-level prosody codes are extracted from prosody features by combining the VAE with FastSpeech, and are predicted using discourse-level text features together with BERT embeddings. The continuous wavelet transform (CWT) used in FastSpeech2 for F0 representation is no longer necessary. Experimental results on a Chinese audiobook dataset show that the proposed method can effectively take advantage of discourse-level linguistic information and outperforms FastSpeech2 in the naturalness and expressiveness of synthetic speech.
Citations: 1
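The phone-level prosody codes mentioned in the abstract are latent variables of a VAE. The sketch below shows only the standard encoding and reparameterization step for such codes; the plain linear encoder and the dimensions are assumptions of ours, not the paper's FastSpeech-based model.

```python
import torch
import torch.nn as nn

class PhoneProsodyEncoder(nn.Module):
    """Standard VAE reparameterization for phone-level prosody codes
    (illustrative only; architecture and dimensions are assumptions)."""
    def __init__(self, in_dim=4, z_dim=8):
        super().__init__()
        self.enc = nn.Linear(in_dim, 2 * z_dim)

    def forward(self, prosody_feats):                 # (batch, n_phones, in_dim)
        mu, logvar = self.enc(prosody_feats).chunk(2, dim=-1)
        z = mu + torch.randn_like(mu) * torch.exp(0.5 * logvar)   # sampled codes
        kl = -0.5 * torch.sum(1 + logvar - mu.pow(2) - logvar.exp(), dim=-1)
        return z, kl.mean()                           # codes and KL regularizer
```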
Learning Approach For Fast Approximate Matrix Factorizations
Haiyan Yu, Zhen Qin, Zhihui Zhu
DOI: https://doi.org/10.1109/icassp43922.2022.9747165 | Published: 2022-05-23
Abstract: Efficiently computing an (approximate) orthonormal basis and low-rank approximation for the input data X plays a crucial role in data analysis. One of the most efficient algorithms for such tasks is the randomized algorithm, which proceeds by computing a projection XA with a random sketching matrix A of much smaller size, and then computing the orthonormal basis as well as low-rank factorizations of the tall matrix XA. While a random matrix A is the de facto choice, in this work we improve upon its performance by using a learning approach to find an adaptive sketching matrix A from a set of training data. We derive a closed-form formulation for the gradient of the training problem, enabling us to use efficient gradient-based algorithms. We also extend this approach to learning a structured sketching matrix, such as a sparse sketching matrix that amounts to selecting a small number of representative columns from the input data. Our experiments on both synthetic and real data show that both learned dense and sparse sketching matrices outperform random ones in finding approximate orthonormal bases and low-rank approximations.
Citations: 0
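The baseline randomized procedure described in the abstract (sketch Y = XA, orthonormalize, then factor the small problem) fits in a few lines of NumPy. The sketch below uses a Gaussian A; in the paper's approach, A would instead be learned from training data. Function and parameter names are ours.

```python
import numpy as np

def randomized_lowrank(X, k, oversample=5, seed=None):
    """Baseline randomized orthonormal basis and low-rank approximation:
    Y = X @ A with a Gaussian sketch A, Q from QR(Y), approximation Q (Q^T X)."""
    rng = np.random.default_rng(seed)
    m, n = X.shape
    A = rng.standard_normal((n, k + oversample))   # random sketching matrix
    Y = X @ A                                      # tall sketch, much smaller than X
    Q, _ = np.linalg.qr(Y)                         # approximate orthonormal basis
    B = Q.T @ X                                    # small (k + oversample) x n factor
    return Q, Q @ B                                # basis and low-rank approximation
```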
NEX+: Novel View Synthesis with Neural Regularisation Over Multi-Plane Images
Wenpeng Xing, Jie Chen
DOI: https://doi.org/10.1109/icassp43922.2022.9746938 | Published: 2022-05-23
Abstract: We propose Nex+, a neural Multi-Plane Image (MPI) representation with alpha denoising for the task of novel view synthesis (NVS). Overfitting to training data is a common challenge for all learning-based models. We propose a novel solution for resolving this issue in the context of NVS, using signal-denoising-motivated operations over the alpha coefficients of the MPI without any additional supervision requirements. Nex+ contains a novel 5D Alpha Neural Regulariser (ANR), which favors low-frequency components in the angular domain, i.e., the alpha coefficients' signal sub-space indicating various viewing directions. ANR's angular low-frequency property derives from its small number of angular encoding levels and output basis. The regularised alpha in Nex+ models the scene geometry more accurately than Nex, and outperforms other state-of-the-art methods on public datasets for the task of NVS.
Citations: 3
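For context, MPI-based methods such as this one render a view by standard front-to-back alpha compositing of the per-plane colours and alpha maps (after warping the planes to the target view, which is omitted here); the ANR in the abstract regularises exactly these alpha coefficients. The sketch below shows only that standard compositing step, not the paper's model.

```python
import numpy as np

def composite_mpi(colors, alphas):
    """Front-to-back alpha compositing of a multi-plane image.
    colors: (D, H, W, 3); alphas: (D, H, W), with plane 0 closest to the camera."""
    out = np.zeros(colors.shape[1:])
    transmittance = np.ones(alphas.shape[1:])
    for c, a in zip(colors, alphas):
        out += transmittance[..., None] * a[..., None] * c   # visible contribution
        transmittance *= (1.0 - a)                           # light blocked so far
    return out
```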