ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)最新文献_第10页

Hypergraphs with Edge-Dependent Vertex Weights: Spectral Clustering Based on the 1-Laplacian 边缘依赖顶点权值的超图:基于1-拉普拉斯的谱聚类

ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2022-05-23 DOI: 10.1109/icassp43922.2022.9746363

Yu Zhu, Boning Li, Santiago Segarra

引用次数: 2

Low Resources Online Single-Microphone Speech Enhancement with Harmonic Emphasis 低资源在线单麦克风语音增强与谐波重点

ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2022-05-23 DOI: 10.1109/icassp43922.2022.9747656

Nir Raviv, Ofer Schwartz, S. Gannot

引用次数: 1

Confidence Estimation for Speech Emotion Recognition Based on the Relationship Between Emotion Categories and Primitives 基于情感类别与原语关系的语音情感识别置信度估计

ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2022-05-23 DOI: 10.1109/ICASSP43922.2022.9746930

Y. Li, C. Papayiannis, Viktor Rozgic, Elizabeth Shriberg, Chao Wang

{"title":"Confidence Estimation for Speech Emotion Recognition Based on the Relationship Between Emotion Categories and Primitives","authors":"Y. Li, C. Papayiannis, Viktor Rozgic, Elizabeth Shriberg, Chao Wang","doi":"10.1109/ICASSP43922.2022.9746930","DOIUrl":"https://doi.org/10.1109/ICASSP43922.2022.9746930","url":null,"abstract":"Confidence estimation for Speech Emotion Recognition (SER) is instrumental in improving the reliability in the behavior of downstream applications. In this work we propose (1) a novel confidence metric for SER based on the relationship between emotion primitives: arousal, valence, and dominance (AVD) and emotion categories (ECs), (2) EmoConfidNet - a DNN trained alongside the EC recognizer to predict the proposed confidence metric, and (3) a data filtering technique used to enhance the training of EmoConfidNet and the EC recognizer. For each training sample, we calculate distances from corresponding AVD annotation vectors to centroids of each EC in the AVD space, and define EC confidences as functions of the evaluated distances. EmoConfidNet is trained to predict confidence from the same acoustic representations used to train the EC recognizer. EmoConfidNet outperforms state-of-the-art confidence estimation methods on the MSP-Podcast and IEMOCAP datasets. For a fixed EC recognizer, after we reject the same number of low confidence predictions using EmoConfidNet, we achieve a higher F1 and unweighted average recall (UAR) than when rejecting using other methods.","PeriodicalId":272439,"journal":{"name":"ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133566899","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Robust Collaborative Learning for Sequence Modelling 鲁棒协同学习序列建模

ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2022-05-23 DOI: 10.1109/icassp43922.2022.9746494

Francois Buet-Golfouse, Hans Roggeman, Islam Utyagulov

引用次数: 0

DeepHull: Fast Convex Hull Approximation in High Dimensions DeepHull:高维快速凸壳近似

ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2022-05-23 DOI: 10.1109/ICASSP43922.2022.9746031

Randall Balestriero, Zichao Wang, Richard Baraniuk

引用次数: 2

Unsupervised Anomaly Detection for Container Cloud Via BILSTM-Based Variational Auto-Encoder 基于bilstm的变分自编码器的容器云无监督异常检测

ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2022-05-23 DOI: 10.1109/icassp43922.2022.9747341

Yulong Wang, Xingshu Chen, Qixu Wang, Run Yang, Bangzhou Xin

引用次数: 3

Learning Subject-Invariant Representations from Speech-Evoked EEG Using Variational Autoencoders 用变分自编码器学习语音诱发脑电的主体不变表征

ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2022-05-23 DOI: 10.1109/ICASSP43922.2022.9747297

Lies Bollens, T. Francart, H. V. hamme

引用次数: 9

Controlled Sensing and Anomaly Detection Via Soft Actor-Critic Reinforcement Learning 基于软Actor-Critic强化学习的受控传感和异常检测

ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2022-05-23 DOI: 10.1109/icassp43922.2022.9747436

Chen Zhong, M. C. Gursoy, Senem Velipasalar

引用次数: 1

Physical Layer Anonymous Communications: An Anonymity Entropy Oriented Precoding Design (Invited Paper) 物理层匿名通信:面向匿名熵的预编码设计(特邀论文)

ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2022-05-23 DOI: 10.1109/icassp43922.2022.9746100

Zhongxiang Wei, C. Masouros, Sumei Sun

{"title":"Physical Layer Anonymous Communications: An Anonymity Entropy Oriented Precoding Design (Invited Paper)","authors":"Zhongxiang Wei, C. Masouros, Sumei Sun","doi":"10.1109/icassp43922.2022.9746100","DOIUrl":"https://doi.org/10.1109/icassp43922.2022.9746100","url":null,"abstract":"Different from traditional security-oriented designs, the aim of anonymizing techniques is to mask users' identities during communication, thereby providing users with unidentifiability and unlinkability. The existing anonymizing techniques are only designated at upper layers of networks, ignoring the risk of anonymity leakage at physical layer (PHY). In this paper, we address the PHY anonymity design with focus on a typical uplink scenario where the receiver is equipped with more antennas than the sender. With the increased degrees-of-freedom at the receiver side, we first propose a maximum likelihood estimation (MLE) signal trace-back detector, which only analyzes the signaling pattern of the received signal to disclose the sender's identity. Accordingly, an anonymity entropy anonymous (AEA) precoder is proposed, which manipulates the transmitted signalling pattern to counteract the receiver's trace-back detector and meanwhile to guarantee high receive signal-to-interference-plus-noise ratio for communication. More importantly, more data streams can be multiplexed than the number of transmit antennas, which is particularly suitable for the strong receiver configuration. Simulation demonstrates that the proposed AEA precoder can simultaneously provide high anonymity and communication performance.","PeriodicalId":272439,"journal":{"name":"ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132270137","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Unsupervised Hierarchical Translation-Based Model for Multi-Modal Medical Image Registration 基于无监督分层翻译的多模态医学图像配准模型

ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2022-05-23 DOI: 10.1109/ICASSP43922.2022.9746324

X. Dai, Tai Ma, Haibin Cai, Ying Wen

{"title":"Unsupervised Hierarchical Translation-Based Model for Multi-Modal Medical Image Registration","authors":"X. Dai, Tai Ma, Haibin Cai, Ying Wen","doi":"10.1109/ICASSP43922.2022.9746324","DOIUrl":"https://doi.org/10.1109/ICASSP43922.2022.9746324","url":null,"abstract":"Deformable registration of multi-modal medical images is a challenging task in medical image processing due to the differences in both appearance and structure. We propose an unsupervised hierarchical translation-based model to perform a coarse to fine registration of multi-modal medical images. The proposed model consists of three parts: a coarse registration network, a modal translation network and a fine registration network. First, the coarse registration network learns to obtain the coarse deformation field, which is applied as structure-preserving information to generate a translated image by the modal translation network. Then, the translated image as enhancing information combined with the original images are used to derive a fine deformation field in the fine registration network. Furthermore, the final deformation field is composed from the coarse and the fine deformation fields. In this way, the proposed model can learn high accurate deformation field to implement multi-modal medical image registration. Experiments on two multi-modal brain image datasets demonstrate the effectiveness of this model.","PeriodicalId":272439,"journal":{"name":"ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132275886","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1