{"title":"Sensor Selection for Angle of Arrival Estimation Based on the Two-Target Cramér-Rao Bound","authors":"C. Kokke, M. Coutiño, L. Anitori, R. Heusdens, G. Leus","doi":"10.1109/ICASSP49357.2023.10094942","DOIUrl":"https://doi.org/10.1109/ICASSP49357.2023.10094942","url":null,"abstract":"Sensor selection is a useful method to help reduce data throughput, as well as computational, power, and hardware requirements, while still maintaining acceptable performance. Although minimizing the Cramér-Rao bound has been adopted previously for sparse sensing, it did not consider multiple targets and unknown source models. In this work, we propose to tackle the sensor selection problem for angle of arrival estimation using the worst-case Cramér-Rao bound of two uncorrelated sources. To do so, we cast the problem as a convex semi-definite program and retrieve the binary selection by randomized rounding. Through numerical examples related to a linear array, we illustrate the proposed method and show that it leads to the natural selection of elements at the edges plus the center of the linear array. This contrasts with the typical solutions obtained from minimizing the single-target Cramér-Rao bound.","PeriodicalId":113072,"journal":{"name":"ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123958254","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Stargan-vc Based Cross-Domain Data Augmentation for Speaker Verification","authors":"Hang-Rui Hu, Yan Song, Jian-Tao Zhang, Lirong Dai, I. Mcloughlin, Zhu Zhuo, Yujie Zhou, Yu-Hong Li, Hui Xue","doi":"10.1109/ICASSP49357.2023.10094698","DOIUrl":"https://doi.org/10.1109/ICASSP49357.2023.10094698","url":null,"abstract":"Automatic speaker verification (ASV) faces domain shift caused by the mismatch of intrinsic and extrinsic factors, such as recording device and speaking style, in real-world applications, which leads to severe performance degradation. Since single-speaker multi-condition (SSMC) data is difficult to collect in practice, existing domain adaptation methods are hard to ensure the feature consistency of the same class but different domains. To this end, we propose a cross-domain data generation method to obtain a domain-invariant ASV system. Inspired by voice conversion (VC) task, a StarGAN based generative model first learns cross-domain mappings from SSMC data, and then generates missing domain data for all speakers, thus increasing the intra-class diversity of the training set. Considering the difference between ASV and VC task, we renovate the corresponding training objectives and network structure to make the adaptation task-specific. 
Evaluations on achieve a relative performance improvement of about 5-8% over the baseline in terms of minDCF and EER, outperforming the CNSRC winner’s system of the equivalent scale.","PeriodicalId":113072,"journal":{"name":"ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124002604","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Classifying Non-Individual Head-Related Transfer Functions with A Computational Auditory Model: Calibration And Metrics","authors":"Rapolas Daugintis, Roberto Barumerli, L. Picinali, M. Geronazzo","doi":"10.1109/ICASSP49357.2023.10095152","DOIUrl":"https://doi.org/10.1109/ICASSP49357.2023.10095152","url":null,"abstract":"This study explores the use of a multi-feature Bayesian auditory sound localisation model to classify non-individual head-related transfer functions (HRTFs). Based on predicted sound localisation performance, these are grouped into ‘good’ and ‘bad’, and the ‘best’/‘worst’ is selected from each category. Firstly, we present a greedy algorithm for automated individual calibration of the model based on the individual sound localisation data. We then discuss data analysis of predicted directional localisation errors and present an algorithm for categorising the HRTFs based on the localisation error distributions within a limited range of directions in front of the listener. Finally, we discuss the validity of the classification algorithm when using averaged instead of individual model parameters. This analysis of auditory modelling results aims to provide a perceptual foundation for automated HRTF personalisation techniques for an improved experience of binaural spatial audio technologies.","PeriodicalId":113072,"journal":{"name":"ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"71 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124185053","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Privacy-Enhanced Federated Learning Against Attribute Inference Attack for Speech Emotion Recognition","authors":"Huan Zhao, Haijiao Chen, Yufeng Xiao, Zixing Zhang","doi":"10.1109/ICASSP49357.2023.10095737","DOIUrl":"https://doi.org/10.1109/ICASSP49357.2023.10095737","url":null,"abstract":"Federal learning-based (FL) Speech Emotion Recognition (SER) framework aims to protect data privacy when characterizing emotions. However, previous studies have shown that the framework is vulnerable, because curious servers can indirectly infer user private information. To address this challenge, we propose a novel privacy- enhanced SER approach against attribute inference attack. It helps filter sensitive information and attends to highlight emotion features before uploading the shared model updates under the FL. Firstly, a bi-directional recurrent neural network captures the latent representations in sequences to discard partial redundant features. Then, a feature attention mechanism is applied to focus on the salient regions in the latent representations, further hiding emotion-irrelevant attributes. The experimental results show that the introduced model is effective. The attack capability of a gender prediction model is reduced to a chance level while retaining SER performance.","PeriodicalId":113072,"journal":{"name":"ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"102 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123342836","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Improving Electric Load Demand Forecasting with Anchor-Based Forecasting Method","authors":"Maria Tzelepi, P. Nousi, A. Tefas","doi":"10.1109/ICASSP49357.2023.10096754","DOIUrl":"https://doi.org/10.1109/ICASSP49357.2023.10096754","url":null,"abstract":"In this paper we deal with the problem of Electric Load Demand Forecasting (ELDF) considering the Greek Energy Market. Motivated by the anchored-based object detection methods, we argue that considering the ELDF task we can define an anchor and transform the problem into predicting the offset instead of predicting the actual load values. The experimental evaluation considering the one-day-ahead forecasting task, validated the effectiveness of the proposed Anchor-based FOREcasting (AFORE) method. The AFORE method achieved significant improvements in terms of mean absolute percentage error under various setups, using different loss functions and model architectures.","PeriodicalId":113072,"journal":{"name":"ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"135 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121207384","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Multiple Target Measurements: Bayesian Framework for Moving Object Detection in Mimo Radar","authors":"Bastian Eisele, Ali Bereyhi, R. Müller","doi":"10.1109/ICASSP49357.2023.10094649","DOIUrl":"https://doi.org/10.1109/ICASSP49357.2023.10094649","url":null,"abstract":"Utilizing compressive sensing (CS), one can significantly reduce the number of required antenna elements in MIMO radar systems, while preserving a high spatial resolution. Most CS-based studies focus on individual processing of a single set of measurements collected from an stationary scene. In this paper, we propose a new scheme called multiple target measurements (MTM). This scheme uses the target movement to collect multiple sets of measurements from jointly sparse stationary scenes. Invoking approximate message passing, we develop a Bayesian-like iterative algorithm to recover the sparse scenes jointly. Our analytical and numerical investigations demonstrate that MTM can further reduce the array size required to achieve a desired spatial resolution.","PeriodicalId":113072,"journal":{"name":"ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114210315","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"FedSD: A New Federated Learning Structure Used in Non-iid Data","authors":"Minmin Yi, Houchun Ning, Peng Liu","doi":"10.1109/ICASSP49357.2023.10095595","DOIUrl":"https://doi.org/10.1109/ICASSP49357.2023.10095595","url":null,"abstract":"One of the most challenging problems in federated learning is the convergence speed problem caused by heterogeneity. We propose a novel structure called FedSD, a new method to accelerate the model convergence. We change the one-stage-cycle iteration structure to a 2-stage-cycle one to get the latest global gradient descent direction which can guide the model training direction. We instantiate algorithms using FedSD to improve the performance of experiments on several public datasets. Our empirical studies validate the excellent performance of FedSD.","PeriodicalId":113072,"journal":{"name":"ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114358267","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"ERBNet: An Effective Representation Based Network for Unbiased Scene Graph Generation","authors":"Wenxing Ma, Tianxiang Hou, Qianji Di, Zhongang Qi, Ying Shan, Hanzi Wang","doi":"10.1109/ICASSP49357.2023.10094727","DOIUrl":"https://doi.org/10.1109/ICASSP49357.2023.10094727","url":null,"abstract":"The scene graph generation (SGG) task has attracted increasing attention in recent years. The goal of SGG is to predict relations between pairs of objects within an image. Due to the long-tailed distribution of the dataset annotations, the performance of SGG is still far from satisfactory. To address the long-tailed problem, existing methods try various ways to conduct unbiased learning. However, we argue that the essence of the long-tailed problem in SGG is that the classifier is seriously affected by the long-tailed data. To handle this issue, we propose a novel network named ERBNet, which contains a relation feature fusion (RFF) encoder to construct effective representations of relations between objects, and a nearest class mean (NCM) classifier to conduct relation prediction based on relation feature similarities. Extensive experimental results show that the proposed ERBNet outperforms several state-of-the-art methods on the challenging Visual Genome dataset.","PeriodicalId":113072,"journal":{"name":"ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114620597","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Seri: Sketching-Reasoning-Integrating Progressive Workflow for Empathetic Response Generation","authors":"Guanqun Bi, Yanan Cao, Piji Li, Yuqiang Xie, Fang Fang, Zheng Lin","doi":"10.1109/ICASSP49357.2023.10094672","DOIUrl":"https://doi.org/10.1109/ICASSP49357.2023.10094672","url":null,"abstract":"Empathy is a key ability for a human-like dialogue system. Inspired by social psychology, empathy includes both affective and cognitive aspects. Previous works on this topic have merely focused on recognizing emotions or modeling cognition with commonsense knowledge. Nevertheless, the generated results of these works still have a big gap with human-like empathetic responses. In this paper, we propose Seri, a SkEtching-Reasoning-Integrating framework for empathetic response generation. In particular, we define an empathy planner to capture and reason about multi-source information that considers cognition and affection. Further, we introduce a dynamic integrator module that allows the model dynamically select the appropriate information to generate empathetic responses. Experimental results on EmpatheticDialogue show that our method outperforms competitive baselines and generates responses with higher diversity and cognitive empathy levels.","PeriodicalId":113072,"journal":{"name":"ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116273842","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Audio-Driven High Definetion and Lip-Synchronized Talking Face Generation Based on Face Reenactment","authors":"Xianyu Wang, Yuhan Zhang, Weihua He, Yaoyuan Wang, Minglei Li, Yuchen Wang, Jingyi Zhang, Shunbo Zhou, Ziyang Zhang","doi":"10.1109/icassp49357.2023.10097270","DOIUrl":"https://doi.org/10.1109/icassp49357.2023.10097270","url":null,"abstract":"Generating audio-driven photo-realistic talking face has received intensive attention due to its ability to bring more new human-computer interaction experiences. However, previous works struggled to balance high definition, lip synchronization, and low customization costs, which would degrade the user experience. In this paper, a novel audio-driven talking face generation method was proposed, which subtly converts the problem of improving video definition into the problem of face reenactment to produce both lip-synchronized and high- definition face video. The framework is decoupled, meaning that the same trained model can be used on arbitrary characters and audio without further customizing training for specific people, thus significantly reducing costs. Experiment results show that our proposed method achieves the high video definition, and comparable lip synchronization performance with the existing state-of-the-art methods.","PeriodicalId":113072,"journal":{"name":"ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116357213","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}