2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP): Latest Articles, Page 3

Explainable Deep Learning Detection of Gaussian Propeller Noise with Unknown Signal-to-Noise Ratio

2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP) Pub Date : 2021-10-25 DOI: 10.1109/mlsp52302.2021.9596566

M. Thomas, Lionel Fillatre, Laurent Deruaz-Pepin

Citations: 2

A Placement Angle Detection Method of Recyclable Object for Garbage Power Generation

2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP) Pub Date : 2021-10-25 DOI: 10.1109/mlsp52302.2021.9596431

Y. Cai, Mengwei Chen, Yifei Feng, Zheng Ming

Citations: 0

A General Parametrization Framework for Pairwise Markov Models: An Application to Unsupervised Image Segmentation

2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP) Pub Date : 2021-10-25 DOI: 10.1109/mlsp52302.2021.9596395

H. Gangloff, Katherine Morales, Y. Petetin

Citations: 0

Adaptive Normalized LMP Estimation for Graph Signal Processing

2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP) Pub Date : 2021-10-25 DOI: 10.1109/mlsp52302.2021.9596181

Yi Yan, Radwa Adel, E. Kuruoğlu

Citations: 3

Disentangled Speech Representation Learning Based on Factorized Hierarchical Variational Autoencoder with Self-Supervised Objective

2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP) Pub Date : 2021-10-25 DOI: 10.1109/MLSP52302.2021.9596320

Yuying Xie, Thomas Arildsen, Z. Tan

Abstract: Disentangled representation learning aims to extract explanatory features or factors while retaining salient information. The factorized hierarchical variational autoencoder (FHVAE) disentangles a speech signal into sequence-level and segment-level features, which represent speaker identity and speech content, respectively. Autoregressive predictive coding (APC), a self-supervised objective, has in turn been used to extract meaningful and transferable speech features for multiple downstream tasks. Inspired by the success of these two representation learning methods, this paper proposes integrating the APC objective into the FHVAE framework to benefit from the additional self-supervision target. The main proposed method requires neither more training data nor more computation at test time, yet it yields more meaningful representations while maintaining disentanglement. Experiments were conducted on the TIMIT dataset. The results show that FHVAE equipped with the additional self-supervised objective learns features that give superior performance on tasks including speech recognition and speaker recognition. Voice conversion, one application of disentangled representation learning, was also applied and evaluated. The results show that the new framework performs similarly to the baseline on voice conversion.

Citations: 7

Caesynth: Real-Time Timbre Interpolation and Pitch Control with Conditional Autoencoders

2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP) Pub Date : 2021-10-25 DOI: 10.1109/mlsp52302.2021.9596414

Aaron Valero Puche, Sukhan Lee

Citations: 0

Singing Fundamental Frequency Contour Generation Using Generalized Command-Response Model and Score-Conditional Variational Autoencoder

2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP) Pub Date : 2021-10-25 DOI: 10.1109/mlsp52302.2021.9596428

Shogo Seki, Haruka Taga, T. Toda

Abstract: This paper proposes a method for physically motivated and interpretable control of fundamental frequency (F0) contour generation in singing aid systems for laryngectomees. A recently proposed variational autoencoder (VAE)-based method, VAE-SPACE, has successfully generated singing F0 contours from musical scores. However, VAE-SPACE can generate F0 contours that deviate from physically plausible ones. Moreover, to represent fluctuations in F0 contours, VAE-SPACE requires manual adjustment of the noise components that are input alongside the musical scores. To address these issues, the proposed method 1) introduces a generalized command-response (GCR) model to represent an F0 contour as an approximation of a physical F0 production mechanism, and 2) employs a conditional VAE (CVAE) to treat musical scores and the noise components separately. The experimental results show that the proposed method achieves performance comparable to VAE-SPACE without manual adjustment of the noise components, and that the trained GCR model makes it possible to control F0 contours more intuitively.

Citations: 0

Adversarial Perturbation Attacks on Nested Dichotomies Classification Systems

2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP) Pub Date : 2021-10-25 DOI: 10.1109/mlsp52302.2021.9596336

Ismail R. Alkhouri, Alvaro Velasquez, George K. Atia

Citations: 1

Learning Parametric Time-Vertex Graph Processes from Incomplete Realizations

2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP) Pub Date : 2021-10-25 DOI: 10.1109/mlsp52302.2021.9596563

Eylem Tuğçe Güneyi, Abdullah Canbolat, Elif Vural

Citations: 2

Early Fusion Graph Convolutional Network for Skeleton-Based Action Recognition

2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP) Pub Date : 2021-10-25 DOI: 10.1109/mlsp52302.2021.9596448

Xiaoxue Zhao, Cuiwei Liu, Xiangbin Shi

Citations: 0