基于复合自编码器高斯混合模型的异常声检测

Conference on Electronic Information Engineering and Data Processing Pub Date : 2023-05-26 DOI:10.1117/12.2682257

Heng Wang, Jie Liu, Shuaifeng Li

{"title":"基于复合自编码器高斯混合模型的异常声检测","authors":"Heng Wang, Jie Liu, Shuaifeng Li","doi":"10.1117/12.2682257","DOIUrl":null,"url":null,"abstract":"Aiming at the problem that the accuracy of abnormal sound detection under unsupervised conditions is not ideal, a novel abnormal sound detection model using composite self-coder combined with Gaussian mixture model is proposed. Firstly, the timing structure and gating mechanism of LSTM are used to improve the feature extraction ability of self-coder (including self-coder and variational self-coder), Secondly, Gaussian Mixture Model (GMM) is used to generate artificial data to improve the robustness of the self-coder against background noise. Experiments are carried out using ToyADMOS and MIMII public data sets, and the results are superior to the naive self-coder and the two improved self-coding models. On the six machines of the experimental data set, AUC increases by 6.34%, 6.65%, 4.03%, 5.57%, 2.38% and 1.07% respectively.","PeriodicalId":177416,"journal":{"name":"Conference on Electronic Information Engineering and Data Processing","volume":"115 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Abnormal sound detection based on composite autoencoder Gaussian mixture model\",\"authors\":\"Heng Wang, Jie Liu, Shuaifeng Li\",\"doi\":\"10.1117/12.2682257\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Aiming at the problem that the accuracy of abnormal sound detection under unsupervised conditions is not ideal, a novel abnormal sound detection model using composite self-coder combined with Gaussian mixture model is proposed. Firstly, the timing structure and gating mechanism of LSTM are used to improve the feature extraction ability of self-coder (including self-coder and variational self-coder), Secondly, Gaussian Mixture Model (GMM) is used to generate artificial data to improve the robustness of the self-coder against background noise. Experiments are carried out using ToyADMOS and MIMII public data sets, and the results are superior to the naive self-coder and the two improved self-coding models. On the six machines of the experimental data set, AUC increases by 6.34%, 6.65%, 4.03%, 5.57%, 2.38% and 1.07% respectively.\",\"PeriodicalId\":177416,\"journal\":{\"name\":\"Conference on Electronic Information Engineering and Data Processing\",\"volume\":\"115 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-05-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Conference on Electronic Information Engineering and Data Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1117/12.2682257\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Conference on Electronic Information Engineering and Data Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.2682257","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

针对无监督条件下异常声检测精度不理想的问题，提出了一种复合自编码器与高斯混合模型相结合的异常声检测模型。首先，利用LSTM的定时结构和门控机制提高自编码器(包括自编码器和变分自编码器)的特征提取能力;其次，利用高斯混合模型(GMM)生成人工数据，提高自编码器对背景噪声的鲁棒性。利用ToyADMOS和MIMII公共数据集进行了实验，结果优于朴素自编码模型和两种改进的自编码模型。在实验数据集的6台机器上，AUC分别增加了6.34%、6.65%、4.03%、5.57%、2.38%和1.07%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Abnormal sound detection based on composite autoencoder Gaussian mixture model

Aiming at the problem that the accuracy of abnormal sound detection under unsupervised conditions is not ideal, a novel abnormal sound detection model using composite self-coder combined with Gaussian mixture model is proposed. Firstly, the timing structure and gating mechanism of LSTM are used to improve the feature extraction ability of self-coder (including self-coder and variational self-coder), Secondly, Gaussian Mixture Model (GMM) is used to generate artificial data to improve the robustness of the self-coder against background noise. Experiments are carried out using ToyADMOS and MIMII public data sets, and the results are superior to the naive self-coder and the two improved self-coding models. On the six machines of the experimental data set, AUC increases by 6.34%, 6.65%, 4.03%, 5.57%, 2.38% and 1.07% respectively.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Conference on Electronic Information Engineering and Data Processing

自引率

0.00%

发文量