{"title":"基于复合自编码器高斯混合模型的异常声检测","authors":"Heng Wang, Jie Liu, Shuaifeng Li","doi":"10.1117/12.2682257","DOIUrl":null,"url":null,"abstract":"Aiming at the problem that the accuracy of abnormal sound detection under unsupervised conditions is not ideal, a novel abnormal sound detection model using composite self-coder combined with Gaussian mixture model is proposed. Firstly, the timing structure and gating mechanism of LSTM are used to improve the feature extraction ability of self-coder (including self-coder and variational self-coder), Secondly, Gaussian Mixture Model (GMM) is used to generate artificial data to improve the robustness of the self-coder against background noise. Experiments are carried out using ToyADMOS and MIMII public data sets, and the results are superior to the naive self-coder and the two improved self-coding models. On the six machines of the experimental data set, AUC increases by 6.34%, 6.65%, 4.03%, 5.57%, 2.38% and 1.07% respectively.","PeriodicalId":177416,"journal":{"name":"Conference on Electronic Information Engineering and Data Processing","volume":"115 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Abnormal sound detection based on composite autoencoder Gaussian mixture model\",\"authors\":\"Heng Wang, Jie Liu, Shuaifeng Li\",\"doi\":\"10.1117/12.2682257\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Aiming at the problem that the accuracy of abnormal sound detection under unsupervised conditions is not ideal, a novel abnormal sound detection model using composite self-coder combined with Gaussian mixture model is proposed. Firstly, the timing structure and gating mechanism of LSTM are used to improve the feature extraction ability of self-coder (including self-coder and variational self-coder), Secondly, Gaussian Mixture Model (GMM) is used to generate artificial data to improve the robustness of the self-coder against background noise. Experiments are carried out using ToyADMOS and MIMII public data sets, and the results are superior to the naive self-coder and the two improved self-coding models. On the six machines of the experimental data set, AUC increases by 6.34%, 6.65%, 4.03%, 5.57%, 2.38% and 1.07% respectively.\",\"PeriodicalId\":177416,\"journal\":{\"name\":\"Conference on Electronic Information Engineering and Data Processing\",\"volume\":\"115 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-05-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Conference on Electronic Information Engineering and Data Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1117/12.2682257\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Conference on Electronic Information Engineering and Data Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.2682257","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Abnormal sound detection based on composite autoencoder Gaussian mixture model
Aiming at the problem that the accuracy of abnormal sound detection under unsupervised conditions is not ideal, a novel abnormal sound detection model using composite self-coder combined with Gaussian mixture model is proposed. Firstly, the timing structure and gating mechanism of LSTM are used to improve the feature extraction ability of self-coder (including self-coder and variational self-coder), Secondly, Gaussian Mixture Model (GMM) is used to generate artificial data to improve the robustness of the self-coder against background noise. Experiments are carried out using ToyADMOS and MIMII public data sets, and the results are superior to the naive self-coder and the two improved self-coding models. On the six machines of the experimental data set, AUC increases by 6.34%, 6.65%, 4.03%, 5.57%, 2.38% and 1.07% respectively.