{"title":"Abnormal sound detection based on composite autoencoder Gaussian mixture model","authors":"Heng Wang, Jie Liu, Shuaifeng Li","doi":"10.1117/12.2682257","DOIUrl":null,"url":null,"abstract":"Aiming at the problem that the accuracy of abnormal sound detection under unsupervised conditions is not ideal, a novel abnormal sound detection model using composite self-coder combined with Gaussian mixture model is proposed. Firstly, the timing structure and gating mechanism of LSTM are used to improve the feature extraction ability of self-coder (including self-coder and variational self-coder), Secondly, Gaussian Mixture Model (GMM) is used to generate artificial data to improve the robustness of the self-coder against background noise. Experiments are carried out using ToyADMOS and MIMII public data sets, and the results are superior to the naive self-coder and the two improved self-coding models. On the six machines of the experimental data set, AUC increases by 6.34%, 6.65%, 4.03%, 5.57%, 2.38% and 1.07% respectively.","PeriodicalId":177416,"journal":{"name":"Conference on Electronic Information Engineering and Data Processing","volume":"115 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Conference on Electronic Information Engineering and Data Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.2682257","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Aiming at the problem that the accuracy of abnormal sound detection under unsupervised conditions is not ideal, a novel abnormal sound detection model using composite self-coder combined with Gaussian mixture model is proposed. Firstly, the timing structure and gating mechanism of LSTM are used to improve the feature extraction ability of self-coder (including self-coder and variational self-coder), Secondly, Gaussian Mixture Model (GMM) is used to generate artificial data to improve the robustness of the self-coder against background noise. Experiments are carried out using ToyADMOS and MIMII public data sets, and the results are superior to the naive self-coder and the two improved self-coding models. On the six machines of the experimental data set, AUC increases by 6.34%, 6.65%, 4.03%, 5.57%, 2.38% and 1.07% respectively.