Kai Li, Khalid Zaman, Xingfeng Li, Masato Akagi, Masashi Unoki
{"title":"利用从机器特定滤波器库中提取的频谱-时态调制表示法进行机器异常声音检测","authors":"Kai Li, Khalid Zaman, Xingfeng Li, Masato Akagi, Masashi Unoki","doi":"arxiv-2409.05319","DOIUrl":null,"url":null,"abstract":"Early detection of factory machinery malfunctions is crucial in industrial\napplications. In machine anomalous sound detection (ASD), different machines\nexhibit unique vibration-frequency ranges based on their physical properties.\nMeanwhile, the human auditory system is adept at tracking both temporal and\nspectral dynamics of machine sounds. Consequently, integrating the\ncomputational auditory models of the human auditory system with\nmachine-specific properties can be an effective approach to machine ASD. We\nfirst quantified the frequency importances of four types of machines using the\nFisher ratio (F-ratio). The quantified frequency importances were then used to\ndesign machine-specific non-uniform filterbanks (NUFBs), which extract the log\nnon-uniform spectrum (LNS) feature. The designed NUFBs have a narrower\nbandwidth and higher filter distribution density in frequency regions with\nrelatively high F-ratios. Finally, spectral and temporal modulation\nrepresentations derived from the LNS feature were proposed. These proposed LNS\nfeature and modulation representations are input into an autoencoder\nneural-network-based detector for ASD. The quantification results from the\ntraining set of the Malfunctioning Industrial Machine Investigation and\nInspection dataset with a signal-to-noise (SNR) of 6 dB reveal that the\ndistinguishing information between normal and anomalous sounds of different\nmachines is encoded non-uniformly in the frequency domain. By highlighting\nthese important frequency regions using NUFBs, the LNS feature can\nsignificantly enhance performance using the metric of AUC (area under the\nreceiver operating characteristic curve) under various SNR conditions.\nFurthermore, modulation representations can further improve performance.\nSpecifically, temporal modulation is effective for fans, pumps, and sliders,\nwhile spectral modulation is particularly effective for valves.","PeriodicalId":501178,"journal":{"name":"arXiv - CS - Sound","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Machine Anomalous Sound Detection Using Spectral-temporal Modulation Representations Derived from Machine-specific Filterbanks\",\"authors\":\"Kai Li, Khalid Zaman, Xingfeng Li, Masato Akagi, Masashi Unoki\",\"doi\":\"arxiv-2409.05319\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Early detection of factory machinery malfunctions is crucial in industrial\\napplications. In machine anomalous sound detection (ASD), different machines\\nexhibit unique vibration-frequency ranges based on their physical properties.\\nMeanwhile, the human auditory system is adept at tracking both temporal and\\nspectral dynamics of machine sounds. Consequently, integrating the\\ncomputational auditory models of the human auditory system with\\nmachine-specific properties can be an effective approach to machine ASD. We\\nfirst quantified the frequency importances of four types of machines using the\\nFisher ratio (F-ratio). The quantified frequency importances were then used to\\ndesign machine-specific non-uniform filterbanks (NUFBs), which extract the log\\nnon-uniform spectrum (LNS) feature. The designed NUFBs have a narrower\\nbandwidth and higher filter distribution density in frequency regions with\\nrelatively high F-ratios. Finally, spectral and temporal modulation\\nrepresentations derived from the LNS feature were proposed. These proposed LNS\\nfeature and modulation representations are input into an autoencoder\\nneural-network-based detector for ASD. The quantification results from the\\ntraining set of the Malfunctioning Industrial Machine Investigation and\\nInspection dataset with a signal-to-noise (SNR) of 6 dB reveal that the\\ndistinguishing information between normal and anomalous sounds of different\\nmachines is encoded non-uniformly in the frequency domain. By highlighting\\nthese important frequency regions using NUFBs, the LNS feature can\\nsignificantly enhance performance using the metric of AUC (area under the\\nreceiver operating characteristic curve) under various SNR conditions.\\nFurthermore, modulation representations can further improve performance.\\nSpecifically, temporal modulation is effective for fans, pumps, and sliders,\\nwhile spectral modulation is particularly effective for valves.\",\"PeriodicalId\":501178,\"journal\":{\"name\":\"arXiv - CS - Sound\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Sound\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.05319\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Sound","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.05319","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
工厂机械故障的早期检测在工业应用中至关重要。在机器异常声音检测(ASD)中,不同的机器会根据其物理特性表现出独特的振动频率范围。因此,将人类听觉系统的计算听觉模型与机器的特定属性相结合,不失为一种有效的机器自动识别方法。我们首先使用菲舍尔比率(F-ratio)量化了四种机器的频率输入。然后,我们利用量化的频率导入值设计了针对特定机器的非均匀滤波器库(NUFB),从而提取出了对数均匀频谱(LNS)特征。所设计的 NUFB 在 F 比相对较高的频率区域具有更窄的带宽和更高的滤波器分布密度。最后,还提出了从 LNS 特征得出的频谱和时间调制表示法。这些提出的 LNS 特征和调制表示被输入到基于自动编码器神经网络的 ASD 检测器中。从信噪比(SNR)为 6 dB 的故障工业机器调查和检测数据集训练集的量化结果显示,不同机器的正常声音和异常声音之间的区分信息在频域中编码不均匀。通过使用 NUFBs 突出显示这些重要的频率区域,LNS 特征可以在各种信噪比条件下显著提高 AUC(接收器工作特性曲线下的面积)指标的性能。
Machine Anomalous Sound Detection Using Spectral-temporal Modulation Representations Derived from Machine-specific Filterbanks
Early detection of factory machinery malfunctions is crucial in industrial
applications. In machine anomalous sound detection (ASD), different machines
exhibit unique vibration-frequency ranges based on their physical properties.
Meanwhile, the human auditory system is adept at tracking both temporal and
spectral dynamics of machine sounds. Consequently, integrating the
computational auditory models of the human auditory system with
machine-specific properties can be an effective approach to machine ASD. We
first quantified the frequency importances of four types of machines using the
Fisher ratio (F-ratio). The quantified frequency importances were then used to
design machine-specific non-uniform filterbanks (NUFBs), which extract the log
non-uniform spectrum (LNS) feature. The designed NUFBs have a narrower
bandwidth and higher filter distribution density in frequency regions with
relatively high F-ratios. Finally, spectral and temporal modulation
representations derived from the LNS feature were proposed. These proposed LNS
feature and modulation representations are input into an autoencoder
neural-network-based detector for ASD. The quantification results from the
training set of the Malfunctioning Industrial Machine Investigation and
Inspection dataset with a signal-to-noise (SNR) of 6 dB reveal that the
distinguishing information between normal and anomalous sounds of different
machines is encoded non-uniformly in the frequency domain. By highlighting
these important frequency regions using NUFBs, the LNS feature can
significantly enhance performance using the metric of AUC (area under the
receiver operating characteristic curve) under various SNR conditions.
Furthermore, modulation representations can further improve performance.
Specifically, temporal modulation is effective for fans, pumps, and sliders,
while spectral modulation is particularly effective for valves.