Zihao Zhang, Yu Zhang, Daniel O’Boy, Miguel Martínez-García
{"title":"Adaptive threshold in Leaky-Integrated-and-Fire function for audio-based industrial diagnosis","authors":"Zihao Zhang, Yu Zhang, Daniel O’Boy, Miguel Martínez-García","doi":"10.1016/j.jii.2025.100944","DOIUrl":null,"url":null,"abstract":"<div><div>Audio-based fault diagnosis identifies machine operating conditions through acoustic signals, enabling targeted maintenance and reducing downtime in smart manufacturing and embodied intelligence. The traditional Leaky-Integrated-and-Fire (LIF) function in neural networks improves fault state classification by removing shared information while preserving category-unique features. However, its threshold, a backpropagation-optimized parameter that governs the information removal pattern, becomes a fixed constant after training. This constant threshold enforces a uniform information removal pattern across all audio samples despite the significant variations in time–frequency characteristics. Motivated by this and inspired by the auditory system’s adaptive modulation, in contrast to the traditional constant threshold, where the threshold remains a constant after training, this paper proposes a learnable Adaptive Threshold, allowing the threshold to dynamically adapt to the input audio even after training. As the threshold adapts to different inputs rather than remaining a fixed constant, more unique information can be preserved to enhance classification accuracy. The results demonstrate that the adaptive threshold outperforms the constant threshold and other state-of-the-art methods, achieving 99.75% on the IDMT Engine dataset and 98.11% on the MIMII Pump dataset. Visualization results confirm that while both the adaptive threshold and constant threshold successfully suppress non-unique background sounds, such as flowing water, the adaptive threshold demonstrates superior performance in preserving unique features, such as the impact sound from a broken pump. This capability contributes to more accurate fault diagnosis, further validating the effectiveness of the proposed method.</div></div>","PeriodicalId":55975,"journal":{"name":"Journal of Industrial Information Integration","volume":"48 ","pages":"Article 100944"},"PeriodicalIF":10.4000,"publicationDate":"2025-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Industrial Information Integration","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2452414X25001670","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Audio-based fault diagnosis identifies machine operating conditions through acoustic signals, enabling targeted maintenance and reducing downtime in smart manufacturing and embodied intelligence. The traditional Leaky-Integrated-and-Fire (LIF) function in neural networks improves fault state classification by removing shared information while preserving category-unique features. However, its threshold, a backpropagation-optimized parameter that governs the information removal pattern, becomes a fixed constant after training. This constant threshold enforces a uniform information removal pattern across all audio samples despite the significant variations in time–frequency characteristics. Motivated by this and inspired by the auditory system’s adaptive modulation, in contrast to the traditional constant threshold, where the threshold remains a constant after training, this paper proposes a learnable Adaptive Threshold, allowing the threshold to dynamically adapt to the input audio even after training. As the threshold adapts to different inputs rather than remaining a fixed constant, more unique information can be preserved to enhance classification accuracy. The results demonstrate that the adaptive threshold outperforms the constant threshold and other state-of-the-art methods, achieving 99.75% on the IDMT Engine dataset and 98.11% on the MIMII Pump dataset. Visualization results confirm that while both the adaptive threshold and constant threshold successfully suppress non-unique background sounds, such as flowing water, the adaptive threshold demonstrates superior performance in preserving unique features, such as the impact sound from a broken pump. This capability contributes to more accurate fault diagnosis, further validating the effectiveness of the proposed method.
期刊介绍:
The Journal of Industrial Information Integration focuses on the industry's transition towards industrial integration and informatization, covering not only hardware and software but also information integration. It serves as a platform for promoting advances in industrial information integration, addressing challenges, issues, and solutions in an interdisciplinary forum for researchers, practitioners, and policy makers.
The Journal of Industrial Information Integration welcomes papers on foundational, technical, and practical aspects of industrial information integration, emphasizing the complex and cross-disciplinary topics that arise in industrial integration. Techniques from mathematical science, computer science, computer engineering, electrical and electronic engineering, manufacturing engineering, and engineering management are crucial in this context.