ScaloAdaptAlert, a novel framework for supervised anomaly detection in industrial acoustic data, integrating power scalograms, adaptive filter banks, and convolutional neural networks — A case study
M.A. Zakeri Harandi , Tzu-Yuan Lin , Chen Li , Sigurd L. Villumsen , Maani Ghaffari , Ole Madsen
{"title":"ScaloAdaptAlert, a novel framework for supervised anomaly detection in industrial acoustic data, integrating power scalograms, adaptive filter banks, and convolutional neural networks — A case study","authors":"M.A. Zakeri Harandi , Tzu-Yuan Lin , Chen Li , Sigurd L. Villumsen , Maani Ghaffari , Ole Madsen","doi":"10.1016/j.jmsy.2025.01.007","DOIUrl":null,"url":null,"abstract":"<div><div>Acoustic data, as a modality for building data-driven industrial monitoring systems, is particularly notable for its comprehensive insights into both operational and machinery states of a process. However, the effectiveness of existing time–frequency representation (TFR)-based frameworks remains limited in industrial contexts. Originally designed for analyzing human speech and music signals, these frameworks often struggle with the complex, non-stationary, and non-harmonic nature of manufacturing sound data. Addressing these challenges, this paper introduces ‘ScaloAdaptAlert’ (SAdAlert), a novel, domain-agnostic framework for deriving time–frequency representations from industrial acoustic data. SAdAlert employs wavelet transform to capture both local and global spectral characteristics, uses Gaussian filter banks in an adaptive fashion to identify spectral features at both low and high frequencies, and applies max-pooling to reduce temporal dimensionality. The presented framework effectively preserves dominant information of the acoustic data while isolating its relevant features in noisy settings and addressing class imbalance. Our method, validated on a real-world anomaly detection dataset from a robotic screwing process, demonstrates superior performance compared to state-of-the-art deep learning models and conventional TFR methods. This validation underscores SAdAlert’s potential to advance industrial acoustic monitoring by providing a robust, efficient, and highly adaptable tool for analyzing complex industrial acoustic data.</div></div>","PeriodicalId":16227,"journal":{"name":"Journal of Manufacturing Systems","volume":"79 ","pages":"Pages 234-254"},"PeriodicalIF":12.2000,"publicationDate":"2025-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Manufacturing Systems","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0278612525000159","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, INDUSTRIAL","Score":null,"Total":0}
引用次数: 0
Abstract
Acoustic data, as a modality for building data-driven industrial monitoring systems, is particularly notable for its comprehensive insights into both operational and machinery states of a process. However, the effectiveness of existing time–frequency representation (TFR)-based frameworks remains limited in industrial contexts. Originally designed for analyzing human speech and music signals, these frameworks often struggle with the complex, non-stationary, and non-harmonic nature of manufacturing sound data. Addressing these challenges, this paper introduces ‘ScaloAdaptAlert’ (SAdAlert), a novel, domain-agnostic framework for deriving time–frequency representations from industrial acoustic data. SAdAlert employs wavelet transform to capture both local and global spectral characteristics, uses Gaussian filter banks in an adaptive fashion to identify spectral features at both low and high frequencies, and applies max-pooling to reduce temporal dimensionality. The presented framework effectively preserves dominant information of the acoustic data while isolating its relevant features in noisy settings and addressing class imbalance. Our method, validated on a real-world anomaly detection dataset from a robotic screwing process, demonstrates superior performance compared to state-of-the-art deep learning models and conventional TFR methods. This validation underscores SAdAlert’s potential to advance industrial acoustic monitoring by providing a robust, efficient, and highly adaptable tool for analyzing complex industrial acoustic data.
期刊介绍:
The Journal of Manufacturing Systems is dedicated to showcasing cutting-edge fundamental and applied research in manufacturing at the systems level. Encompassing products, equipment, people, information, control, and support functions, manufacturing systems play a pivotal role in the economical and competitive development, production, delivery, and total lifecycle of products, meeting market and societal needs.
With a commitment to publishing archival scholarly literature, the journal strives to advance the state of the art in manufacturing systems and foster innovation in crafting efficient, robust, and sustainable manufacturing systems. The focus extends from equipment-level considerations to the broader scope of the extended enterprise. The Journal welcomes research addressing challenges across various scales, including nano, micro, and macro-scale manufacturing, and spanning diverse sectors such as aerospace, automotive, energy, and medical device manufacturing.