André Paulo Ferreira Machado, Celso Jose Munaro, Patrick Marques Ciarelli
{"title":"Enhancing one-Class classifiers performance in multivariate time series through dynamic clustering: A case study on hydraulic system fault detection","authors":"André Paulo Ferreira Machado, Celso Jose Munaro, Patrick Marques Ciarelli","doi":"10.1016/j.eswa.2025.128088","DOIUrl":null,"url":null,"abstract":"<div><div>Datasets composed of multivariate time series arising from real applications are usually affected by many factors such as noise and disturbances. Any modeling procedure benefits from having its training data carefully selected. This paper presents a methodology designed to enhance the performance of one-class classifiers in time series by incorporating dynamic time series clustering. The clustering process leverages DTW Barycenter Averaging (DBA) and k-means to group multivariate time series based on similarity. The Apriori algorithm is used to generate subsets of instances, which are then used to train multiple one-class classifiers for the same class. Three distinct strategies are applied to combine the outputs of these classifiers for each class. The proposed method is evaluated on a hydraulic system dataset to investigate typical faults that occur simultaneously and with varying intensities. The results show that improving the similarity of the training subsets and combining the outputs of the classifiers led to a performance improvement of more than 89 %. In addition, the methodology successfully reduced a hydraulic system dataset from 17 variables to as few as 3 or even 1, while still achieving better classification performance compared with recent findings in the literature.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"286 ","pages":"Article 128088"},"PeriodicalIF":7.5000,"publicationDate":"2025-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425017099","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Datasets composed of multivariate time series arising from real applications are usually affected by many factors such as noise and disturbances. Any modeling procedure benefits from having its training data carefully selected. This paper presents a methodology designed to enhance the performance of one-class classifiers in time series by incorporating dynamic time series clustering. The clustering process leverages DTW Barycenter Averaging (DBA) and k-means to group multivariate time series based on similarity. The Apriori algorithm is used to generate subsets of instances, which are then used to train multiple one-class classifiers for the same class. Three distinct strategies are applied to combine the outputs of these classifiers for each class. The proposed method is evaluated on a hydraulic system dataset to investigate typical faults that occur simultaneously and with varying intensities. The results show that improving the similarity of the training subsets and combining the outputs of the classifiers led to a performance improvement of more than 89 %. In addition, the methodology successfully reduced a hydraulic system dataset from 17 variables to as few as 3 or even 1, while still achieving better classification performance compared with recent findings in the literature.
期刊介绍:
Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.