{"title":"利用机器学习对小型数据集进行时间序列故障预测","authors":"Caio Souto Maior;Thaylon Silva","doi":"10.1109/TLA.2024.10500720","DOIUrl":null,"url":null,"abstract":"Condition-based maintenance is a decision-making strategy using condition monitoring information to optimize the availability of operational plants. In this context, machine learning techniques are useful and have been used in predicting the remaining useful life (RUL) of equipment to ensure the overall safety and reliability of the system through maintenance policies and, consequently, reducing costs arising from the failure. These databases are not large which is tricky for data-driven models. In this study, we consider five different databases containing the failure times from distinct real-world equipment. Here, four different regression algorithms were compared for RUL prediction, namely: Support Vector Regression (SVR), Decision Tree (DT), Multilayer Perceptron (MLP) and K-Nearest Neighbors (KNN). Furthermore, aiming to improve the data quality, the Empirical Mode Decomposition (EMD) was used, which is responsible for pre-processing the input data used on the predictive modeling. We optimize the models hyperparameters using grid-search cross-validation algorithm and the performance of each model is compared using the normalized root mean squared error (NRMSE). Considering the datasets analyzed, KNN model proves to be the most promising to perform the prognostic task in small datasets, adapting itself to the distinct characteristics of the different databases. In addition, we mention the better performance after optimizing the hyperparameters, which avoided overfitting problems and had a low computational cost for the problems analyzed here.","PeriodicalId":55024,"journal":{"name":"IEEE Latin America Transactions","volume":null,"pages":null},"PeriodicalIF":1.3000,"publicationDate":"2024-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10500720","citationCount":"0","resultStr":"{\"title\":\"Time-series failure prediction on small datasets using machine learning\",\"authors\":\"Caio Souto Maior;Thaylon Silva\",\"doi\":\"10.1109/TLA.2024.10500720\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Condition-based maintenance is a decision-making strategy using condition monitoring information to optimize the availability of operational plants. In this context, machine learning techniques are useful and have been used in predicting the remaining useful life (RUL) of equipment to ensure the overall safety and reliability of the system through maintenance policies and, consequently, reducing costs arising from the failure. These databases are not large which is tricky for data-driven models. In this study, we consider five different databases containing the failure times from distinct real-world equipment. Here, four different regression algorithms were compared for RUL prediction, namely: Support Vector Regression (SVR), Decision Tree (DT), Multilayer Perceptron (MLP) and K-Nearest Neighbors (KNN). Furthermore, aiming to improve the data quality, the Empirical Mode Decomposition (EMD) was used, which is responsible for pre-processing the input data used on the predictive modeling. We optimize the models hyperparameters using grid-search cross-validation algorithm and the performance of each model is compared using the normalized root mean squared error (NRMSE). Considering the datasets analyzed, KNN model proves to be the most promising to perform the prognostic task in small datasets, adapting itself to the distinct characteristics of the different databases. In addition, we mention the better performance after optimizing the hyperparameters, which avoided overfitting problems and had a low computational cost for the problems analyzed here.\",\"PeriodicalId\":55024,\"journal\":{\"name\":\"IEEE Latin America Transactions\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":1.3000,\"publicationDate\":\"2024-04-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10500720\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Latin America Transactions\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10500720/\",\"RegionNum\":4,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Latin America Transactions","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10500720/","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
Time-series failure prediction on small datasets using machine learning
Condition-based maintenance is a decision-making strategy using condition monitoring information to optimize the availability of operational plants. In this context, machine learning techniques are useful and have been used in predicting the remaining useful life (RUL) of equipment to ensure the overall safety and reliability of the system through maintenance policies and, consequently, reducing costs arising from the failure. These databases are not large which is tricky for data-driven models. In this study, we consider five different databases containing the failure times from distinct real-world equipment. Here, four different regression algorithms were compared for RUL prediction, namely: Support Vector Regression (SVR), Decision Tree (DT), Multilayer Perceptron (MLP) and K-Nearest Neighbors (KNN). Furthermore, aiming to improve the data quality, the Empirical Mode Decomposition (EMD) was used, which is responsible for pre-processing the input data used on the predictive modeling. We optimize the models hyperparameters using grid-search cross-validation algorithm and the performance of each model is compared using the normalized root mean squared error (NRMSE). Considering the datasets analyzed, KNN model proves to be the most promising to perform the prognostic task in small datasets, adapting itself to the distinct characteristics of the different databases. In addition, we mention the better performance after optimizing the hyperparameters, which avoided overfitting problems and had a low computational cost for the problems analyzed here.
期刊介绍:
IEEE Latin America Transactions (IEEE LATAM) is an interdisciplinary journal focused on the dissemination of original and quality research papers / review articles in Spanish and Portuguese of emerging topics in three main areas: Computing, Electric Energy and Electronics. Some of the sub-areas of the journal are, but not limited to: Automatic control, communications, instrumentation, artificial intelligence, power and industrial electronics, fault diagnosis and detection, transportation electrification, internet of things, electrical machines, circuits and systems, biomedicine and biomedical / haptic applications, secure communications, robotics, sensors and actuators, computer networks, smart grids, among others.