{"title":"A two-step machine learning approach for predictive maintenance and anomaly detection in environmental sensor systems","authors":"Saiprasad Potharaju , Ravi Kumar Tirandasu , Swapnali N. Tambe , Devyani Bhamare Jadhav , Dudla Anil Kumar , Shanmuk Srinivas Amiripalli","doi":"10.1016/j.mex.2025.103181","DOIUrl":null,"url":null,"abstract":"<div><div>Environmental sensor systems are essential for monitoring infrastructure and environmental quality but are prone to unreliability caused by sensor faults and environmental anomalies. Using Environmental Sensor Telemetry Data, this study introduces a novel methodology that combines unsupervised and supervised machine learning approaches to detect anomalies and predict sensor failures. The dataset consisted of sensor readings such as temperature, humidity, CO, LPG, and smoke, with no class labels available. This research is novel in seamlessly blending unsupervised anomaly detection using Isolation Forest to create labels for data points that were previously unlabeled. Finally, these generated labels were used to train the supervised learning models such as Random Forest, Neural Network (MLP Classifier), and AdaBoost to predict anomalies in new sensor data as soon as it gets recorded. The models confirmed the proposed framework's accuracy, whereas Random Forest 99.93 %, Neural Network 99.05 %, and AdaBoost 98.04 % validated the effectiveness of the suggested framework. Such an approach addresses a critical gap, transforming raw, unlabeled IoT sensor data into actionable insights for predictive maintenance. This methodology provides a scalable and robust real-time anomaly detection and sensor fault prediction methodology that greatly enhances the reliability of the environmental monitoring systems and advances the intelligent infrastructure management.<ul><li><span>•</span><span><div>Combines Isolation Forest for anomaly labeling and supervised models for anomaly prediction.</div></span></li><li><span>•</span><span><div>Scalable and adaptable for diverse IoT applications for environmental monitoring.</div></span></li><li><span>•</span><span><div>Provides actionable insights through anomaly visualization, revealing patterns in sensor performance.</div></span></li></ul></div></div>","PeriodicalId":18446,"journal":{"name":"MethodsX","volume":"14 ","pages":"Article 103181"},"PeriodicalIF":1.6000,"publicationDate":"2025-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"MethodsX","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2215016125000299","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Environmental sensor systems are essential for monitoring infrastructure and environmental quality but are prone to unreliability caused by sensor faults and environmental anomalies. Using Environmental Sensor Telemetry Data, this study introduces a novel methodology that combines unsupervised and supervised machine learning approaches to detect anomalies and predict sensor failures. The dataset consisted of sensor readings such as temperature, humidity, CO, LPG, and smoke, with no class labels available. This research is novel in seamlessly blending unsupervised anomaly detection using Isolation Forest to create labels for data points that were previously unlabeled. Finally, these generated labels were used to train the supervised learning models such as Random Forest, Neural Network (MLP Classifier), and AdaBoost to predict anomalies in new sensor data as soon as it gets recorded. The models confirmed the proposed framework's accuracy, whereas Random Forest 99.93 %, Neural Network 99.05 %, and AdaBoost 98.04 % validated the effectiveness of the suggested framework. Such an approach addresses a critical gap, transforming raw, unlabeled IoT sensor data into actionable insights for predictive maintenance. This methodology provides a scalable and robust real-time anomaly detection and sensor fault prediction methodology that greatly enhances the reliability of the environmental monitoring systems and advances the intelligent infrastructure management.
•
Combines Isolation Forest for anomaly labeling and supervised models for anomaly prediction.
•
Scalable and adaptable for diverse IoT applications for environmental monitoring.
•
Provides actionable insights through anomaly visualization, revealing patterns in sensor performance.