Kaleeswari Chinnakkaruppan, K. Krishnamoorthy, Senthilrajan Agniraj
{"title":"基于传感器的水质分布数据技术异常预测的混合方法","authors":"Kaleeswari Chinnakkaruppan, K. Krishnamoorthy, Senthilrajan Agniraj","doi":"10.1109/PIECON56912.2023.10085799","DOIUrl":null,"url":null,"abstract":"The detection of anomalous data from sensor data is the intractable problem in today’s hydrological data monitoring. Hardware malfunctions, power issues, battery life, and efficiency are challenges encountered while employing sensor devices to collect data. In such situations, data that is inconsistent may be recorded. By applying these types of dataset, inaccurate results may be produced when performing classification or other data analytic methods. In order to discover anomalous data from a huge dataset, this paper suggests a hybrid mechanism. Three unsupervised machine learning techniques are used to construct this mechanism. First, this study reduces superfluous data by using Principal Component Analysis (PCA). Isolation Forest (IF) is then used to find outlier scores. Finally, K-means clustering is used to distinguish between abnormal (anomalies) and regular data using a visual representation of cluster assignments. The cluster assessment indices’ criteria were used to evaluate this hybrid approach. According to the findings, this hybrid technique would be suitable for identifying anomalous data inside each data index of the dataset, depending on the target value.","PeriodicalId":182428,"journal":{"name":"2023 International Conference on Power, Instrumentation, Energy and Control (PIECON)","volume":"21 2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-02-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Hybrid Approach for Forecasting the Technical Anomalies in Sensor-based Water Quality Distribution Data\",\"authors\":\"Kaleeswari Chinnakkaruppan, K. Krishnamoorthy, Senthilrajan Agniraj\",\"doi\":\"10.1109/PIECON56912.2023.10085799\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The detection of anomalous data from sensor data is the intractable problem in today’s hydrological data monitoring. Hardware malfunctions, power issues, battery life, and efficiency are challenges encountered while employing sensor devices to collect data. In such situations, data that is inconsistent may be recorded. By applying these types of dataset, inaccurate results may be produced when performing classification or other data analytic methods. In order to discover anomalous data from a huge dataset, this paper suggests a hybrid mechanism. Three unsupervised machine learning techniques are used to construct this mechanism. First, this study reduces superfluous data by using Principal Component Analysis (PCA). Isolation Forest (IF) is then used to find outlier scores. Finally, K-means clustering is used to distinguish between abnormal (anomalies) and regular data using a visual representation of cluster assignments. The cluster assessment indices’ criteria were used to evaluate this hybrid approach. According to the findings, this hybrid technique would be suitable for identifying anomalous data inside each data index of the dataset, depending on the target value.\",\"PeriodicalId\":182428,\"journal\":{\"name\":\"2023 International Conference on Power, Instrumentation, Energy and Control (PIECON)\",\"volume\":\"21 2 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-02-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 International Conference on Power, Instrumentation, Energy and Control (PIECON)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/PIECON56912.2023.10085799\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Power, Instrumentation, Energy and Control (PIECON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PIECON56912.2023.10085799","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Hybrid Approach for Forecasting the Technical Anomalies in Sensor-based Water Quality Distribution Data
The detection of anomalous data from sensor data is the intractable problem in today’s hydrological data monitoring. Hardware malfunctions, power issues, battery life, and efficiency are challenges encountered while employing sensor devices to collect data. In such situations, data that is inconsistent may be recorded. By applying these types of dataset, inaccurate results may be produced when performing classification or other data analytic methods. In order to discover anomalous data from a huge dataset, this paper suggests a hybrid mechanism. Three unsupervised machine learning techniques are used to construct this mechanism. First, this study reduces superfluous data by using Principal Component Analysis (PCA). Isolation Forest (IF) is then used to find outlier scores. Finally, K-means clustering is used to distinguish between abnormal (anomalies) and regular data using a visual representation of cluster assignments. The cluster assessment indices’ criteria were used to evaluate this hybrid approach. According to the findings, this hybrid technique would be suitable for identifying anomalous data inside each data index of the dataset, depending on the target value.