Vignesh Rao , Amir Eskandari , Farhana Zulkernine , Mohamed K. Helwa , David Beach
{"title":"CNN-CCA:地铁轨道传感器时间序列数据异常检测的深度学习方法","authors":"Vignesh Rao , Amir Eskandari , Farhana Zulkernine , Mohamed K. Helwa , David Beach","doi":"10.1016/j.mlwa.2025.100728","DOIUrl":null,"url":null,"abstract":"<div><div>The Internet of Things (IoT) offers new challenges in estimating correlations among data from multiple connected devices to understand their behaviors. Canonical Correlation Analysis (CCA) can be used to measure correlations among several observed variables. Different CCA methods have been proposed in the literature including probabilistic, sparse, kernel, discriminative, and deep learning-based CCAs. However, existing CCA approaches are limited by assumptions of linearity, reliance on predefined kernels, or difficulty in modeling localized patterns in high-frequency IoT sensor data. In this research, we explore two methods, linear CCA and non-linear deep learning-based CCA. Experiments demonstrate the effectiveness of CCA in detecting correlation in synthetic and metro rail time series sensor data collected from Autonomous Train (AT) signaling systems. Also, we propose a novel Convolutional Neural Network (CNN) based CCA method to detect correlation-based mappings and combine it with statistical anomaly detection methods in collective anomaly detection. The results indicate strong performance with an F1-score of 89.0% and a sensitivity of 94.1%, which can pave the way for the application of the proposed models to real-time collective anomaly detection and CCA in IoT systems.</div></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"21 ","pages":"Article 100728"},"PeriodicalIF":4.9000,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"CNN-CCA: A deep learning approach for anomaly detection in metro rail sensor time-series data\",\"authors\":\"Vignesh Rao , Amir Eskandari , Farhana Zulkernine , Mohamed K. Helwa , David Beach\",\"doi\":\"10.1016/j.mlwa.2025.100728\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The Internet of Things (IoT) offers new challenges in estimating correlations among data from multiple connected devices to understand their behaviors. Canonical Correlation Analysis (CCA) can be used to measure correlations among several observed variables. Different CCA methods have been proposed in the literature including probabilistic, sparse, kernel, discriminative, and deep learning-based CCAs. However, existing CCA approaches are limited by assumptions of linearity, reliance on predefined kernels, or difficulty in modeling localized patterns in high-frequency IoT sensor data. In this research, we explore two methods, linear CCA and non-linear deep learning-based CCA. Experiments demonstrate the effectiveness of CCA in detecting correlation in synthetic and metro rail time series sensor data collected from Autonomous Train (AT) signaling systems. Also, we propose a novel Convolutional Neural Network (CNN) based CCA method to detect correlation-based mappings and combine it with statistical anomaly detection methods in collective anomaly detection. The results indicate strong performance with an F1-score of 89.0% and a sensitivity of 94.1%, which can pave the way for the application of the proposed models to real-time collective anomaly detection and CCA in IoT systems.</div></div>\",\"PeriodicalId\":74093,\"journal\":{\"name\":\"Machine learning with applications\",\"volume\":\"21 \",\"pages\":\"Article 100728\"},\"PeriodicalIF\":4.9000,\"publicationDate\":\"2025-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Machine learning with applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2666827025001112\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Machine learning with applications","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666827025001112","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
CNN-CCA: A deep learning approach for anomaly detection in metro rail sensor time-series data
The Internet of Things (IoT) offers new challenges in estimating correlations among data from multiple connected devices to understand their behaviors. Canonical Correlation Analysis (CCA) can be used to measure correlations among several observed variables. Different CCA methods have been proposed in the literature including probabilistic, sparse, kernel, discriminative, and deep learning-based CCAs. However, existing CCA approaches are limited by assumptions of linearity, reliance on predefined kernels, or difficulty in modeling localized patterns in high-frequency IoT sensor data. In this research, we explore two methods, linear CCA and non-linear deep learning-based CCA. Experiments demonstrate the effectiveness of CCA in detecting correlation in synthetic and metro rail time series sensor data collected from Autonomous Train (AT) signaling systems. Also, we propose a novel Convolutional Neural Network (CNN) based CCA method to detect correlation-based mappings and combine it with statistical anomaly detection methods in collective anomaly detection. The results indicate strong performance with an F1-score of 89.0% and a sensitivity of 94.1%, which can pave the way for the application of the proposed models to real-time collective anomaly detection and CCA in IoT systems.