Xiangxiang Zhang, Wenkai Hu, Ahmad W. Al-Dabbagh, Weihua Cao
{"title":"基于词嵌入和移动分割合并距离的工业报警洪水相似度分析","authors":"Xiangxiang Zhang, Wenkai Hu, Ahmad W. Al-Dabbagh, Weihua Cao","doi":"10.1109/ICPS58381.2023.10128020","DOIUrl":null,"url":null,"abstract":"In industrial facilities, alarm systems are essential for process monitoring. However, due to the simplicity of alarm configuration and the poor performance of the alarm system, alarm floods often happen. Similarity analysis of alarm floods compares alarm flood sequences to look for common patterns. These patterns can offer information that is helpful in identifying root causes of alarm floods. Existing methods for alarm flood similarity analysis can only conduct similarity measures based on match operations for alarms represented by text strings; as a result, the match operations ignore the correlations between alarm occurrences. In this work, a new alarm flood similarity analysis method based on word embedding and Move-Split-Merge (MSM) distance is proposed, in order to reveal sequence similarity from a new perspective. The contributions are mainly twofold: 1) The correlations of alarm occurrences are considered in the alarm encoding via word embedding; 2) The MSM distance is calculated to analyze the similarity of alarm flood sequences of different lengths. Then, clustering results are compared with the original true labels of the alarm floods. The effectiveness of the proposed method is demonstrated by a case study with alarm data generated by a public industrial model of the Vinyl Acetate Monomer process.","PeriodicalId":426122,"journal":{"name":"2023 IEEE 6th International Conference on Industrial Cyber-Physical Systems (ICPS)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Similarity Analysis of Industrial Alarm Floods Based on Word Embedding and Move-Split-Merge Distance\",\"authors\":\"Xiangxiang Zhang, Wenkai Hu, Ahmad W. Al-Dabbagh, Weihua Cao\",\"doi\":\"10.1109/ICPS58381.2023.10128020\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In industrial facilities, alarm systems are essential for process monitoring. However, due to the simplicity of alarm configuration and the poor performance of the alarm system, alarm floods often happen. Similarity analysis of alarm floods compares alarm flood sequences to look for common patterns. These patterns can offer information that is helpful in identifying root causes of alarm floods. Existing methods for alarm flood similarity analysis can only conduct similarity measures based on match operations for alarms represented by text strings; as a result, the match operations ignore the correlations between alarm occurrences. In this work, a new alarm flood similarity analysis method based on word embedding and Move-Split-Merge (MSM) distance is proposed, in order to reveal sequence similarity from a new perspective. The contributions are mainly twofold: 1) The correlations of alarm occurrences are considered in the alarm encoding via word embedding; 2) The MSM distance is calculated to analyze the similarity of alarm flood sequences of different lengths. Then, clustering results are compared with the original true labels of the alarm floods. The effectiveness of the proposed method is demonstrated by a case study with alarm data generated by a public industrial model of the Vinyl Acetate Monomer process.\",\"PeriodicalId\":426122,\"journal\":{\"name\":\"2023 IEEE 6th International Conference on Industrial Cyber-Physical Systems (ICPS)\",\"volume\":\"2 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-05-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 IEEE 6th International Conference on Industrial Cyber-Physical Systems (ICPS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICPS58381.2023.10128020\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE 6th International Conference on Industrial Cyber-Physical Systems (ICPS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICPS58381.2023.10128020","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Similarity Analysis of Industrial Alarm Floods Based on Word Embedding and Move-Split-Merge Distance
In industrial facilities, alarm systems are essential for process monitoring. However, due to the simplicity of alarm configuration and the poor performance of the alarm system, alarm floods often happen. Similarity analysis of alarm floods compares alarm flood sequences to look for common patterns. These patterns can offer information that is helpful in identifying root causes of alarm floods. Existing methods for alarm flood similarity analysis can only conduct similarity measures based on match operations for alarms represented by text strings; as a result, the match operations ignore the correlations between alarm occurrences. In this work, a new alarm flood similarity analysis method based on word embedding and Move-Split-Merge (MSM) distance is proposed, in order to reveal sequence similarity from a new perspective. The contributions are mainly twofold: 1) The correlations of alarm occurrences are considered in the alarm encoding via word embedding; 2) The MSM distance is calculated to analyze the similarity of alarm flood sequences of different lengths. Then, clustering results are compared with the original true labels of the alarm floods. The effectiveness of the proposed method is demonstrated by a case study with alarm data generated by a public industrial model of the Vinyl Acetate Monomer process.