Unsupervised Multi-Target Cross-Service Log Anomaly Detection

Shiming He; Rui Liu; Bowen Chen; Kun Xie; Jigang Wen

IEEE Transactions on Sustainable Computing, vol. 10, no. 5, pp. 1056-1069
Published: 2025-06-10
DOI: 10.1109/TSUSC.2025.3578517
https://ieeexplore.ieee.org/document/11029601/
Citations: 0
Abstract
Log analysis, especially log anomaly detection, helps debug systems and analyze root causes to provide reliable services. Deep learning is a promising technology for log anomaly detection, but deep learning methods need a large amount of training data, and a newly deployed system cannot easily collect sufficient logs. Transfer learning offers a possible solution: it applies knowledge from a long-term deployed system (the source) to a newly deployed system (the target). Existing transfer learning methods focus on transferring knowledge from a source system to a single target system within the same service, i.e., the source and the target belong to the same kind of service (e.g., operating system, supercomputer, or distributed system). These methods perform poorly when applied to multiple target systems across different services, both because of pronounced differences in log format, syntax, semantics, and component calls between services, and because they train a separate model for each target system. To tackle these problems, we propose LogMTC, an unsupervised multi-target cross-service log anomaly detection method based on transfer learning and contrastive learning. LogMTC exploits contrastive learning to train a single model on combined data from the source and multiple target systems, so that one model fits multiple target systems simultaneously and improves efficiency. LogMTC combines a hypersphere loss with two contrastive losses to minimize feature differences across different services. Our experiments on two services (a supercomputer and a distributed system) and three log datasets show that our method is superior to existing transfer learning methods in same-service, cross-service, and multi-target log anomaly detection. Compared with LogTAD, the most accurate peer transfer learning algorithm, LogMTC improves the F1 score by 1.14%-8.28% in multi-target transfer and is 1.12-1.22 times faster.
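The abstract mentions a hypersphere loss combined with contrastive losses to pull source and target log embeddings into a shared feature space. The paper's exact formulations are not reproduced here, so the following is only a minimal sketch under stated assumptions: a Deep SVDD-style hypersphere term (mean squared distance to a shared center) and an NT-Xent-style contrastive term over aligned source/target embeddings. All function names, the pairing of source/target rows as positives, and the weight `alpha` are illustrative choices, not LogMTC's actual design.

```python
import numpy as np

def hypersphere_loss(z, center):
    """Deep SVDD-style term (assumed): mean squared distance of
    embeddings z (n, d) to a shared hypersphere center (d,).
    Normal logs are pulled toward the center; anomalies end up far away."""
    return np.mean(np.sum((z - center) ** 2, axis=1))

def contrastive_loss(z_src, z_tgt, temperature=0.5):
    """NT-Xent-style term (assumed): row i of z_src and row i of z_tgt
    are treated as a positive pair; all other rows act as negatives."""
    # L2-normalize so the dot product is cosine similarity
    z_src = z_src / np.linalg.norm(z_src, axis=1, keepdims=True)
    z_tgt = z_tgt / np.linalg.norm(z_tgt, axis=1, keepdims=True)
    sim = (z_src @ z_tgt.T) / temperature      # (n, n) similarity matrix
    # softmax cross-entropy per row; positives sit on the diagonal
    logits = sim - sim.max(axis=1, keepdims=True)  # numerical stability
    log_prob = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return -np.mean(np.diag(log_prob))

def combined_loss(z_src, z_tgt, center, alpha=1.0):
    """Illustrative combined objective: compactness on all embeddings
    plus cross-service alignment between source and target."""
    l_sphere = hypersphere_loss(np.vstack([z_src, z_tgt]), center)
    l_contrast = contrastive_loss(z_src, z_tgt)
    return l_sphere + alpha * l_contrast
```

At detection time, a Deep SVDD-style setup would score a new log by its distance to `center` and flag it as anomalous above a threshold; the contrastive term only shapes the shared space during training.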