Wen Liu , Degang Sun , Haitian Yang , Yan Wang , Weiqing Huang
{"title":"Heterogeneous data-driven resolution generation for software systems via large language models","authors":"Wen Liu , Degang Sun , Haitian Yang , Yan Wang , Weiqing Huang","doi":"10.1016/j.ipm.2025.104376","DOIUrl":null,"url":null,"abstract":"<div><div>Modern software systems are increasingly complex and dynamic, making them particularly vulnerable to performance anomalies. Although runtime anomaly detection enhances system reliability, engineers still devote considerable time and effort to resolving errors once anomalous logs or metrics are detected. Such challenges call for intelligent automation capable of delivering targeted remediation steps based on detected anomalies. In this work, we first construct an anomaly-related knowledge base by combining heterogeneous operational data, including logs and metrics, with resolutions annotated by domain experts. Furthermore, we propose HASolver, the first Heterogeneous Anomaly Solver to generate recommended resolutions for multi-source system anomalies. The core component is a dual-view multi-vector module, designed to represent heterogeneous anomaly chunks from different modalities and to support effective multi-vector retrieval. HASolver integrates a large language model with domain knowledge to generate mitigation resolutions. We conduct extensive experiments using BLEU and ROUGE-1/2/L metrics. Compared to baseline approaches, HASolver delivers notable performance gains, improving BLEU and ROUGE-L scores by 14.6% and 19.6%, respectively. Further analyses are carried out to explore various multi-vector configurations and the effect of prompt strategies. We also release the annotated resolution dataset derived from the anomaly-related knowledge base to facilitate future research.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"63 2","pages":"Article 104376"},"PeriodicalIF":6.9000,"publicationDate":"2025-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Processing & Management","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0306457325003176","RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Modern software systems are increasingly complex and dynamic, making them particularly vulnerable to performance anomalies. Although runtime anomaly detection enhances system reliability, engineers still devote considerable time and effort to resolving errors once anomalous logs or metrics are detected. Such challenges call for intelligent automation capable of delivering targeted remediation steps based on detected anomalies. In this work, we first construct an anomaly-related knowledge base by combining heterogeneous operational data, including logs and metrics, with resolutions annotated by domain experts. Furthermore, we propose HASolver, the first Heterogeneous Anomaly Solver to generate recommended resolutions for multi-source system anomalies. The core component is a dual-view multi-vector module, designed to represent heterogeneous anomaly chunks from different modalities and to support effective multi-vector retrieval. HASolver integrates a large language model with domain knowledge to generate mitigation resolutions. We conduct extensive experiments using BLEU and ROUGE-1/2/L metrics. Compared to baseline approaches, HASolver delivers notable performance gains, improving BLEU and ROUGE-L scores by 14.6% and 19.6%, respectively. Further analyses are carried out to explore various multi-vector configurations and the effect of prompt strategies. We also release the annotated resolution dataset derived from the anomaly-related knowledge base to facilitate future research.
期刊介绍:
Information Processing and Management is dedicated to publishing cutting-edge original research at the convergence of computing and information science. Our scope encompasses theory, methods, and applications across various domains, including advertising, business, health, information science, information technology marketing, and social computing.
We aim to cater to the interests of both primary researchers and practitioners by offering an effective platform for the timely dissemination of advanced and topical issues in this interdisciplinary field. The journal places particular emphasis on original research articles, research survey articles, research method articles, and articles addressing critical applications of research. Join us in advancing knowledge and innovation at the intersection of computing and information science.