基于知识的IT事件智能系统

Salman Ahmed, Muskaan Singh, Brendan Doherty, E. Ramlan, Kathryn Harkin, M. Bucholc, Damien Coyle
{"title":"基于知识的IT事件智能系统","authors":"Salman Ahmed, Muskaan Singh, Brendan Doherty, E. Ramlan, Kathryn Harkin, M. Bucholc, Damien Coyle","doi":"10.1109/AIOps59134.2023.00005","DOIUrl":null,"url":null,"abstract":"The automation of IT incident management (i.e., handling of any unusual events that hamper the quality of IT services) is a main focus in Artificial Intelligence for IT Operations (AIOPS). The success and reputation of large-scale firms depend on their customer service and helpdesk system. These systems tend to handle client requests and track customer service agent interactions. In this research, we present a complete knowledge-based system that automates two core components of IT incident service management (ITSM): (1) Ticket Assignment Group(TAG) and (2) Incident Resolution (IR). Our proposed system bypasses the 4 core steps of the traditional ITSM process, including data investigation, event correlation, situation room collaboration, and probable root cause. It provides immediate solutions that can save companies key performance indicator(KPIs) resources and reduce the mean time to resolution (MTTR). The experiment used an industrial, real-time ITSM dataset from a prominent IT organization comprising 500,000 real-time incident descriptions with encoded labels. Furthermore, our systems are then evaluated with an open-source dataset. Compared to the existing benchmark methodologies, there is a 5 % improvement in terms of Accuracy score. The study demonstrates AI automation capabilities in incident handling (TAG and IR) for large real- world IT systems.","PeriodicalId":427858,"journal":{"name":"2023 IEEE/ACM International Workshop on Cloud Intelligence & AIOps (AIOps)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Knowledge-based Intelligent System for IT Incident DevOps\",\"authors\":\"Salman Ahmed, Muskaan Singh, Brendan Doherty, E. Ramlan, Kathryn Harkin, M. Bucholc, Damien Coyle\",\"doi\":\"10.1109/AIOps59134.2023.00005\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The automation of IT incident management (i.e., handling of any unusual events that hamper the quality of IT services) is a main focus in Artificial Intelligence for IT Operations (AIOPS). The success and reputation of large-scale firms depend on their customer service and helpdesk system. These systems tend to handle client requests and track customer service agent interactions. In this research, we present a complete knowledge-based system that automates two core components of IT incident service management (ITSM): (1) Ticket Assignment Group(TAG) and (2) Incident Resolution (IR). Our proposed system bypasses the 4 core steps of the traditional ITSM process, including data investigation, event correlation, situation room collaboration, and probable root cause. It provides immediate solutions that can save companies key performance indicator(KPIs) resources and reduce the mean time to resolution (MTTR). The experiment used an industrial, real-time ITSM dataset from a prominent IT organization comprising 500,000 real-time incident descriptions with encoded labels. Furthermore, our systems are then evaluated with an open-source dataset. Compared to the existing benchmark methodologies, there is a 5 % improvement in terms of Accuracy score. The study demonstrates AI automation capabilities in incident handling (TAG and IR) for large real- world IT systems.\",\"PeriodicalId\":427858,\"journal\":{\"name\":\"2023 IEEE/ACM International Workshop on Cloud Intelligence & AIOps (AIOps)\",\"volume\":\"11 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 IEEE/ACM International Workshop on Cloud Intelligence & AIOps (AIOps)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/AIOps59134.2023.00005\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE/ACM International Workshop on Cloud Intelligence & AIOps (AIOps)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AIOps59134.2023.00005","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

IT事件管理的自动化(即,处理任何妨碍IT服务质量的异常事件)是IT运营人工智能(AIOPS)的主要关注点。大型公司的成功和声誉取决于他们的客户服务和帮助台系统。这些系统倾向于处理客户请求并跟踪客户服务代理的交互。在这项研究中,我们提出了一个完整的基于知识的系统,该系统自动化了IT事件服务管理(ITSM)的两个核心组件:(1)票据分配组(TAG)和(2)事件解决(IR)。我们提出的系统绕过了传统ITSM流程的4个核心步骤,包括数据调查、事件关联、态势室协作和可能的根本原因。它提供即时解决方案,可以节省公司的关键绩效指标(kpi)资源,并减少平均解决时间(MTTR)。实验使用了来自一个著名IT组织的工业实时ITSM数据集,其中包含500,000个带有编码标签的实时事件描述。此外,我们的系统然后用开源数据集进行评估。与现有的基准方法相比,准确度得分提高了5%。该研究展示了大型现实世界IT系统在事件处理(TAG和IR)方面的人工智能自动化能力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Knowledge-based Intelligent System for IT Incident DevOps
The automation of IT incident management (i.e., handling of any unusual events that hamper the quality of IT services) is a main focus in Artificial Intelligence for IT Operations (AIOPS). The success and reputation of large-scale firms depend on their customer service and helpdesk system. These systems tend to handle client requests and track customer service agent interactions. In this research, we present a complete knowledge-based system that automates two core components of IT incident service management (ITSM): (1) Ticket Assignment Group(TAG) and (2) Incident Resolution (IR). Our proposed system bypasses the 4 core steps of the traditional ITSM process, including data investigation, event correlation, situation room collaboration, and probable root cause. It provides immediate solutions that can save companies key performance indicator(KPIs) resources and reduce the mean time to resolution (MTTR). The experiment used an industrial, real-time ITSM dataset from a prominent IT organization comprising 500,000 real-time incident descriptions with encoded labels. Furthermore, our systems are then evaluated with an open-source dataset. Compared to the existing benchmark methodologies, there is a 5 % improvement in terms of Accuracy score. The study demonstrates AI automation capabilities in incident handling (TAG and IR) for large real- world IT systems.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信