用于自动运行状况检查、早期问题检测和高级问题确定的分析引擎

Yogendra K. Srivastava, A. Abrashkevich
{"title":"用于自动运行状况检查、早期问题检测和高级问题确定的分析引擎","authors":"Yogendra K. Srivastava, A. Abrashkevich","doi":"10.1109/SRII.2012.70","DOIUrl":null,"url":null,"abstract":"With current trends in software industry toward increased complexity of modern software, tight integration of multiple software products, emphasis on software reliability and high-level availability, software support and maintenance costs increase dramatically. It is imperative for businesses to be able to monitor health of their systems making sure that they are performing at top levels, quickly respond to any problems and timely fix them and also be able to perform advanced problem determination to reduce total time for outages that already occurred. Equally important is to prevent problems from occurring based on best practices and knowledge of known problems/issues for specific software products. To achieve these goals, a powerful analysis engine capable of performing comprehensive health checks of customer systems and advanced problem determination based on analysis of customers' data is proposed. It can be used for both proactive and reactive customer support. Such an engine works as a virtual consultant for the end users. It detects potential problems related to customer systems and installed products and provides notifications or alerts proactively, i.e. could be considered as an early detection system. It is also capable of analyzing FFDC (First Failure Data Capture) data after a problem has occurred, comparing the data with well known problems and related symptoms from relevant knowledge databases and providing customers with the results of analysis, found matches of previously recorded problems and recommendations on how to fix the problem at hand. The engine proposed utilizes up to date analytics from subject matter experts and best practices encoded in it. In the present work, a system architecture and design of such an analysis engine is presented. The proposed engine has a low bar of adoption, flexible extensible design and could be easily adopted for any software product. It is able to analyze encoded human knowledge, compare collected customer data with available historical data and report problems and issues found along with the relevant recommendations and suggested fixes. More specifically, the engine provides a comprehensive analysis in terms of health checks, best practices compliance check, prerequisites check, end-of-service product check, operating environment and configuration setup check, outage prevention, state comparison, problem determination and others. A case study based on the proposed engine design is presented and discussed in more detail.","PeriodicalId":110778,"journal":{"name":"2012 Annual SRII Global Conference","volume":"48 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Analysis Engine for Automated Health Checks, Early Problem Detection and Advanced Problem Determination\",\"authors\":\"Yogendra K. Srivastava, A. Abrashkevich\",\"doi\":\"10.1109/SRII.2012.70\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With current trends in software industry toward increased complexity of modern software, tight integration of multiple software products, emphasis on software reliability and high-level availability, software support and maintenance costs increase dramatically. It is imperative for businesses to be able to monitor health of their systems making sure that they are performing at top levels, quickly respond to any problems and timely fix them and also be able to perform advanced problem determination to reduce total time for outages that already occurred. Equally important is to prevent problems from occurring based on best practices and knowledge of known problems/issues for specific software products. To achieve these goals, a powerful analysis engine capable of performing comprehensive health checks of customer systems and advanced problem determination based on analysis of customers' data is proposed. It can be used for both proactive and reactive customer support. Such an engine works as a virtual consultant for the end users. It detects potential problems related to customer systems and installed products and provides notifications or alerts proactively, i.e. could be considered as an early detection system. It is also capable of analyzing FFDC (First Failure Data Capture) data after a problem has occurred, comparing the data with well known problems and related symptoms from relevant knowledge databases and providing customers with the results of analysis, found matches of previously recorded problems and recommendations on how to fix the problem at hand. The engine proposed utilizes up to date analytics from subject matter experts and best practices encoded in it. In the present work, a system architecture and design of such an analysis engine is presented. The proposed engine has a low bar of adoption, flexible extensible design and could be easily adopted for any software product. It is able to analyze encoded human knowledge, compare collected customer data with available historical data and report problems and issues found along with the relevant recommendations and suggested fixes. More specifically, the engine provides a comprehensive analysis in terms of health checks, best practices compliance check, prerequisites check, end-of-service product check, operating environment and configuration setup check, outage prevention, state comparison, problem determination and others. A case study based on the proposed engine design is presented and discussed in more detail.\",\"PeriodicalId\":110778,\"journal\":{\"name\":\"2012 Annual SRII Global Conference\",\"volume\":\"48 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-07-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 Annual SRII Global Conference\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SRII.2012.70\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 Annual SRII Global Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SRII.2012.70","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

随着软件行业当前的趋势,现代软件的复杂性增加,多个软件产品的紧密集成,对软件可靠性和高级可用性的强调,软件支持和维护成本急剧增加。企业必须能够监控其系统的运行状况,确保它们在最高级别上运行,快速响应任何问题并及时修复它们,还必须能够执行高级问题确定,以减少已经发生的中断的总时间。同样重要的是,根据最佳实践和对特定软件产品的已知问题/问题的了解,防止问题发生。为了实现这些目标,提出了一个功能强大的分析引擎,能够对客户系统进行全面的健康检查,并基于对客户数据的分析进行高级问题确定。它既可以用于主动客户支持,也可以用于被动客户支持。这样的引擎可以作为最终用户的虚拟顾问。它检测与客户系统和已安装产品相关的潜在问题,并主动提供通知或警报,即可以视为早期检测系统。它还能够在问题发生后分析FFDC(首次故障数据捕获)数据,将数据与相关知识库中的已知问题和相关症状进行比较,并向客户提供分析结果,找到先前记录的问题的匹配项,并就如何解决当前问题提出建议。建议的引擎利用主题专家的最新分析和编码在其中的最佳实践。本文给出了该分析引擎的系统架构和设计。所提出的引擎具有较低的采用门槛,灵活的可扩展设计,并且可以很容易地用于任何软件产品。它能够分析编码的人类知识,将收集到的客户数据与可用的历史数据进行比较,并报告发现的问题以及相关建议和建议的修复方法。更具体地说,该引擎在运行状况检查、最佳实践遵从性检查、先决条件检查、服务终止产品检查、操作环境和配置设置检查、停机预防、状态比较、问题确定等方面提供了全面的分析。最后给出了基于所提出的发动机设计的一个实例,并进行了详细的讨论。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Analysis Engine for Automated Health Checks, Early Problem Detection and Advanced Problem Determination
With current trends in software industry toward increased complexity of modern software, tight integration of multiple software products, emphasis on software reliability and high-level availability, software support and maintenance costs increase dramatically. It is imperative for businesses to be able to monitor health of their systems making sure that they are performing at top levels, quickly respond to any problems and timely fix them and also be able to perform advanced problem determination to reduce total time for outages that already occurred. Equally important is to prevent problems from occurring based on best practices and knowledge of known problems/issues for specific software products. To achieve these goals, a powerful analysis engine capable of performing comprehensive health checks of customer systems and advanced problem determination based on analysis of customers' data is proposed. It can be used for both proactive and reactive customer support. Such an engine works as a virtual consultant for the end users. It detects potential problems related to customer systems and installed products and provides notifications or alerts proactively, i.e. could be considered as an early detection system. It is also capable of analyzing FFDC (First Failure Data Capture) data after a problem has occurred, comparing the data with well known problems and related symptoms from relevant knowledge databases and providing customers with the results of analysis, found matches of previously recorded problems and recommendations on how to fix the problem at hand. The engine proposed utilizes up to date analytics from subject matter experts and best practices encoded in it. In the present work, a system architecture and design of such an analysis engine is presented. The proposed engine has a low bar of adoption, flexible extensible design and could be easily adopted for any software product. It is able to analyze encoded human knowledge, compare collected customer data with available historical data and report problems and issues found along with the relevant recommendations and suggested fixes. More specifically, the engine provides a comprehensive analysis in terms of health checks, best practices compliance check, prerequisites check, end-of-service product check, operating environment and configuration setup check, outage prevention, state comparison, problem determination and others. A case study based on the proposed engine design is presented and discussed in more detail.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信