The Case for Software Health Management

A. Srivastava, J. Schumann
{"title":"The Case for Software Health Management","authors":"A. Srivastava, J. Schumann","doi":"10.1109/SMC-IT.2011.14","DOIUrl":null,"url":null,"abstract":"Software Health Management (SWHM) is a new field that is concerned with the development of tools and technologies to enable automated detection, diagnosis, prediction, and mitigation of adverse events due to software anomalies. Significant effort has been expended in the last several decades in the development of verification and validation (\\VV) methods for software intensive systems, but it is becoming increasingly more apparent that this is not enough to guarantee that a complex software system meets all safety and reliability requirements. %Moreover, error handling techniques are usually concerned with the detection %and isolation of these faults when they occur. Modern software systems can exhibit a variety of failure modes which can go undetected in a verification and validation process. While standard techniques for error handling, fault detection and isolation can have significant benefits for many systems, it is becoming increasingly evident that new technologies and methods are necessary for the development of techniques to detect, diagnose, predict, and then mitigate the adverse events due to software that has {\\em already undergone} significant verification and validation procedures. These software faults often arise due to the interaction between the software and the operating environment. Unanticipated environmental changes lead to software anomalies that may have significant impact on the overall success of the mission. Because software is ubiquitous, it is not sufficient that errors are detected only after they occur. Rather, software must be instrumented and monitored for failures before they happen. This prognostic capability will yield safer and more dependable systems for the future. This paper addresses the motivation, needs, and requirements of software health management as a new discipline.","PeriodicalId":403272,"journal":{"name":"2011 IEEE Fourth International Conference on Space Mission Challenges for Information Technology","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"31","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 IEEE Fourth International Conference on Space Mission Challenges for Information Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SMC-IT.2011.14","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 31

Abstract

Software Health Management (SWHM) is a new field that is concerned with the development of tools and technologies to enable automated detection, diagnosis, prediction, and mitigation of adverse events due to software anomalies. Significant effort has been expended in the last several decades in the development of verification and validation (\VV) methods for software intensive systems, but it is becoming increasingly more apparent that this is not enough to guarantee that a complex software system meets all safety and reliability requirements. %Moreover, error handling techniques are usually concerned with the detection %and isolation of these faults when they occur. Modern software systems can exhibit a variety of failure modes which can go undetected in a verification and validation process. While standard techniques for error handling, fault detection and isolation can have significant benefits for many systems, it is becoming increasingly evident that new technologies and methods are necessary for the development of techniques to detect, diagnose, predict, and then mitigate the adverse events due to software that has {\em already undergone} significant verification and validation procedures. These software faults often arise due to the interaction between the software and the operating environment. Unanticipated environmental changes lead to software anomalies that may have significant impact on the overall success of the mission. Because software is ubiquitous, it is not sufficient that errors are detected only after they occur. Rather, software must be instrumented and monitored for failures before they happen. This prognostic capability will yield safer and more dependable systems for the future. This paper addresses the motivation, needs, and requirements of software health management as a new discipline.
软件健康管理案例
软件健康管理(SWHM)是一个新领域,涉及开发工具和技术,以实现由软件异常引起的不良事件的自动检测、诊断、预测和缓解。在过去的几十年里,在软件密集型系统的验证和确认(VV)方法的开发上花费了大量的精力,但是越来越明显的是,这不足以保证一个复杂的软件系统满足所有的安全性和可靠性要求。此外,错误处理技术通常与这些错误发生时的检测和隔离有关。现代软件系统可以表现出各种各样的故障模式,这些模式在验证和确认过程中可能无法检测到。虽然错误处理、故障检测和隔离的标准技术可以为许多系统带来显著的好处,但越来越明显的是,新技术和方法对于检测、诊断、预测和减轻由于软件已经经历了重要的验证和确认程序而导致的不良事件的技术开发是必要的。这些软件故障通常是由于软件与运行环境的交互而产生的。意想不到的环境变化会导致软件异常,这可能会对任务的整体成功产生重大影响。由于软件无处不在,因此仅在错误发生后才检测到错误是不够的。相反,必须在故障发生之前对软件进行检测和监控。这种预测能力将为未来提供更安全、更可靠的系统。本文阐述了软件健康管理作为一门新学科的动机、需要和要求。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信