Prototype Implementation of a Goal-Based Software Health Management Service

M. Barry, G. Horvath
{"title":"Prototype Implementation of a Goal-Based Software Health Management Service","authors":"M. Barry, G. Horvath","doi":"10.1109/SMC-IT.2009.21","DOIUrl":null,"url":null,"abstract":"The FAILSAFE project is developing concepts and prototype implementations for software health management in mission-critical real-time embedded systems. The project unites features of the industry standard ARINC 653 Avionics Application Software Standard Interface and JPL's Mission Data System (MDS) technology. The ARINC 653 standard establishes requirements for the services provided by partitioned real-time operating systems. The MDS technology provides a state analysis method, canonical architecture, and software framework that facilitates the design and implementation of software-intensive complex systems. We use the MDS technology to provide the health management function for an ARINC 653 application implementation. In particular, we focus on showing how this combination enables reasoning about and recovering from application software problems. Our prototype application software mimics the Space Shuttle orbiter's abort control sequencer software task, which provides safety-related functions to manage vehicle performance during launch aborts. We turned this task into a goal-based function that, when working in concert with the software health manager, aims to work around software and hardware problems in order to maximize abort performance results. In order to make it a compelling demonstration for current aerospace initiatives, we additionally imposed on our prototype a number of requirements derived from NASA's Constellation Program. Lastly, the ARINC 653 standard imposes a number of requirements on the system integrator for developing the requisite error handler process. Under ARINC 653, the health monitoring (HM) service is invoked by an application calling the application error service or by the operating system or hardware detecting a fault. It is these HM and error process details that we implement with the MDS technology, showing how a state-analytic approach is appropriate for identifying fault determination details, and showing how the framework supports acting upon state estimation and control features in order to achieve safety-related goals. We describe herein the requirements, design, and implementation of our software health manager and the software under control. We provide details of the analysis and design for the Phase II prototype, and describe future directions for the remainder of Phase II and the new topics we plan to address in Phase III.","PeriodicalId":422009,"journal":{"name":"2009 Third IEEE International Conference on Space Mission Challenges for Information Technology","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 Third IEEE International Conference on Space Mission Challenges for Information Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SMC-IT.2009.21","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

Abstract

The FAILSAFE project is developing concepts and prototype implementations for software health management in mission-critical real-time embedded systems. The project unites features of the industry standard ARINC 653 Avionics Application Software Standard Interface and JPL's Mission Data System (MDS) technology. The ARINC 653 standard establishes requirements for the services provided by partitioned real-time operating systems. The MDS technology provides a state analysis method, canonical architecture, and software framework that facilitates the design and implementation of software-intensive complex systems. We use the MDS technology to provide the health management function for an ARINC 653 application implementation. In particular, we focus on showing how this combination enables reasoning about and recovering from application software problems. Our prototype application software mimics the Space Shuttle orbiter's abort control sequencer software task, which provides safety-related functions to manage vehicle performance during launch aborts. We turned this task into a goal-based function that, when working in concert with the software health manager, aims to work around software and hardware problems in order to maximize abort performance results. In order to make it a compelling demonstration for current aerospace initiatives, we additionally imposed on our prototype a number of requirements derived from NASA's Constellation Program. Lastly, the ARINC 653 standard imposes a number of requirements on the system integrator for developing the requisite error handler process. Under ARINC 653, the health monitoring (HM) service is invoked by an application calling the application error service or by the operating system or hardware detecting a fault. It is these HM and error process details that we implement with the MDS technology, showing how a state-analytic approach is appropriate for identifying fault determination details, and showing how the framework supports acting upon state estimation and control features in order to achieve safety-related goals. We describe herein the requirements, design, and implementation of our software health manager and the software under control. We provide details of the analysis and design for the Phase II prototype, and describe future directions for the remainder of Phase II and the new topics we plan to address in Phase III.
基于目标的软件健康管理服务的原型实现
FAILSAFE项目正在开发关键任务实时嵌入式系统中软件健康管理的概念和原型实现。该项目结合了工业标准ARINC 653航空电子应用软件标准接口和喷气推进实验室任务数据系统(MDS)技术的特点。ARINC 653标准为分区实时操作系统提供的服务建立了需求。MDS技术提供了一种状态分析方法、规范体系结构和软件框架,便于软件密集型复杂系统的设计和实现。我们使用MDS技术为arinc653应用程序实现提供运行状况管理功能。特别是,我们将重点展示这种组合如何能够对应用程序软件问题进行推理并从中恢复。我们的原型应用软件模拟了航天飞机轨道器的中止控制程序软件任务,该任务提供了在发射中止期间管理飞行器性能的安全相关功能。我们将此任务转换为基于目标的函数,该函数在与软件运行状况管理器协同工作时,旨在解决软件和硬件问题,以最大限度地提高中止性能结果。为了使它成为当前航空航天计划的一个引人注目的演示,我们在我们的原型上附加了一些来自NASA星座计划的要求。最后,ARINC 653标准对系统集成商提出了许多要求,以开发必要的错误处理程序。在ARINC 653下,运行状况监视(HM)服务由调用应用程序错误服务的应用程序或检测故障的操作系统或硬件调用。我们用MDS技术实现了这些HM和错误过程细节,展示了状态分析方法如何适用于识别故障确定细节,并展示了框架如何支持根据状态估计和控制特征进行操作,以实现与安全相关的目标。我们在此描述我们的软件运行状况管理器和受控软件的需求、设计和实现。我们提供了第二阶段原型的分析和设计细节,并描述了第二阶段剩余部分的未来方向以及我们计划在第三阶段解决的新主题。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信