分布式实时系统中可分配线程故障恢复的最佳实时性保证

10th IEEE International Symposium on Object and Component-Oriented Real-Time Distributed Computing (ISORC'07) Pub Date : 2007-05-07 DOI:10.1109/ISORC.2007.45

B. Ravindran, Edward Curley, J. Anderson, E. Jensen

{"title":"分布式实时系统中可分配线程故障恢复的最佳实时性保证","authors":"B. Ravindran, Edward Curley, J. Anderson, E. Jensen","doi":"10.1109/ISORC.2007.45","DOIUrl":null,"url":null,"abstract":"We consider the problem of recovering from failures of distributable threads in distributed real-time systems that operate under run-time uncertainties including those on thread execution times, thread arrivals, and node failure occurrences. When a thread encounters a node failure, it causes orphans. Under a termination model, the orphans must be detected and aborted, and exceptions must be delivered to farthest, contiguous surviving thread segment for resuming thread execution. Our application/scheduling model includes distributable threads and their exception handlers that are subject to time/utility function (TUF) time constraints and a utility accrual (UA) optimality criterion. A key underpinning of the TUF/UA scheduling paradigm is the notion of \"best-effort\" where high importance threads are always favored over low importance ones, irrespective of thread urgency. We present a scheduling algorithm called HUA and a thread integrity protocol called TPR. We show that HUA and TPR bound the orphan cleanup and recovery time with bounded loss of the best-effort property. Our implementation experience of HUA/TPR within Sun's distributed real-time specification for Java demonstrates the algorithm/protocol's effectiveness","PeriodicalId":265471,"journal":{"name":"10th IEEE International Symposium on Object and Component-Oriented Real-Time Distributed Computing (ISORC'07)","volume":"51 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":"{\"title\":\"On Best-Effort Real-Time Assurances for Recovering from Distributable Thread Failures in Distributed Real-Time Systems\",\"authors\":\"B. Ravindran, Edward Curley, J. Anderson, E. Jensen\",\"doi\":\"10.1109/ISORC.2007.45\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We consider the problem of recovering from failures of distributable threads in distributed real-time systems that operate under run-time uncertainties including those on thread execution times, thread arrivals, and node failure occurrences. When a thread encounters a node failure, it causes orphans. Under a termination model, the orphans must be detected and aborted, and exceptions must be delivered to farthest, contiguous surviving thread segment for resuming thread execution. Our application/scheduling model includes distributable threads and their exception handlers that are subject to time/utility function (TUF) time constraints and a utility accrual (UA) optimality criterion. A key underpinning of the TUF/UA scheduling paradigm is the notion of \\\"best-effort\\\" where high importance threads are always favored over low importance ones, irrespective of thread urgency. We present a scheduling algorithm called HUA and a thread integrity protocol called TPR. We show that HUA and TPR bound the orphan cleanup and recovery time with bounded loss of the best-effort property. Our implementation experience of HUA/TPR within Sun's distributed real-time specification for Java demonstrates the algorithm/protocol's effectiveness\",\"PeriodicalId\":265471,\"journal\":{\"name\":\"10th IEEE International Symposium on Object and Component-Oriented Real-Time Distributed Computing (ISORC'07)\",\"volume\":\"51 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2007-05-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"11\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"10th IEEE International Symposium on Object and Component-Oriented Real-Time Distributed Computing (ISORC'07)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISORC.2007.45\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"10th IEEE International Symposium on Object and Component-Oriented Real-Time Distributed Computing (ISORC'07)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISORC.2007.45","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 11

摘要

我们考虑了分布式实时系统中运行时不确定性(包括线程执行时间、线程到达和节点故障发生)的可分发线程故障恢复问题。当线程遇到节点故障时，它会导致孤儿。在终止模型下，必须检测并终止孤儿线程，并且必须将异常传递到最远的、连续的存活线程段以恢复线程执行。我们的应用程序/调度模型包括受时间/效用函数(TUF)时间约束和效用累积(UA)最优性标准约束的可分发线程及其异常处理程序。TUF/UA调度范式的一个关键基础是“尽力而为”的概念，即无论线程的紧急程度如何，高重要性线程总是比低重要性线程更受欢迎。提出了一种称为HUA的调度算法和一种称为TPR的线程完整性协议。我们证明了HUA和TPR约束了孤儿清理和恢复时间，并且最大努力性质的损失是有限的。我们在Sun的Java分布式实时规范中实现HUA/TPR的经验证明了该算法/协议的有效性

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

On Best-Effort Real-Time Assurances for Recovering from Distributable Thread Failures in Distributed Real-Time Systems

We consider the problem of recovering from failures of distributable threads in distributed real-time systems that operate under run-time uncertainties including those on thread execution times, thread arrivals, and node failure occurrences. When a thread encounters a node failure, it causes orphans. Under a termination model, the orphans must be detected and aborted, and exceptions must be delivered to farthest, contiguous surviving thread segment for resuming thread execution. Our application/scheduling model includes distributable threads and their exception handlers that are subject to time/utility function (TUF) time constraints and a utility accrual (UA) optimality criterion. A key underpinning of the TUF/UA scheduling paradigm is the notion of "best-effort" where high importance threads are always favored over low importance ones, irrespective of thread urgency. We present a scheduling algorithm called HUA and a thread integrity protocol called TPR. We show that HUA and TPR bound the orphan cleanup and recovery time with bounded loss of the best-effort property. Our implementation experience of HUA/TPR within Sun's distributed real-time specification for Java demonstrates the algorithm/protocol's effectiveness

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

10th IEEE International Symposium on Object and Component-Oriented Real-Time Distributed Computing (ISORC'07)

自引率

0.00%

发文量