2009 15th IEEE Pacific Rim International Symposium on Dependable Computing最新文献_第5页

Variation-Aware Scheduling for Chip Multiprocessors with Thread Level Redundancy 具有线程级冗余的芯片多处理器的变化感知调度

2009 15th IEEE Pacific Rim International Symposium on Dependable Computing Pub Date : 2009-11-16 DOI: 10.1109/PRDC.2009.12

Jianbo Dong, Lei Zhang, Yinhe Han, Guihai Yan, Xiaowei Li

引用次数: 5

A Test Vector Compression/Decompression Scheme Based on Logic Operation between Adjacent Bits (LOBAB) Coding 基于LOBAB编码的测试向量压缩/解压缩方案

2009 15th IEEE Pacific Rim International Symposium on Dependable Computing Pub Date : 2009-11-16 DOI: 10.1109/PRDC.2009.11

Huaguo Liang, Wenfa Zhan, Q. Luo, Cuiyun Jiang

引用次数: 0

Quiescent Leader Election in Crash-Recovery Systems 崩溃恢复系统中的静态领导人选举

2009 15th IEEE Pacific Rim International Symposium on Dependable Computing Pub Date : 2009-11-16 DOI: 10.1109/PRDC.2009.58

M. Larrea, Cristian Martín

引用次数: 4

Chip-Level Redundancy in Distributed Shared-Memory Multiprocessors 分布式共享内存多处理器中的芯片级冗余

2009 15th IEEE Pacific Rim International Symposium on Dependable Computing Pub Date : 2009-11-16 DOI: 10.1109/PRDC.2009.39

Brian T. Gold, B. Falsafi, J. Hoe

引用次数: 6

A New Approach to Automated Redundancy Reduction for Test Sequences 一种测试序列自动冗余削减的新方法

2009 15th IEEE Pacific Rim International Symposium on Dependable Computing Pub Date : 2009-11-16 DOI: 10.1109/PRDC.2009.23

Huai-kou Miao, Pan Liu, Jia Mei, Hong-wei Zeng

引用次数: 6

Reliability Analysis of Single Bus Communication with Real-Time Requirements 具有实时性要求的单总线通信可靠性分析

2009 15th IEEE Pacific Rim International Symposium on Dependable Computing Pub Date : 2009-11-16 DOI: 10.1109/PRDC.2009.10

M. Sebastian, R. Ernst

引用次数: 18

Zapmem: A Framework for Testing the Effect of Memory Corruption Errors on Operating System Kernel Reliability 一个测试内存损坏错误对操作系统内核可靠性影响的框架

2009 15th IEEE Pacific Rim International Symposium on Dependable Computing Pub Date : 2009-11-16 DOI: 10.1109/PRDC.2009.53

Roberto Jung Drebes, T. Nanya

{"title":"Zapmem: A Framework for Testing the Effect of Memory Corruption Errors on Operating System Kernel Reliability","authors":"Roberto Jung Drebes, T. Nanya","doi":"10.1109/PRDC.2009.53","DOIUrl":"https://doi.org/10.1109/PRDC.2009.53","url":null,"abstract":"While monolithic operating system kernels are composed of many subsystems, during runtime they all share a common address space, making fault propagation a serious issue. The code quality of each subsystem is different, as OS development is a complex task commonly divided by different groups with different degrees of expertise. Since the memory space into which this code runs is shared, the occurrence of bugs or errors in one of the subsystems may propagate to others and affect general OS reliability. It is necessary, then, to test how errors propagate between the different kernel subsystems and how they affect reliability. This work presents a simple new technique to inject memory corruption faults and Zapmem, a fault injection tool which uses such technique to test the effect on reliability from memory corruption of statically allocated kernel data. Zapmem associates the runtime memory addresses to the corresponding high level (source code) memory structure definitions, which indicate which kernel subsystem allocated that memory region, and the tool has minimal intrusiveness, as our technique does not require kernel instrumentation. The efficacy of our approach and preliminary results are also presented.","PeriodicalId":356141,"journal":{"name":"2009 15th IEEE Pacific Rim International Symposium on Dependable Computing","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121285396","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Online Computing and Predicting Architectural Vulnerability Factor of Microprocessor Structures 微处理器结构脆弱性因子的在线计算与预测

2009 15th IEEE Pacific Rim International Symposium on Dependable Computing Pub Date : 2009-11-16 DOI: 10.1109/PRDC.2009.61

Songjun Pan, Yu Hu, Xiaowei Li

{"title":"Online Computing and Predicting Architectural Vulnerability Factor of Microprocessor Structures","authors":"Songjun Pan, Yu Hu, Xiaowei Li","doi":"10.1109/PRDC.2009.61","DOIUrl":"https://doi.org/10.1109/PRDC.2009.61","url":null,"abstract":"Soft Errors have emerged as a key challenge to microprocessor design. Traditional soft error tolerance techniques (such as redundant multithreading and instruction duplication) can achieve high fault coverage but at the cost of significant performance degradation. Prior research reports that soft errors can be masked at the architecture level, and the degree of such masking, named as architecture vulnerability factor (AVF), can vary significantly across workloads and individual structures, hence strict redundant execution may not be necessary for soft error tolerance. In this work, we exploit the AVF varying feature to adaptively tune reliability and performance. We present an infrastructure to online compute and predict AVF for three microprocessor structures (IQ, ROB, and LSQ), guiding when the protection scheme should be activated to improve reliability. Experimental results show that our method can efficiently compute the AVF for different structures independent of hardware configurations. The average differences between our method and a prior offline AVF computing method are 0.10, 0.01, and 0.039 for IQ, ROB, and LSQ, respectively.","PeriodicalId":356141,"journal":{"name":"2009 15th IEEE Pacific Rim International Symposium on Dependable Computing","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132047738","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 6

A Novel Process Migration Method for MPI Applications 一种新的MPI应用进程迁移方法

2009 15th IEEE Pacific Rim International Symposium on Dependable Computing Pub Date : 2009-11-16 DOI: 10.1109/PRDC.2009.46

Tiantian Liu, Zhongmin Ma, Zhonghong Ou

引用次数: 5

Fault-Tolerant Event Detection Using Two Thresholds in Wireless Sensor Networks 基于双阈值的无线传感器网络容错事件检测

2009 15th IEEE Pacific Rim International Symposium on Dependable Computing Pub Date : 2009-11-16 DOI: 10.1109/PRDC.2009.59

S. Yim, Yoon-Hwa Choi

引用次数: 9