IEEE/IFIP International Conference on Dependable Systems and Networks (DSN 2012)最新文献

Draco: Statistical diagnosis of chronic problems in large distributed systems 大型分布式系统中慢性问题的统计诊断

IEEE/IFIP International Conference on Dependable Systems and Networks (DSN 2012) Pub Date : 2012-06-25 DOI: 10.1109/DSN.2012.6263927

Soila Kavulya, S. Daniels, Kaustubh R. Joshi, M. Hiltunen, R. Gandhi, P. Narasimhan

{"title":"Draco: Statistical diagnosis of chronic problems in large distributed systems","authors":"Soila Kavulya, S. Daniels, Kaustubh R. Joshi, M. Hiltunen, R. Gandhi, P. Narasimhan","doi":"10.1109/DSN.2012.6263927","DOIUrl":"https://doi.org/10.1109/DSN.2012.6263927","url":null,"abstract":"Chronics are recurrent problems that often fly under the radar of operations teams because they do not affect enough users or service invocations to set off alarm thresholds. In contrast with major outages that are rare, often have a single cause, and as a result are relatively easy to detect and diagnose quickly, chronic problems are elusive because they are often triggered by complex conditions, persist in a system for days or weeks, and coexist with other problems active at the same time. In this paper, we present Draco, a scalable engine to diagnose chronics that addresses these issues by using a “top-down” approach that starts by heuristically identifying user interactions that are likely to have failed, e.g., dropped calls, and drills down to identify groups of properties that best explain the difference between failed and successful interactions by using a scalable Bayesian learner. We have deployed Draco in production for the VoIP operations of a major ISP. In addition to providing examples of chronics that Draco has helped identify, we show via a comprehensive evaluation on production data that Draco provided 97% coverage, had fewer than 4% false positives, and outperformed state-of-the-art diagnostic techniques by up to 56% for complex chronics.","PeriodicalId":236791,"journal":{"name":"IEEE/IFIP International Conference on Dependable Systems and Networks (DSN 2012)","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125227640","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 59

An empirical study of the robustness of Inter-component Communication in Android Android中组件间通信鲁棒性的实证研究

IEEE/IFIP International Conference on Dependable Systems and Networks (DSN 2012) Pub Date : 2012-06-25 DOI: 10.1109/DSN.2012.6263963

A. Maji, F. Arshad, S. Bagchi, Jan S. Rellermeyer

引用次数: 103

EliMet: Security metric elicitation in power grid critical infrastructures by observing system administrators' responsive behavior EliMet:通过观察系统管理员的响应行为，得出电网关键基础设施中的安全度量

IEEE/IFIP International Conference on Dependable Systems and Networks (DSN 2012) Pub Date : 2012-06-25 DOI: 10.1109/DSN.2012.6263941

S. Zonouz, A. Houmansadr, P. Haghani

{"title":"EliMet: Security metric elicitation in power grid critical infrastructures by observing system administrators' responsive behavior","authors":"S. Zonouz, A. Houmansadr, P. Haghani","doi":"10.1109/DSN.2012.6263941","DOIUrl":"https://doi.org/10.1109/DSN.2012.6263941","url":null,"abstract":"To protect complex power-grid control networks, efficient security assessment techniques are required. However, efficiently making sure that calculated security measures match the expert knowledge is a challenging endeavor. In this paper, we present EliMet, a framework that combines information from different sources and estimates the extent to which a control network meets its security objective. Initially, during an offline phase, a state-based model of the network is generated, and security-level of each state is measured using a generic and easy-to-compute metric. EliMet then passively observes system operators' online reactive behavior against security incidents, and accordingly refines the calculated security measure values. Finally, to make the values comply with the expert knowledge, EliMet actively queries operators regarding those states for which sufficient information was not gained during the passive observation. Our experimental results show that EliMet can optimally make use of prior knowledge as well as automated inference techniques to minimize human involvement and efficiently deduce the expert knowledge regarding individual states of that particular system.","PeriodicalId":236791,"journal":{"name":"IEEE/IFIP International Conference on Dependable Systems and Networks (DSN 2012)","volume":"21 2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124667097","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 13

Latent fault detection in large scale services 大规模业务的潜在故障检测

IEEE/IFIP International Conference on Dependable Systems and Networks (DSN 2012) Pub Date : 2012-06-25 DOI: 10.1109/DSN.2012.6263932

Moshe Gabel, A. Schuster, Ran Gilad-Bachrach, N. Bjørner

引用次数: 23

A new symbolic approach for network reliability analysis 一种新的网络可靠性分析符号方法

IEEE/IFIP International Conference on Dependable Systems and Networks (DSN 2012) Pub Date : 2012-06-25 DOI: 10.1109/DSN.2012.6263935

M. Beccuti, A. Bobbio, G. Franceschinis, R. Terruggia

引用次数: 5

Error injection-based study of soft error propagation in AMD Bulldozer microprocessor module 基于误差注入的AMD推土机微处理器模块软误差传播研究

IEEE/IFIP International Conference on Dependable Systems and Networks (DSN 2012) Pub Date : 2012-06-25 DOI: 10.1109/DSN.2012.6263922

C. Constantinescu, Mike Butler, Chris Weller

引用次数: 9

Taming Mr Hayes: Mitigating signaling based attacks on smartphones 驯服海耶斯:减轻针对智能手机的基于信号的攻击

IEEE/IFIP International Conference on Dependable Systems and Networks (DSN 2012) Pub Date : 2012-06-25 DOI: 10.1109/DSN.2012.6263943

Collin Mulliner, Steffen Liebergeld, Matthias Lange, Jean-Pierre Seifert

引用次数: 16

Scalable optimal countermeasure selection using implicit enumeration on attack countermeasure trees 基于攻击对策树隐式枚举的可扩展最优对策选择

IEEE/IFIP International Conference on Dependable Systems and Networks (DSN 2012) Pub Date : 2012-06-25 DOI: 10.1109/DSN.2012.6263940

A. Roy, Dong Seong Kim, Kishor S. Trivedi

{"title":"Scalable optimal countermeasure selection using implicit enumeration on attack countermeasure trees","authors":"A. Roy, Dong Seong Kim, Kishor S. Trivedi","doi":"10.1109/DSN.2012.6263940","DOIUrl":"https://doi.org/10.1109/DSN.2012.6263940","url":null,"abstract":"Constraints such as limited security investment cost precludes a security decision maker from implementing all possible countermeasures in a system. Existing analytical model-based security optimization strategies do not prevail for the following reasons: (i) none of these model-based methods offer a way to find optimal security solution in the absence of probability assignments to the model, (ii) methods scale badly as size of the system to model increases and (iii) some methods suffer as they use attack trees (AT) whose structure does not allow for the inclusion of countermeasures while others translate the non-state-space model (e.g., attack response tree) into a state-space model hence causing state-space explosion. In this paper, we use a novel AT paradigm called attack countermeasure tree (ACT) whose structure takes into account attacks as well as countermeasures (in the form of detection and mitigation events). We use greedy and branch and bound techniques to study several objective functions with goals such as minimizing the number of countermeasures, security investment cost in the ACT and maximizing the benefit from implementing a certain countermeasure set in the ACT under different constraints. We cast each optimization problem into an integer programming problem which also allows us to find optimal solution even in the absence of probability assignments to the model. Our method scales well for large ACTs and we compare its efficiency with other approaches.","PeriodicalId":236791,"journal":{"name":"IEEE/IFIP International Conference on Dependable Systems and Networks (DSN 2012)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121582083","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 104

A study of soft error consequences in hard disk drives 硬盘驱动器中软错误后果的研究

IEEE/IFIP International Conference on Dependable Systems and Networks (DSN 2012) Pub Date : 2012-06-25 DOI: 10.1109/DSN.2012.6263936

T. Tsai, Nawanol Theera-Ampornpunt, S. Bagchi

引用次数: 15

Model-driven consolidation of Java workloads on multicores 多核上Java工作负载的模型驱动整合

IEEE/IFIP International Conference on Dependable Systems and Networks (DSN 2012) Pub Date : 2012-06-25 DOI: 10.1109/DSN.2012.6263928

Danilo Ansaloni, L. Chen, E. Smirni, Walter Binder

引用次数: 19