用于可靠性评估和改进的Linux故障数据集

2022 52nd Annual IEEE/IFIP International Conference on Dependable Systems and Networks Workshops (DSN-W) Pub Date : 2022-06-01 DOI:10.1109/dsn-w54100.2022.00024

João R. Campos, Ernesto Costa, M. Vieira

{"title":"用于可靠性评估和改进的Linux故障数据集","authors":"João R. Campos, Ernesto Costa, M. Vieira","doi":"10.1109/dsn-w54100.2022.00024","DOIUrl":null,"url":null,"abstract":"Software systems are now used to execute critical tasks on a daily basis. As a result, unhandled or uncontrolled failures at runtime may lead to non-negligible risks or losses. To mitigate this, considerable effort and resources have been dedicated to assessing and improving the dependability of such systems. However, researching novel techniques to develop more dependable systems requires access to rich and detailed data. As data from real systems are not typically available, researchers often look for alternative processes, such as fault injection, to generate realistic synthetic data. As this requires considerable effort and expertise, researchers frequently rely on outdated datasets or develop simplified processes to collect data, eventually compromising the validation and development of their methods. This paper presents, discusses, and makes available a large failure dataset collected from an up-to-date Linux kernel through fault injection. It provides a detailed characterization of the target system by continuously monitoring hundreds of system metrics and various system logs throughout the experiments. Ultimately, the goal is to provide a reliable, well-defined, and properly generated dataset that can be used to research techniques to support the development of more dependable systems.","PeriodicalId":349937,"journal":{"name":"2022 52nd Annual IEEE/IFIP International Conference on Dependable Systems and Networks Workshops (DSN-W)","volume":"175 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A Dataset of Linux Failure Data for Dependability Evaluation and Improvement\",\"authors\":\"João R. Campos, Ernesto Costa, M. Vieira\",\"doi\":\"10.1109/dsn-w54100.2022.00024\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Software systems are now used to execute critical tasks on a daily basis. As a result, unhandled or uncontrolled failures at runtime may lead to non-negligible risks or losses. To mitigate this, considerable effort and resources have been dedicated to assessing and improving the dependability of such systems. However, researching novel techniques to develop more dependable systems requires access to rich and detailed data. As data from real systems are not typically available, researchers often look for alternative processes, such as fault injection, to generate realistic synthetic data. As this requires considerable effort and expertise, researchers frequently rely on outdated datasets or develop simplified processes to collect data, eventually compromising the validation and development of their methods. This paper presents, discusses, and makes available a large failure dataset collected from an up-to-date Linux kernel through fault injection. It provides a detailed characterization of the target system by continuously monitoring hundreds of system metrics and various system logs throughout the experiments. Ultimately, the goal is to provide a reliable, well-defined, and properly generated dataset that can be used to research techniques to support the development of more dependable systems.\",\"PeriodicalId\":349937,\"journal\":{\"name\":\"2022 52nd Annual IEEE/IFIP International Conference on Dependable Systems and Networks Workshops (DSN-W)\",\"volume\":\"175 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 52nd Annual IEEE/IFIP International Conference on Dependable Systems and Networks Workshops (DSN-W)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/dsn-w54100.2022.00024\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 52nd Annual IEEE/IFIP International Conference on Dependable Systems and Networks Workshops (DSN-W)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/dsn-w54100.2022.00024","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

软件系统现在用于执行日常的关键任务。因此，运行时未处理或不受控制的故障可能导致不可忽视的风险或损失。为了减轻这种情况，已经投入了大量的精力和资源来评估和提高这些系统的可靠性。然而，研究新技术以开发更可靠的系统需要获得丰富而详细的数据。由于来自真实系统的数据通常不可用，研究人员经常寻找替代方法，例如断层注入，以生成真实的合成数据。由于这需要相当大的努力和专业知识，研究人员经常依赖过时的数据集或开发简化的流程来收集数据，最终损害了其方法的验证和开发。本文介绍、讨论并提供了一个通过故障注入从最新的Linux内核收集的大型故障数据集。它通过在整个实验过程中连续监视数百个系统度量和各种系统日志，提供了目标系统的详细特征。最终的目标是提供一个可靠的、定义良好的、正确生成的数据集，该数据集可用于研究技术，以支持更可靠系统的开发。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A Dataset of Linux Failure Data for Dependability Evaluation and Improvement

Software systems are now used to execute critical tasks on a daily basis. As a result, unhandled or uncontrolled failures at runtime may lead to non-negligible risks or losses. To mitigate this, considerable effort and resources have been dedicated to assessing and improving the dependability of such systems. However, researching novel techniques to develop more dependable systems requires access to rich and detailed data. As data from real systems are not typically available, researchers often look for alternative processes, such as fault injection, to generate realistic synthetic data. As this requires considerable effort and expertise, researchers frequently rely on outdated datasets or develop simplified processes to collect data, eventually compromising the validation and development of their methods. This paper presents, discusses, and makes available a large failure dataset collected from an up-to-date Linux kernel through fault injection. It provides a detailed characterization of the target system by continuously monitoring hundreds of system metrics and various system logs throughout the experiments. Ultimately, the goal is to provide a reliable, well-defined, and properly generated dataset that can be used to research techniques to support the development of more dependable systems.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 52nd Annual IEEE/IFIP International Conference on Dependable Systems and Networks Workshops (DSN-W)

自引率

0.00%

发文量