You can't debug what you can't see: Expanding observability with the OmniTable

Proceedings of the Workshop on Hot Topics in Operating Systems Pub Date : 2019-05-13 DOI:10.1145/3317550.3321428

Andrew Quinn, J. Flinn, Michael J. Cafarella

{"title":"You can't debug what you can't see: Expanding observability with the OmniTable","authors":"Andrew Quinn, J. Flinn, Michael J. Cafarella","doi":"10.1145/3317550.3321428","DOIUrl":null,"url":null,"abstract":"The effectiveness of a debugging tool is fundamentally limited by what program state it can observe. Yet, for performance reasons, all current debugging tools restrict the program state that can be observed in some way. For example, tools like heap analysis restrict what can be observed (i.e., only global variables) and tools like core dump analysis restrict when observations may be made (i.e., only on program termination). Other tools effectively limit the scope of observation by requiring developers to specify what and when observations will be made before execution (e.g., logging) or during an execution (e.g., gdb). We propose a new abstraction for debugging, called an OmniTable, that logically exposes unrestricted access to all program state at all points in an execution to developers. The OmniTable represents a program execution as a database-style table. Developers inspect the OmniTable using a familiar declarative query language: SQL. SQL simplifies the observation and analysis of large, complex execution state. Iterative queries are inherently consistent since they operate over the same logical table. Clearly, materializing the OmniTable for even a simple program is infeasible due to storage and processing overheads. Thus, our prototype, SteamDrill, selectively materializes only the regions of the OmniTable required to answer each query by using deterministic record and replay to reproduce the execution and dynamic instrumentation to extract needed state. By expressing debugging queries with relational logic, SteamDrill leverages proven optimizations such as query optimization and caching. In addition, decomposition into relational logic allows a query to be executed via repeated replays, each replay extracting information needed by the next, which can often be more efficient than extracting all information during a single execution.","PeriodicalId":224944,"journal":{"name":"Proceedings of the Workshop on Hot Topics in Operating Systems","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Workshop on Hot Topics in Operating Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3317550.3321428","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

Abstract

The effectiveness of a debugging tool is fundamentally limited by what program state it can observe. Yet, for performance reasons, all current debugging tools restrict the program state that can be observed in some way. For example, tools like heap analysis restrict what can be observed (i.e., only global variables) and tools like core dump analysis restrict when observations may be made (i.e., only on program termination). Other tools effectively limit the scope of observation by requiring developers to specify what and when observations will be made before execution (e.g., logging) or during an execution (e.g., gdb). We propose a new abstraction for debugging, called an OmniTable, that logically exposes unrestricted access to all program state at all points in an execution to developers. The OmniTable represents a program execution as a database-style table. Developers inspect the OmniTable using a familiar declarative query language: SQL. SQL simplifies the observation and analysis of large, complex execution state. Iterative queries are inherently consistent since they operate over the same logical table. Clearly, materializing the OmniTable for even a simple program is infeasible due to storage and processing overheads. Thus, our prototype, SteamDrill, selectively materializes only the regions of the OmniTable required to answer each query by using deterministic record and replay to reproduce the execution and dynamic instrumentation to extract needed state. By expressing debugging queries with relational logic, SteamDrill leverages proven optimizations such as query optimization and caching. In addition, decomposition into relational logic allows a query to be executed via repeated replays, each replay extracting information needed by the next, which can often be more efficient than extracting all information during a single execution.

查看原文本刊更多论文

你不能调试你看不到的东西:用omunitable扩展可观察性

调试工具的有效性从根本上受限于它所能观察到的程序状态。然而，出于性能原因，所有当前的调试工具都以某种方式限制了可以观察到的程序状态。例如，像堆分析这样的工具限制了可以观察的内容(即，只有全局变量)，而像核心转储分析这样的工具限制了什么时候可以进行观察(即，只在程序终止时进行观察)。其他工具通过要求开发人员指定在执行之前(例如，日志记录)或在执行期间(例如，gdb)进行什么和何时进行观察，有效地限制了观察的范围。我们提出了一种新的调试抽象，称为OmniTable，它在逻辑上向开发人员公开了对执行过程中所有点的所有程序状态的不受限制的访问。OmniTable将程序执行表示为数据库样式的表。开发人员使用一种熟悉的声明性查询语言:SQL来检查OmniTable。SQL简化了对大型复杂执行状态的观察和分析。迭代查询本质上是一致的，因为它们在相同的逻辑表上操作。显然，由于存储和处理开销，即使是简单的程序也无法实现OmniTable。因此，我们的原型SteamDrill通过使用确定性记录和重播来重现执行和动态检测以提取所需状态，选择性地实现回答每个查询所需的OmniTable区域。通过用关系逻辑表达调试查询，SteamDrill利用了查询优化和缓存等经过验证的优化。此外，将查询分解为关系逻辑允许通过重复重播执行查询，每次重播提取下一个重播所需的信息，这通常比在单个执行期间提取所有信息更有效。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the Workshop on Hot Topics in Operating Systems

自引率

0.00%

发文量