Columbo: Low Level End-to-End System Traces through Modular Full-System Simulation

Jakob Görgen, Vaastav Anand, Hejing Li, Jialin Li, Antoine Kaufmann
{"title":"Columbo: Low Level End-to-End System Traces through Modular Full-System Simulation","authors":"Jakob Görgen, Vaastav Anand, Hejing Li, Jialin Li, Antoine Kaufmann","doi":"arxiv-2408.05251","DOIUrl":null,"url":null,"abstract":"Fully understanding performance is a growing challenge when building\nnext-generation cloud systems. Often these systems build on next-generation\nhardware, and evaluation in realistic physical testbeds is out of reach. Even\nwhen physical testbeds are available, visibility into essential system aspects\nis a challenge in modern systems where system performance depends on often\nsub-$\\mu s$ interactions between HW and SW components. Existing tools such as\nperformance counters, logging, and distributed tracing provide aggregate or\nsampled information, but remain insufficient for understanding individual\nrequests in-depth. In this paper, we explore a fundamentally different approach\nto enable in-depth understanding of cloud system behavior at the software and\nhardware level, with (almost) arbitrarily fine-grained visibility. Our proposal\nis to run cloud systems in detailed full-system simulations, configure the\nsimulators to collect detailed events without affecting the system, and finally\nassemble these events into end-to-end system traces that can be analyzed by\nexisting distributed tracing tools.","PeriodicalId":501291,"journal":{"name":"arXiv - CS - Performance","volume":"24 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Performance","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.05251","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Fully understanding performance is a growing challenge when building next-generation cloud systems. Often these systems build on next-generation hardware, and evaluation in realistic physical testbeds is out of reach. Even when physical testbeds are available, visibility into essential system aspects is a challenge in modern systems where system performance depends on often sub-$\mu s$ interactions between HW and SW components. Existing tools such as performance counters, logging, and distributed tracing provide aggregate or sampled information, but remain insufficient for understanding individual requests in-depth. In this paper, we explore a fundamentally different approach to enable in-depth understanding of cloud system behavior at the software and hardware level, with (almost) arbitrarily fine-grained visibility. Our proposal is to run cloud systems in detailed full-system simulations, configure the simulators to collect detailed events without affecting the system, and finally assemble these events into end-to-end system traces that can be analyzed by existing distributed tracing tools.
科伦坡通过模块化全系统仿真进行低水平端到端系统跟踪
在构建下一代云系统时,充分了解性能是一个日益严峻的挑战。这些系统通常基于下一代硬件构建,在现实的物理测试平台上进行评估遥不可及。在现代系统中,系统性能往往取决于硬件和软件组件之间的交互,即使有物理测试平台,系统基本方面的可见性也是一个挑战。现有的工具,如性能计数器、日志记录和分布式跟踪可提供汇总或采样信息,但仍不足以深入了解单个请求。在本文中,我们将探索一种根本不同的方法,以便在软件和硬件层面深入了解云系统的行为,并实现(几乎)任意细粒度的可视性。我们的建议是在详细的全系统模拟中运行云系统,配置模拟器以在不影响系统的情况下收集详细事件,最后将这些事件组合成端到端系统跟踪,现有的分布式跟踪工具可对其进行分析。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信