Micro-architectural anatomy of a commercial TCP/IP stack

IEEE International Workshop on Workload Characterization, 2004. WWC-7. 2004 Pub Date : 2004-10-25 DOI:10.1109/WWC.2004.1437394

R. Illikkal, R. Iyer, D. Newell

{"title":"Micro-architectural anatomy of a commercial TCP/IP stack","authors":"R. Illikkal, R. Iyer, D. Newell","doi":"10.1109/WWC.2004.1437394","DOIUrl":null,"url":null,"abstract":"Over the last couple of decades, computer architects and performance analysts have routinely attempted to profile the overhead of TCP/IP processing in an effort to understand where the time was spent. It is well understood that this is a rather difficult problem since the processing time is spread across various software modules such as the network stack, interrupt routines, drivers, O/S scheduler, etc. As a result, the problem of extracting the micro-architectural characteristics of TCP/IP processing is significantly more challenging. In this paper, we start by covering the previous attempts at this problem and show what existing tools can provide in terms of execution time characteristics. We then propose a detailed methodology that combines full-system simulation, cycle-accurate performance simulations and symbol annotation to provide a rich cycle-accurate view of TCP/IP packet processing execution. We discuss initial results based on our profiling methodology and discuss where the time is spent. This includes an analysis of micro-architectural characteristics (such as instruction breakdown, CPI, MPI and TLB misses on a state-of-the-art microprocessor).","PeriodicalId":240633,"journal":{"name":"IEEE International Workshop on Workload Characterization, 2004. WWC-7. 2004","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2004-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE International Workshop on Workload Characterization, 2004. WWC-7. 2004","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WWC.2004.1437394","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

Over the last couple of decades, computer architects and performance analysts have routinely attempted to profile the overhead of TCP/IP processing in an effort to understand where the time was spent. It is well understood that this is a rather difficult problem since the processing time is spread across various software modules such as the network stack, interrupt routines, drivers, O/S scheduler, etc. As a result, the problem of extracting the micro-architectural characteristics of TCP/IP processing is significantly more challenging. In this paper, we start by covering the previous attempts at this problem and show what existing tools can provide in terms of execution time characteristics. We then propose a detailed methodology that combines full-system simulation, cycle-accurate performance simulations and symbol annotation to provide a rich cycle-accurate view of TCP/IP packet processing execution. We discuss initial results based on our profiling methodology and discuss where the time is spent. This includes an analysis of micro-architectural characteristics (such as instruction breakdown, CPI, MPI and TLB misses on a state-of-the-art microprocessor).

查看原文本刊更多论文

商业TCP/IP栈的微观架构剖析

在过去的几十年里，计算机架构师和性能分析师经常尝试分析TCP/IP处理的开销，以了解时间花在哪里。众所周知，这是一个相当困难的问题，因为处理时间分布在各种软件模块上，如网络堆栈、中断例程、驱动程序、O/S调度程序等。因此，提取TCP/IP处理的微体系结构特征的问题更加具有挑战性。在本文中，我们首先介绍之前对这个问题的尝试，并展示现有工具在执行时间特征方面可以提供什么。然后，我们提出了一种详细的方法，该方法结合了全系统模拟、周期精确的性能模拟和符号注释，以提供TCP/IP数据包处理执行的丰富的周期精确视图。我们讨论了基于分析方法的初始结果，并讨论了时间花在哪里。这包括对微架构特征的分析(例如指令分解、CPI、MPI和最先进的微处理器上的TLB缺失)。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE International Workshop on Workload Characterization, 2004. WWC-7. 2004

自引率

0.00%

发文量