不同架构的Web服务器性能比较:使用高并发工作负载的案例研究

2015 Third IEEE Workshop on Hot Topics in Web Systems and Technologies (HotWeb) Pub Date : 2015-11-12 DOI:10.1109/HotWeb.2015.11

Qingwen Fan, Qingyang Wang

{"title":"不同架构的Web服务器性能比较:使用高并发工作负载的案例研究","authors":"Qingwen Fan, Qingyang Wang","doi":"10.1109/HotWeb.2015.11","DOIUrl":null,"url":null,"abstract":"Responsiveness is increasingly important for web servers to seamlessly interact with end-users and enhance user experience. In this paper, we studied how different server architectures -- asynchronous and thread-based -- impact the responsiveness of web servers under high concurrency workload. Through extensive measurements of a standard web server benchmark (Apache Bench), we show that the web servers with asynchronous architecture can achieve much better tail-latency than the thread-based version due to their robustness to handle high concurrency workload. Our fine-grained timeline analysis shows that a thread-based server is fragile to high concurrency workload because of its limited queue size (e.g., limited by thread pool size) for high concurrent requests, causing queue overflow and requests with very long response time due to TCP retransmissions. On the other hand, if we configure a thread-based server with large thread pool size to avoid queue overflow, the maximum achievable throughput can be significantly lower than that of the asynchronous version due to the multi-threading overhead. Our initial results suggest that asynchronous architecture should be considered to construct high responsive and robust web applications that involve hundreds of servers in cloud data centers.","PeriodicalId":252318,"journal":{"name":"2015 Third IEEE Workshop on Hot Topics in Web Systems and Technologies (HotWeb)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"18","resultStr":"{\"title\":\"Performance Comparison of Web Servers with Different Architectures: A Case Study Using High Concurrency Workload\",\"authors\":\"Qingwen Fan, Qingyang Wang\",\"doi\":\"10.1109/HotWeb.2015.11\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Responsiveness is increasingly important for web servers to seamlessly interact with end-users and enhance user experience. In this paper, we studied how different server architectures -- asynchronous and thread-based -- impact the responsiveness of web servers under high concurrency workload. Through extensive measurements of a standard web server benchmark (Apache Bench), we show that the web servers with asynchronous architecture can achieve much better tail-latency than the thread-based version due to their robustness to handle high concurrency workload. Our fine-grained timeline analysis shows that a thread-based server is fragile to high concurrency workload because of its limited queue size (e.g., limited by thread pool size) for high concurrent requests, causing queue overflow and requests with very long response time due to TCP retransmissions. On the other hand, if we configure a thread-based server with large thread pool size to avoid queue overflow, the maximum achievable throughput can be significantly lower than that of the asynchronous version due to the multi-threading overhead. Our initial results suggest that asynchronous architecture should be considered to construct high responsive and robust web applications that involve hundreds of servers in cloud data centers.\",\"PeriodicalId\":252318,\"journal\":{\"name\":\"2015 Third IEEE Workshop on Hot Topics in Web Systems and Technologies (HotWeb)\",\"volume\":\"4 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-11-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"18\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 Third IEEE Workshop on Hot Topics in Web Systems and Technologies (HotWeb)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/HotWeb.2015.11\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 Third IEEE Workshop on Hot Topics in Web Systems and Technologies (HotWeb)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HotWeb.2015.11","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 18

摘要

响应性对于web服务器与最终用户无缝交互和增强用户体验越来越重要。在本文中，我们研究了不同的服务器架构——异步的和基于线程的——如何在高并发工作负载下影响web服务器的响应性。通过对标准web服务器基准(Apache Bench)的广泛测量，我们表明异步架构的web服务器可以实现比基于线程的版本更好的尾部延迟，因为它们具有处理高并发工作负载的鲁棒性。我们的细粒度时间线分析表明，基于线程的服务器对高并发工作负载很脆弱，因为它对高并发请求的队列大小有限(例如，受线程池大小的限制)，导致队列溢出和由于TCP重传而具有很长响应时间的请求。另一方面，如果我们将基于线程的服务器配置为具有较大线程池大小以避免队列溢出，则由于多线程开销，可实现的最大吞吐量可能显著低于异步版本。我们的初步结果表明，应该考虑异步架构来构建高响应性和健壮的web应用程序，这些应用程序涉及云数据中心中的数百台服务器。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Performance Comparison of Web Servers with Different Architectures: A Case Study Using High Concurrency Workload

Responsiveness is increasingly important for web servers to seamlessly interact with end-users and enhance user experience. In this paper, we studied how different server architectures -- asynchronous and thread-based -- impact the responsiveness of web servers under high concurrency workload. Through extensive measurements of a standard web server benchmark (Apache Bench), we show that the web servers with asynchronous architecture can achieve much better tail-latency than the thread-based version due to their robustness to handle high concurrency workload. Our fine-grained timeline analysis shows that a thread-based server is fragile to high concurrency workload because of its limited queue size (e.g., limited by thread pool size) for high concurrent requests, causing queue overflow and requests with very long response time due to TCP retransmissions. On the other hand, if we configure a thread-based server with large thread pool size to avoid queue overflow, the maximum achievable throughput can be significantly lower than that of the asynchronous version due to the multi-threading overhead. Our initial results suggest that asynchronous architecture should be considered to construct high responsive and robust web applications that involve hundreds of servers in cloud data centers.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2015 Third IEEE Workshop on Hot Topics in Web Systems and Technologies (HotWeb)

自引率

0.00%

发文量