Thread Tailor: Dynamically Weaving Threads Together for Efficient, Adaptive Parallel Applications

Proceedings of the 37th Annual International Symposium on Computer Architecture. Pub Date: 2010-06-19. DOI: 10.1145/1815961.1815996

Janghaeng Lee, Haicheng Wu, M. Ravichandran, Nathan Clark


Citations: 94

Abstract

Extracting performance from modern parallel architectures requires that applications be divided into many different threads of execution. Unfortunately, selecting the appropriate number of threads for an application is a daunting task. Having too many threads can quickly saturate shared resources, such as cache capacity or memory bandwidth, thus degrading performance. On the other hand, having too few threads makes inefficient use of the resources available. Beyond static resource assignment, the program inputs and dynamic system state (e.g., what other applications are executing in the system) can have a significant impact on the right number of threads to use for a particular application. To address this problem, we present the Thread Tailor, a dynamic system that automatically adjusts the number of threads in an application to optimize system efficiency. The Thread Tailor leverages offline analysis to estimate what type of threads will exist at runtime and the communication patterns between them. Using this information, Thread Tailor dynamically combines threads to better suit the needs of the target system. Thread Tailor adjusts not only to the architecture, but also to other applications in the system, and this paper demonstrates that this type of adjustment can lead to significantly better use of thread-level parallelism in real-world architectures.
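
The abstract does not say how threads are chosen for combining, so the following is only an illustrative sketch and not the authors' algorithm. It assumes a hypothetical communication profile (a mapping from thread pairs to communication volume, standing in for the offline analysis) and a hypothetical target thread count (standing in for what the runtime decides the system can support), and it uses a simple greedy heuristic: repeatedly merge the two thread groups that communicate the most.

```python
from itertools import combinations


def combine_threads(comm, num_threads, target):
    """Greedily merge thread groups until at most `target` groups remain.

    comm: dict mapping frozenset({a, b}) -> communication volume between
          threads a and b (hypothetical profile data, an assumption).
    num_threads: number of threads the application was written with.
    target: desired number of combined threads (an assumption).
    """
    if target < 1:
        raise ValueError("target must be at least 1")

    # Start with every original thread in its own group.
    groups = [{t} for t in range(num_threads)]

    def traffic(g1, g2):
        # Total communication volume crossing between two groups.
        return sum(comm.get(frozenset((a, b)), 0) for a in g1 for b in g2)

    while len(groups) > target:
        # Pick the pair of groups with the largest communication volume.
        i, j = max(
            combinations(range(len(groups)), 2),
            key=lambda p: traffic(groups[p[0]], groups[p[1]]),
        )
        # i < j always holds, so deleting index j leaves index i valid.
        groups[i] |= groups[j]
        del groups[j]
    return groups
```

For example, with four threads where threads 0 and 1 exchange 10 units, threads 2 and 3 exchange 8 units, and threads 1 and 2 exchange 1 unit, calling `combine_threads(comm, 4, 2)` groups 0 with 1 and 2 with 3, keeping the heaviest communication inside a single combined thread. A real system would also have to weigh load balance and shared-resource pressure, which this sketch ignores.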
