Authors: G. Harsha, Sujay Deb
Journal: ACM Trans. Design Autom. Electr. Syst., volume 21 1, pages 2:1-2:31
Publication date: 2022-01-31
Publication type: Journal Article
DOI: https://doi.org/10.1145/3462775
Citations: 2
A Novel Hybrid Cache Coherence with Global Snooping for Many-core Architectures
Cache coherence ensures the correctness of cached data in multi-core processors. Traditional implementations of existing protocols do not scale to many-core architectures: snoopy coherence requires unscalable ordered networks, while directory coherence is weighed down by high area and energy overheads. In this work, we propose Wireless-enabled Share-aware Hybrid (WiSH) to provide scalable coherence in many-core processors. WiSH implements a novel Snoopy over Directory protocol using on-chip wireless links and a hierarchical, clustered Network-on-Chip to achieve low-overhead, highly efficient coherence. A local directory protocol maintains coherence within a cluster of cores, while coherence among clusters is achieved through a global snoopy protocol. The ordered network for global snooping is provided by low-latency, low-energy broadcast wireless links. Overheads are further reduced through share-aware cache segmentation, which eliminates coherence for private blocks. Evaluations show that WiSH reduces traffic and runtime while requiring smaller storage and lower energy than existing hierarchical and hybrid coherence protocols. Owing to its modularity, WiSH provides highly efficient and scalable coherence for many-core processors.
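The two-level idea in the abstract (a directory inside each cluster, a broadcast snoop between clusters, and no coherence work for private blocks) can be illustrated with a toy model. This is a minimal sketch under stated assumptions, not the WiSH implementation: the names `Cluster`, `ToyHybridCoherence`, `is_private`, and the `broadcasts` counter are hypothetical. The privacy check uses global knowledge for simplicity, whereas the paper relies on share-aware cache segmentation. The wireless link is modeled only as a counter of broadcast messages.

```python
from dataclasses import dataclass, field


@dataclass
class Cluster:
    """A cluster of cores with a local directory: block -> set of sharer core ids."""
    cid: int
    directory: dict = field(default_factory=dict)

    def invalidate(self, block, keep=None):
        """Drop all local sharers of `block` except `keep`; return how many were dropped."""
        sharers = self.directory.get(block, set())
        dropped = len(sharers - {keep})
        if keep in sharers:
            self.directory[block] = {keep}
        else:
            self.directory.pop(block, None)
        return dropped


class ToyHybridCoherence:
    """Hypothetical toy model: local directories plus a global broadcast snoop."""

    def __init__(self, num_clusters):
        self.clusters = [Cluster(i) for i in range(num_clusters)]
        self.broadcasts = 0  # global snoop messages sent on the shared medium

    def read(self, cid, core, block):
        """Record `core` of cluster `cid` as a sharer of `block`."""
        self.clusters[cid].directory.setdefault(block, set()).add(core)

    def is_private(self, block):
        """Assumption: a block is private if at most one core on the chip holds it."""
        holders = sum(len(c.directory.get(block, ())) for c in self.clusters)
        return holders <= 1

    def write(self, cid, core, block):
        """Perform a write and return the number of copies invalidated."""
        local = self.clusters[cid]
        local.directory.setdefault(block, set()).add(core)
        if self.is_private(block):
            return 0  # private block: no coherence traffic at all
        # Intra-cluster coherence: the local directory invalidates other sharers.
        invalidated = local.invalidate(block, keep=core)
        # Inter-cluster coherence: one broadcast snoop reaches every other cluster.
        if any(c.directory.get(block) for c in self.clusters if c is not local):
            self.broadcasts += 1
            for c in self.clusters:
                if c is not local:
                    invalidated += c.invalidate(block)
        return invalidated


chip = ToyHybridCoherence(num_clusters=2)
chip.read(0, 0, "A")
chip.read(0, 1, "A")
chip.read(1, 0, "A")
shared_write = chip.write(0, 0, "A")   # shared across clusters: needs a broadcast
private_write = chip.write(1, 3, "B")  # private block: no coherence action
```

In this sketch a write to a block shared across clusters costs one broadcast regardless of the number of clusters, while a write to a private block or to a block shared only within one cluster generates no global message. That is the traffic-saving intuition the abstract describes, not a measurement of it.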