{"title":"基于智能通信控制器的多处理器系统交互式并行绘制","authors":"B. Bäumle, P. Kohler, A. Gunzinger","doi":"10.1145/218327.218342","DOIUrl":null,"url":null,"abstract":"Most data-parallel rendering algorithms (on multiprocessot systems with distributed memory) spend a substantial amount of time composing (merging or assembling) the partial images of all the processors. This paper shows how \"intelligent communication controllers\" (ICCs) help to reduce the immense communication overhead and accumulated latencies to an absolute minimum. Three examples of \"intelligent\" communication schemes are presented: the fully automatic redistribution of multi-dimensional data sets, depthmerge and bucket-sort. We show that these (and other) \"intelligent communication schemes\" can be implemented in hardware with a reasonable effort and that the communication bandwidth is used most efficiently. This results in a good speed-up, good scalability and the maximum utilizable performance for parallel rendering and many other data-parallel algorithms running on our multiprocessor system \"MUSIC\". As an example, we present a simple objectparallel renderer running at interactive frame rates.","PeriodicalId":101947,"journal":{"name":"Proceedings of the IEEE symposium on Parallel rendering","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1995-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Interactive parallel rendering on a multiprocessor system with intelligent communication controllers\",\"authors\":\"B. Bäumle, P. Kohler, A. Gunzinger\",\"doi\":\"10.1145/218327.218342\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Most data-parallel rendering algorithms (on multiprocessot systems with distributed memory) spend a substantial amount of time composing (merging or assembling) the partial images of all the processors. This paper shows how \\\"intelligent communication controllers\\\" (ICCs) help to reduce the immense communication overhead and accumulated latencies to an absolute minimum. Three examples of \\\"intelligent\\\" communication schemes are presented: the fully automatic redistribution of multi-dimensional data sets, depthmerge and bucket-sort. We show that these (and other) \\\"intelligent communication schemes\\\" can be implemented in hardware with a reasonable effort and that the communication bandwidth is used most efficiently. This results in a good speed-up, good scalability and the maximum utilizable performance for parallel rendering and many other data-parallel algorithms running on our multiprocessor system \\\"MUSIC\\\". As an example, we present a simple objectparallel renderer running at interactive frame rates.\",\"PeriodicalId\":101947,\"journal\":{\"name\":\"Proceedings of the IEEE symposium on Parallel rendering\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1995-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the IEEE symposium on Parallel rendering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/218327.218342\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the IEEE symposium on Parallel rendering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/218327.218342","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Interactive parallel rendering on a multiprocessor system with intelligent communication controllers
Most data-parallel rendering algorithms (on multiprocessot systems with distributed memory) spend a substantial amount of time composing (merging or assembling) the partial images of all the processors. This paper shows how "intelligent communication controllers" (ICCs) help to reduce the immense communication overhead and accumulated latencies to an absolute minimum. Three examples of "intelligent" communication schemes are presented: the fully automatic redistribution of multi-dimensional data sets, depthmerge and bucket-sort. We show that these (and other) "intelligent communication schemes" can be implemented in hardware with a reasonable effort and that the communication bandwidth is used most efficiently. This results in a good speed-up, good scalability and the maximum utilizable performance for parallel rendering and many other data-parallel algorithms running on our multiprocessor system "MUSIC". As an example, we present a simple objectparallel renderer running at interactive frame rates.