{"title":"分析并行推理机实现动态负载平衡","authors":"M. Sugie, M. Yoneyama, A. Goto","doi":"10.1109/AIIA.1988.13340","DOIUrl":null,"url":null,"abstract":"A parallel inference machine (PIM) prototype modelled on loosely coupled clusters was simulated on a hardware simulator. Performance of the PIM prototype is limited by suspension/resumption overhead in the fine granularity region and by low utilization, due to load distribution imbalance, in the coarse granularity region. It is shown that the load dispatch strategy in which loads are dispatched to the cluster with minimum loads at an AND-fork time is effective on the loosely-coupled cluster level, resulting in 20% higher performance than in the random dispatch strategy, and that the load status modification delay should be less than half of the reduction time to limit the degradation to within 5%.<<ETX>>","PeriodicalId":112397,"journal":{"name":"Proceedings of the International Workshop on Artificial Intelligence for Industrial Applications","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1988-05-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Analysis of parallel inference machines to achieve dynamic load balancing\",\"authors\":\"M. Sugie, M. Yoneyama, A. Goto\",\"doi\":\"10.1109/AIIA.1988.13340\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A parallel inference machine (PIM) prototype modelled on loosely coupled clusters was simulated on a hardware simulator. Performance of the PIM prototype is limited by suspension/resumption overhead in the fine granularity region and by low utilization, due to load distribution imbalance, in the coarse granularity region. It is shown that the load dispatch strategy in which loads are dispatched to the cluster with minimum loads at an AND-fork time is effective on the loosely-coupled cluster level, resulting in 20% higher performance than in the random dispatch strategy, and that the load status modification delay should be less than half of the reduction time to limit the degradation to within 5%.<<ETX>>\",\"PeriodicalId\":112397,\"journal\":{\"name\":\"Proceedings of the International Workshop on Artificial Intelligence for Industrial Applications\",\"volume\":\"9 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1988-05-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the International Workshop on Artificial Intelligence for Industrial Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/AIIA.1988.13340\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the International Workshop on Artificial Intelligence for Industrial Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AIIA.1988.13340","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Analysis of parallel inference machines to achieve dynamic load balancing
A parallel inference machine (PIM) prototype modelled on loosely coupled clusters was simulated on a hardware simulator. Performance of the PIM prototype is limited by suspension/resumption overhead in the fine granularity region and by low utilization, due to load distribution imbalance, in the coarse granularity region. It is shown that the load dispatch strategy in which loads are dispatched to the cluster with minimum loads at an AND-fork time is effective on the loosely-coupled cluster level, resulting in 20% higher performance than in the random dispatch strategy, and that the load status modification delay should be less than half of the reduction time to limit the degradation to within 5%.<>