S. Kuramochi, Yu-on Chatchakan, Kousuke Mouri, K. Kaneko
{"title":"Stochastic Link-Fault-Tolerant Routing In A Torus","authors":"S. Kuramochi, Yu-on Chatchakan, Kousuke Mouri, K. Kaneko","doi":"10.1109/JCSSE.2018.8457367","DOIUrl":null,"url":null,"abstract":"It is impossible to operate a parallel system that contains a large number of elements ignoring the existence of faulty elements. In fault-tolerant routing in a parallel system, the method to make each non-faulty vertex store the compressed fault information (limited global information) is effective. The torus is the most popular topology adopted for interconnection networks of parallel systems. However, because of its complexity, any effective method that utilizes the limited global information has not been proposed. In this paper, we propose a stochastic fault-tolerant routing method in tori, which can tolerate the link and/or vertex faults. We can expect that our result achieves drastic improvement regarding fault-tolerant routing in many parallel systems.","PeriodicalId":338973,"journal":{"name":"2018 15th International Joint Conference on Computer Science and Software Engineering (JCSSE)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 15th International Joint Conference on Computer Science and Software Engineering (JCSSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/JCSSE.2018.8457367","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
It is impossible to operate a parallel system that contains a large number of elements ignoring the existence of faulty elements. In fault-tolerant routing in a parallel system, the method to make each non-faulty vertex store the compressed fault information (limited global information) is effective. The torus is the most popular topology adopted for interconnection networks of parallel systems. However, because of its complexity, any effective method that utilizes the limited global information has not been proposed. In this paper, we propose a stochastic fault-tolerant routing method in tori, which can tolerate the link and/or vertex faults. We can expect that our result achieves drastic improvement regarding fault-tolerant routing in many parallel systems.