Siqi Liu, Varun Patil, Tianyuan Yu, A. Afanasyev, F. Feltus, Susmit Shannigrahi, Lixia Zhang
Proceedings of the Interdisciplinary Workshop on (de) Centralization in the Internet, 2021-12-07. DOI: https://doi.org/10.1145/3488663.3493690
Designing Hydra with Centralized versus Decentralized Control: A Comparative Study
Today's networked and distributed applications rely, by and large, on cloud services. However, purely cloud-based services are not ideal for every use case, in particular high-volume data sharing in scientific computing, where cloud usage costs can be prohibitively high. We therefore take on the task of building a distributed, federated data repository, dubbed Hydra, for sharing large volumes of scientific data. In this paper, we compare two design choices: building Hydra over TCP/IP with a centralized controller, and building Hydra over Named Data Networking (NDN) to enable distributed control. Our study shows that (i) building Hydra over TCP/IP with a central controller offers a simple, straightforward design; (ii) however, the controller must be replicated for scalability and reliability, and a cloud CDN is needed to scale data delivery, both of which add complexity to the overall design; and (iii) building Hydra over NDN automatically offers scalable and efficient dissemination of data at volume and enables distributed control with high resiliency.