Jian Lin, Lin Huang, Tao Zhou, Dongming Xie, Bo Yu
{"title":"按需云原生容器存储设计及其在kubernetes上的hdfs实践","authors":"Jian Lin, Lin Huang, Tao Zhou, Dongming Xie, Bo Yu","doi":"10.1145/3589845.3589846","DOIUrl":null,"url":null,"abstract":"Cloud-native big data services become popular in recent years. Two pillars of these services are identified: the separation architecture of compute and storage, and the application-specific controller mechanism. In terms of storage for big data on the cloud, current practices focus on managing a single on-premise storage cluster or building independent PaaS storage services. This paper focuses on the cloud-native containerized storage. An on-demand provisioning design is proposed, which extends the mainstream storage architecture and supports the provisioning of storage clusters for multi-tenancy in a dynamic manner. Its instance of HDFS-on-Kubernetes is implemented. With the mechanisms of global endpoint provisioning and dynamic volume provisioning, this provisioner enables the creation and management of multiple on-demand storage clusters with full-stack resources in an automated way. It guarantees the native performance of host network and local storage, which has been validated through experiments and production applications. It is also easy to use because of its high-level abstraction and single-point configuration mechanism. The design as well as the provisioner has served real business in industrial scenarios.","PeriodicalId":302027,"journal":{"name":"Proceedings of the 2023 9th International Conference on Computing and Data Engineering","volume":"180 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An On-Demand Cloud-Native Containerized Storage Design and its Practice of HDFS-on-Kubernetes\",\"authors\":\"Jian Lin, Lin Huang, Tao Zhou, Dongming Xie, Bo Yu\",\"doi\":\"10.1145/3589845.3589846\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Cloud-native big data services become popular in recent years. Two pillars of these services are identified: the separation architecture of compute and storage, and the application-specific controller mechanism. In terms of storage for big data on the cloud, current practices focus on managing a single on-premise storage cluster or building independent PaaS storage services. This paper focuses on the cloud-native containerized storage. An on-demand provisioning design is proposed, which extends the mainstream storage architecture and supports the provisioning of storage clusters for multi-tenancy in a dynamic manner. Its instance of HDFS-on-Kubernetes is implemented. With the mechanisms of global endpoint provisioning and dynamic volume provisioning, this provisioner enables the creation and management of multiple on-demand storage clusters with full-stack resources in an automated way. It guarantees the native performance of host network and local storage, which has been validated through experiments and production applications. It is also easy to use because of its high-level abstraction and single-point configuration mechanism. The design as well as the provisioner has served real business in industrial scenarios.\",\"PeriodicalId\":302027,\"journal\":{\"name\":\"Proceedings of the 2023 9th International Conference on Computing and Data Engineering\",\"volume\":\"180 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-01-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2023 9th International Conference on Computing and Data Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3589845.3589846\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2023 9th International Conference on Computing and Data Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3589845.3589846","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
An On-Demand Cloud-Native Containerized Storage Design and its Practice of HDFS-on-Kubernetes
Cloud-native big data services become popular in recent years. Two pillars of these services are identified: the separation architecture of compute and storage, and the application-specific controller mechanism. In terms of storage for big data on the cloud, current practices focus on managing a single on-premise storage cluster or building independent PaaS storage services. This paper focuses on the cloud-native containerized storage. An on-demand provisioning design is proposed, which extends the mainstream storage architecture and supports the provisioning of storage clusters for multi-tenancy in a dynamic manner. Its instance of HDFS-on-Kubernetes is implemented. With the mechanisms of global endpoint provisioning and dynamic volume provisioning, this provisioner enables the creation and management of multiple on-demand storage clusters with full-stack resources in an automated way. It guarantees the native performance of host network and local storage, which has been validated through experiments and production applications. It is also easy to use because of its high-level abstraction and single-point configuration mechanism. The design as well as the provisioner has served real business in industrial scenarios.