Ojus Thomas Lee, G. Akash, S. D. M. Kumar, P. Chandran
A Method for Storage Node Allocation in Erasure Code Based Storage Systems
2017 IEEE 3rd International Conference on Collaboration and Internet Computing (CIC), 2017-10-01
DOI: 10.1109/CIC.2017.00064 (https://doi.org/10.1109/CIC.2017.00064)
Citations: 3
Abstract
Fault tolerance is a major concern for all storage service providers. Providers currently rely on data replication to ensure fault tolerance, but in the big data era replication reduces storage efficiency. Most modern applications therefore use erasure code based storage systems as an alternative to replication. In such systems, storage nodes must be allocated carefully so that the load across the nodes of the storage system stays balanced. In this paper, we propose a greedy solution to the storage node allocation problem with load balancing in a big data environment. We also model the problem using graph theory and give an integer linear program formulation for it.
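The abstract does not spell out the greedy procedure, so the following is only an illustrative sketch of one common greedy heuristic for this kind of problem, not the authors' algorithm. It assumes each object is encoded into a stripe of n blocks that must go to n distinct nodes, and that load is simply the number of blocks a node already holds. The names `allocate_stripe`, `loads`, and the node ids are hypothetical.

```python
import heapq


def allocate_stripe(loads, n):
    """Place one stripe of n blocks on the n currently least-loaded nodes.

    loads: dict mapping node id -> number of blocks stored (assumed load metric).
    Returns the chosen node ids and updates loads in place.
    """
    if n > len(loads):
        raise ValueError("a stripe needs at least n distinct nodes")
    # Greedy choice: the n smallest loads, ties broken by node id.
    chosen = heapq.nsmallest(n, loads, key=lambda node: (loads[node], node))
    for node in chosen:
        loads[node] += 1  # one block of the stripe per chosen node
    return chosen


# Hypothetical cluster of 6 nodes storing 3 stripes of 4 blocks each.
loads = {f"node{i}": 0 for i in range(6)}
placements = [allocate_stripe(loads, 4) for _ in range(3)]
```

In this small example the 12 blocks end up spread evenly, two per node. A real allocator would likely weigh node capacity, network locality, and failure domains, which this sketch ignores.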