{"title":"云数据中心数据副本存储的跨区域节点选择算法","authors":"Yang Lu, Changlin Xu, Yan Zheng","doi":"10.1109/ICCEIC51584.2020.00044","DOIUrl":null,"url":null,"abstract":"With the arrival of cloud computing and big data, enterprise application systems are supported by massive data storing in cloud data centers (CDCs), in which the popular open-source NoSQL databases (Cassandra, HBase, MongoDB) are utilized to cope with high concurrency, high availability, and high scalability. There are still two challenging issues: (1) When the consistency level and replica number is specified, selecting the appropriate nodes for data replicas to reduce the communication latency is necessary. (2) A tradeoff between reliability and synchronization time should be discussed in CDCs. In this paper, a cross-area node selection algorithm (CANSA) is proposed to minimize the communication latency, then a new method to adjust reliability in the data center is designed. Numerical results in the Cassandra cluster of CDCs demonstrate that the CANSA provides excellent consensus ratio, synchronization, and latency.","PeriodicalId":135840,"journal":{"name":"2020 International Conference on Computer Engineering and Intelligent Control (ICCEIC)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Cross-area Node Selection Algorithm for Data Replica Storing in Cloud Data Centers\",\"authors\":\"Yang Lu, Changlin Xu, Yan Zheng\",\"doi\":\"10.1109/ICCEIC51584.2020.00044\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the arrival of cloud computing and big data, enterprise application systems are supported by massive data storing in cloud data centers (CDCs), in which the popular open-source NoSQL databases (Cassandra, HBase, MongoDB) are utilized to cope with high concurrency, high availability, and high scalability. There are still two challenging issues: (1) When the consistency level and replica number is specified, selecting the appropriate nodes for data replicas to reduce the communication latency is necessary. (2) A tradeoff between reliability and synchronization time should be discussed in CDCs. In this paper, a cross-area node selection algorithm (CANSA) is proposed to minimize the communication latency, then a new method to adjust reliability in the data center is designed. Numerical results in the Cassandra cluster of CDCs demonstrate that the CANSA provides excellent consensus ratio, synchronization, and latency.\",\"PeriodicalId\":135840,\"journal\":{\"name\":\"2020 International Conference on Computer Engineering and Intelligent Control (ICCEIC)\",\"volume\":\"4 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 International Conference on Computer Engineering and Intelligent Control (ICCEIC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCEIC51584.2020.00044\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference on Computer Engineering and Intelligent Control (ICCEIC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCEIC51584.2020.00044","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Cross-area Node Selection Algorithm for Data Replica Storing in Cloud Data Centers
With the arrival of cloud computing and big data, enterprise application systems are supported by massive data storing in cloud data centers (CDCs), in which the popular open-source NoSQL databases (Cassandra, HBase, MongoDB) are utilized to cope with high concurrency, high availability, and high scalability. There are still two challenging issues: (1) When the consistency level and replica number is specified, selecting the appropriate nodes for data replicas to reduce the communication latency is necessary. (2) A tradeoff between reliability and synchronization time should be discussed in CDCs. In this paper, a cross-area node selection algorithm (CANSA) is proposed to minimize the communication latency, then a new method to adjust reliability in the data center is designed. Numerical results in the Cassandra cluster of CDCs demonstrate that the CANSA provides excellent consensus ratio, synchronization, and latency.