{"title":"Accelerating Data Ingress for Range-Scan Optimized HBase Instances","authors":"Adrian-Ioan Argesanu, G. Andreescu","doi":"10.1109/SACI51354.2021.9465633","DOIUrl":null,"url":null,"abstract":"HBase was designed to get high performance for read-intensive workloads, making it very attractive for use cases involving retrieval of contiguous sets of rows. HBase instances set up for fast reads often force users to compromise on write performance, which in turn can lead to temporary degradation of read capabilities. In this paper we study existing write penalty mitigation options and introduce our software solution, which not only accelerates bulk data ingress but also retains egress performance without any post-processing or alterations of the soft-schema. Our experiments show that the proposed software solution yields a 47 % reduction of ingestion duration.","PeriodicalId":321907,"journal":{"name":"2021 IEEE 15th International Symposium on Applied Computational Intelligence and Informatics (SACI)","volume":"72 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 15th International Symposium on Applied Computational Intelligence and Informatics (SACI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SACI51354.2021.9465633","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
HBase was designed to get high performance for read-intensive workloads, making it very attractive for use cases involving retrieval of contiguous sets of rows. HBase instances set up for fast reads often force users to compromise on write performance, which in turn can lead to temporary degradation of read capabilities. In this paper we study existing write penalty mitigation options and introduce our software solution, which not only accelerates bulk data ingress but also retains egress performance without any post-processing or alterations of the soft-schema. Our experiments show that the proposed software solution yields a 47 % reduction of ingestion duration.