一种Elasticsearch索引分片数优化方法

Bizhong Wei, Jian Dai, Liqiang Deng, Haiyan Huang
{"title":"一种Elasticsearch索引分片数优化方法","authors":"Bizhong Wei, Jian Dai, Liqiang Deng, Haiyan Huang","doi":"10.1109/CIS52066.2020.00048","DOIUrl":null,"url":null,"abstract":"Elasticsearch, as an open source distributed data search and analysis engine, has been widely used in recent years due to its characteristics. But in a wide range of utilization and deployment, it is not suitable for all scenarios and requirements. Therefore, this paper proposes a method to optimize the number of Elasticsearch index shard based on Elasticsearch full-text retrieval technology and data features in practical application. This method can comprehensively analyze and calculate Elasticsearch remaining storage space and index shard size of each node in distributed cluster to determine the optimal number of index shard in the system, which can improve the efficiency of data retrieval. Experimental results show that, compare with traditional methods, the proposed method can improve the system performance in data distribution, data writing efficiency and data query delay.","PeriodicalId":106959,"journal":{"name":"2020 16th International Conference on Computational Intelligence and Security (CIS)","volume":"259 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An Optimization Method for Elasticsearch Index Shard Number\",\"authors\":\"Bizhong Wei, Jian Dai, Liqiang Deng, Haiyan Huang\",\"doi\":\"10.1109/CIS52066.2020.00048\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Elasticsearch, as an open source distributed data search and analysis engine, has been widely used in recent years due to its characteristics. But in a wide range of utilization and deployment, it is not suitable for all scenarios and requirements. Therefore, this paper proposes a method to optimize the number of Elasticsearch index shard based on Elasticsearch full-text retrieval technology and data features in practical application. This method can comprehensively analyze and calculate Elasticsearch remaining storage space and index shard size of each node in distributed cluster to determine the optimal number of index shard in the system, which can improve the efficiency of data retrieval. Experimental results show that, compare with traditional methods, the proposed method can improve the system performance in data distribution, data writing efficiency and data query delay.\",\"PeriodicalId\":106959,\"journal\":{\"name\":\"2020 16th International Conference on Computational Intelligence and Security (CIS)\",\"volume\":\"259 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 16th International Conference on Computational Intelligence and Security (CIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CIS52066.2020.00048\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 16th International Conference on Computational Intelligence and Security (CIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CIS52066.2020.00048","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

Elasticsearch作为一个开源的分布式数据搜索和分析引擎,由于其自身的特点,近年来得到了广泛的应用。但在广泛的利用和部署中,它并不适合所有的场景和需求。因此,本文在实际应用中提出了一种基于Elasticsearch全文检索技术和数据特征的Elasticsearch索引分片数量优化方法。该方法可以综合分析和计算分布式集群中每个节点的Elasticsearch剩余存储空间和索引分片大小,从而确定系统中最优的索引分片数量,提高数据检索效率。实验结果表明,与传统方法相比,该方法在数据分发、数据写入效率和数据查询延迟等方面都能提高系统性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
An Optimization Method for Elasticsearch Index Shard Number
Elasticsearch, as an open source distributed data search and analysis engine, has been widely used in recent years due to its characteristics. But in a wide range of utilization and deployment, it is not suitable for all scenarios and requirements. Therefore, this paper proposes a method to optimize the number of Elasticsearch index shard based on Elasticsearch full-text retrieval technology and data features in practical application. This method can comprehensively analyze and calculate Elasticsearch remaining storage space and index shard size of each node in distributed cluster to determine the optimal number of index shard in the system, which can improve the efficiency of data retrieval. Experimental results show that, compare with traditional methods, the proposed method can improve the system performance in data distribution, data writing efficiency and data query delay.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信