生物样本数据库随机查询优化器的设计与分析

Manik Sharma, Gurvinder Singh, Rajinder Singh, Jasbir Singh
{"title":"生物样本数据库随机查询优化器的设计与分析","authors":"Manik Sharma, Gurvinder Singh, Rajinder Singh, Jasbir Singh","doi":"10.1109/ICCSA.2015.17","DOIUrl":null,"url":null,"abstract":"The enduring revolution in the field of life sciences is producing biological data at phenomenal rate. The large volume of biological data is meaningful only when it is accessed and analyzed by different researchers. A flood of biological databases is creating a problem in finding an efficient query execution plan for the complex query as posed by bio-researchers. In this research paper, an effort has been made to effectively process the biological data collected in biobanks. The major objective of this research work is to find an optimal query execution plan for biobank database queries using the proposed Restricted Exhaustive Enumeration Approach (REA) and Entropy based Restricted Genetic Approach (ERGA). The result of different query optimization approaches viz. Exhaustive Enumeration, Restricted Exhaustive Enumeration, Simple Genetic Approach and Entropy Based Restricted Genetic Approach are compared with each other on the basis of usage of system resources. The results of Entropy Based Restricted Genetic Algorithm in finding query execution plan for Burbank queries are better than EA and SGAby 2-20% and 7-15% respectively. Furthermore, experimental results reveal that use of Inter-site parallel environment further optimized the results of ERGA by 0-4%.","PeriodicalId":197153,"journal":{"name":"2015 15th International Conference on Computational Science and Its Applications","volume":"29 5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-06-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Design and Analysis of Stochastic Query Optimizer for Biobank Databases\",\"authors\":\"Manik Sharma, Gurvinder Singh, Rajinder Singh, Jasbir Singh\",\"doi\":\"10.1109/ICCSA.2015.17\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The enduring revolution in the field of life sciences is producing biological data at phenomenal rate. The large volume of biological data is meaningful only when it is accessed and analyzed by different researchers. A flood of biological databases is creating a problem in finding an efficient query execution plan for the complex query as posed by bio-researchers. In this research paper, an effort has been made to effectively process the biological data collected in biobanks. The major objective of this research work is to find an optimal query execution plan for biobank database queries using the proposed Restricted Exhaustive Enumeration Approach (REA) and Entropy based Restricted Genetic Approach (ERGA). The result of different query optimization approaches viz. Exhaustive Enumeration, Restricted Exhaustive Enumeration, Simple Genetic Approach and Entropy Based Restricted Genetic Approach are compared with each other on the basis of usage of system resources. The results of Entropy Based Restricted Genetic Algorithm in finding query execution plan for Burbank queries are better than EA and SGAby 2-20% and 7-15% respectively. Furthermore, experimental results reveal that use of Inter-site parallel environment further optimized the results of ERGA by 0-4%.\",\"PeriodicalId\":197153,\"journal\":{\"name\":\"2015 15th International Conference on Computational Science and Its Applications\",\"volume\":\"29 5 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-06-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 15th International Conference on Computational Science and Its Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCSA.2015.17\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 15th International Conference on Computational Science and Its Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCSA.2015.17","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

摘要

生命科学领域的持久革命正在以惊人的速度产生生物数据。大量的生物数据只有在被不同的研究人员访问和分析时才有意义。生物数据库的泛滥给生物研究人员提出的复杂查询寻找有效的查询执行计划带来了难题。在本研究中,对生物库中收集的生物数据进行了有效的处理。本研究的主要目的是利用所提出的受限穷举枚举法(REA)和基于熵的受限遗传法(ERGA)为生物样本库数据库查询找到最优的查询执行计划。基于系统资源的使用情况,比较了穷尽枚举、受限穷尽枚举、简单遗传和基于熵的受限遗传四种查询优化方法的优化结果。基于熵的受限遗传算法在查找Burbank查询执行计划方面分别优于EA和SGAby 2-20%和7-15%。此外,实验结果表明,使用站点间并行环境可使ERGA结果优化0-4%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Design and Analysis of Stochastic Query Optimizer for Biobank Databases
The enduring revolution in the field of life sciences is producing biological data at phenomenal rate. The large volume of biological data is meaningful only when it is accessed and analyzed by different researchers. A flood of biological databases is creating a problem in finding an efficient query execution plan for the complex query as posed by bio-researchers. In this research paper, an effort has been made to effectively process the biological data collected in biobanks. The major objective of this research work is to find an optimal query execution plan for biobank database queries using the proposed Restricted Exhaustive Enumeration Approach (REA) and Entropy based Restricted Genetic Approach (ERGA). The result of different query optimization approaches viz. Exhaustive Enumeration, Restricted Exhaustive Enumeration, Simple Genetic Approach and Entropy Based Restricted Genetic Approach are compared with each other on the basis of usage of system resources. The results of Entropy Based Restricted Genetic Algorithm in finding query execution plan for Burbank queries are better than EA and SGAby 2-20% and 7-15% respectively. Furthermore, experimental results reveal that use of Inter-site parallel environment further optimized the results of ERGA by 0-4%.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信