Triple integration optimization techniques in data grid environment using OptorSim simulator

2017 International Conference on Data Management, Analytics and Innovation (ICDMAI) Pub Date : 2017-02-01 DOI:10.1109/ICDMAI.2017.8073499

D. Manjaiah, Abdo H. Guroob

{"title":"Triple integration optimization techniques in data grid environment using OptorSim simulator","authors":"D. Manjaiah, Abdo H. Guroob","doi":"10.1109/ICDMAI.2017.8073499","DOIUrl":null,"url":null,"abstract":"Data Grid Environments consist of geographically distributed resources to solve scientific problems and tasks of researchers, scientists and engineers, which are difficult to accomplish by traditional methods based on computer networks. Scheduling and replication are considered some of the most important techniques used in data grid environments, which are used to improve performance and availability to get the best throughput in the shortest possible time. Thus, some algorithms are used for these purposes. Effective scheduling working to reduce the time of implementation of tasks (makespan) of the available resources in the data grid, while replication is working to provide appropriate places or replace similar data to accelerate job execution time. On the other hand, there is another technique, which is important as scheduling and replication, which can be used to reduce the time of implementation for a user request. This technique called Access Pattern, defines the order in which the files are requested for each job to accelerate the completion of the task. Most researchers are focusing on the scheduling, replication, or Access Pattern separately, which leads to variation in the results and gives them unsatisfactory results. The contribution of this paper is present the impact and effect of the triple integration of the three techniques to completing tasks in data grid environments by comparing the results of different algorithms available in the OptorSim simulator.","PeriodicalId":368507,"journal":{"name":"2017 International Conference on Data Management, Analytics and Innovation (ICDMAI)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 International Conference on Data Management, Analytics and Innovation (ICDMAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDMAI.2017.8073499","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

Abstract

Data Grid Environments consist of geographically distributed resources to solve scientific problems and tasks of researchers, scientists and engineers, which are difficult to accomplish by traditional methods based on computer networks. Scheduling and replication are considered some of the most important techniques used in data grid environments, which are used to improve performance and availability to get the best throughput in the shortest possible time. Thus, some algorithms are used for these purposes. Effective scheduling working to reduce the time of implementation of tasks (makespan) of the available resources in the data grid, while replication is working to provide appropriate places or replace similar data to accelerate job execution time. On the other hand, there is another technique, which is important as scheduling and replication, which can be used to reduce the time of implementation for a user request. This technique called Access Pattern, defines the order in which the files are requested for each job to accelerate the completion of the task. Most researchers are focusing on the scheduling, replication, or Access Pattern separately, which leads to variation in the results and gives them unsatisfactory results. The contribution of this paper is present the impact and effect of the triple integration of the three techniques to completing tasks in data grid environments by comparing the results of different algorithms available in the OptorSim simulator.

查看原文本刊更多论文

基于OptorSim模拟器的数据网格环境下的三重积分优化技术

数据网格环境是由地理上分布的资源组成的，用于解决基于计算机网络的传统方法难以完成的研究人员、科学家和工程师的科学问题和任务。调度和复制被认为是数据网格环境中使用的一些最重要的技术，它们用于提高性能和可用性，以便在尽可能短的时间内获得最佳吞吐量。因此，一些算法被用于这些目的。有效的调度工作减少数据网格中可用资源的任务实现时间(makespan)，而复制工作提供适当的位置或替换类似的数据以加快作业执行时间。另一方面，还有另一种技术，它与调度和复制一样重要，可用于减少用户请求的实现时间。这种称为访问模式的技术定义了为每个作业请求文件的顺序，以加速任务的完成。大多数研究人员分别关注调度、复制或访问模式，这导致了结果的变化，并给出了令人不满意的结果。本文的贡献是通过比较OptorSim模拟器中可用的不同算法的结果，介绍三种技术的三重集成对数据网格环境下完成任务的影响和效果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2017 International Conference on Data Management, Analytics and Innovation (ICDMAI)

自引率

0.00%

发文量