Ali Elghirani, Riky Subrata, Albert Y. Zomaya, A. Mazari
{"title":"数据网格中混合复制和遗传算法协同调度的性能增强","authors":"Ali Elghirani, Riky Subrata, Albert Y. Zomaya, A. Mazari","doi":"10.1109/AICCSA.2008.4493570","DOIUrl":null,"url":null,"abstract":"In data grid environments data-intensive applications require large amounts of data to execute. Data transfer is a primary cause of job execution delay. In this paper we study smart scheduling integrated with replica management optimization to improve system performance. We study the use of genetic algorithm (GA) for the scheduling phase of data-intensive applications. The schedulers proposed incorporate information about the datasets and their replicas needed by the jobs to be scheduled, and co-schedules the jobs and the datasets to the computation node guaranteeing minimum job execution time. We employ a data grid replica management framework for the optimization phase of the replica distribution. In this approach we try to achieve a double optimization effect from both the replica management and the scheduling phases, while integrating scheduling and data replication to improve the performance of the grid system. We evaluate and compare our genetic algorithm (GA) with a Tabu search (TS) and the de facto max-min based schedulers.","PeriodicalId":234556,"journal":{"name":"2008 IEEE/ACS International Conference on Computer Systems and Applications","volume":"74 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-03-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"14","resultStr":"{\"title\":\"Performance enhancement through hybrid replication and Genetic Algorithm co-scheduling in data grids\",\"authors\":\"Ali Elghirani, Riky Subrata, Albert Y. Zomaya, A. Mazari\",\"doi\":\"10.1109/AICCSA.2008.4493570\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In data grid environments data-intensive applications require large amounts of data to execute. Data transfer is a primary cause of job execution delay. In this paper we study smart scheduling integrated with replica management optimization to improve system performance. We study the use of genetic algorithm (GA) for the scheduling phase of data-intensive applications. The schedulers proposed incorporate information about the datasets and their replicas needed by the jobs to be scheduled, and co-schedules the jobs and the datasets to the computation node guaranteeing minimum job execution time. We employ a data grid replica management framework for the optimization phase of the replica distribution. In this approach we try to achieve a double optimization effect from both the replica management and the scheduling phases, while integrating scheduling and data replication to improve the performance of the grid system. We evaluate and compare our genetic algorithm (GA) with a Tabu search (TS) and the de facto max-min based schedulers.\",\"PeriodicalId\":234556,\"journal\":{\"name\":\"2008 IEEE/ACS International Conference on Computer Systems and Applications\",\"volume\":\"74 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2008-03-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"14\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2008 IEEE/ACS International Conference on Computer Systems and Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/AICCSA.2008.4493570\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 IEEE/ACS International Conference on Computer Systems and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AICCSA.2008.4493570","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Performance enhancement through hybrid replication and Genetic Algorithm co-scheduling in data grids
In data grid environments data-intensive applications require large amounts of data to execute. Data transfer is a primary cause of job execution delay. In this paper we study smart scheduling integrated with replica management optimization to improve system performance. We study the use of genetic algorithm (GA) for the scheduling phase of data-intensive applications. The schedulers proposed incorporate information about the datasets and their replicas needed by the jobs to be scheduled, and co-schedules the jobs and the datasets to the computation node guaranteeing minimum job execution time. We employ a data grid replica management framework for the optimization phase of the replica distribution. In this approach we try to achieve a double optimization effect from both the replica management and the scheduling phases, while integrating scheduling and data replication to improve the performance of the grid system. We evaluate and compare our genetic algorithm (GA) with a Tabu search (TS) and the de facto max-min based schedulers.