基于可扩展并行遗传算法库的大规模土地利用优化

Proceedings of the Conference on Extreme Science and Engineering Discovery Environment: Gateway to Discovery Pub Date : 2013-07-22 DOI:10.1145/2484762.2484824

Yan Y. Liu, M. Guo, Shaowen Wang

{"title":"基于可扩展并行遗传算法库的大规模土地利用优化","authors":"Yan Y. Liu, M. Guo, Shaowen Wang","doi":"10.1145/2484762.2484824","DOIUrl":null,"url":null,"abstract":"Optimization algorithms are often employed in spatial analysis and modeling to provide adaptive mechanisms at both individual and collective levels to enable decision-makers for the search of optimal solutions with respect to single/multiple objectives and constraints imposed by spatial configurations. This research aims to solve large-scale agricultural land use optimization problems by exploiting massive parallel computing resources provided by supercomputers such as those in XSEDE. The optimization of agricultural land use patterns finds an optimal assignment of crops (e.g., food and biofuel crops) on land parcels of a specified study area that maximizes the total yield and satisfies various competing constraints. These constraints often consider spatial factors such as contiguity and ownership, climate and land management factors (e.g., soil, precipitation, light, temperature, and ozone) and their effects on the productivity, suitability, and cost of assigning a crop on a land parcel. We have formulated the land use optimization problem as a classic combinatorial optimization problem - Generalized Assignment Problem (GAP) [2]. GAP is a well-known NP-hard problem [3]. When a landscape includes tens of thousands of land parcels (e.g., Figure 1), finding an exact optimal solution is computationally intractable. In our research, we develop a parallel heuristic algorithm by combining an attention to the idiosyncrasies of agricultural land use optimization problem with a scalable parallel genetic algorithm (PGA) [4] to produce near-optimal solutions through scalable and efficient PGA computation on a large number of processors. Our PGA parallelizes the GA computation by running a large number of PGA processes simultaneously, each process conducting independent GA computation with a migration strategy that exchanges solutions between any two directly connected PGA processes at regular intervals. On each PGA process, a set of solutions form a local population. Standard GA operators such as population initialization, selection, crossover, mutation, and replacement are tailored to facilitate the search for better land use patterns based on aforementioned spatial and social economic factors. The parallelism in PGA is straightforward and easily permits a large number of PGA processes to evolve independently by following different randomized search paths and exploring the solution space collectively through migration strategies [1]. Nonetheless, a significant challenge remains regarding how to devise PGAs that are able to scale to massively parallel computer architectures. Issues persist because 1) a common PGA design adopts synchronized migration, which becomes increasingly costly as a larger number of processors are involved in global synchronization in each iteration; and 2) asynchronous PGA design and associated performance evaluation are intricate since the stochastic nature of PGA results in computations that are not simply dependent on the problem size. We addressed this challenge by developing an asynchronous PGA library that implements a scalable asynchronous migration strategy [4]. A suite of non-blocking migration operators (i.e., export, import, and inject) and buffer-based communications are developed to not only remove the costly global synchronization on migration operations, but also to allow for the overlapping of GA computation and migration communication. Buffer overflow issues caused by inter-process communications are resolved through algorithmic analysis. As a result, the relationship between the configuration of asynchronous PGA parameters (i.e., migration intervals, migration rate, and topology attributes) and buffer sizes is established based on the underlying message passing communication library and supercomputer interconnect characteristics to avoid buffer overflow issues at both system and application levels. The scalability of our PGA library was evaluated by conducting strong and weak scaling experiments using up to 16,384 processor cores of the Ranger supercomputer at the Texas Advanced Computing Center. The design of these two experiments was tailored to evaluate the performance of asynchronous implementation of PGAs. Results indicated that our PGA library exhibited desirable speedups in the strong scaling experiment and impressive scalability to problem workload in the weak scaling experiment. Super-linear speedups were observed consistently as the number processor cores increased. The comparison between the asynchronous migration strategy and the corresponding synchronous implementation (Figure 2) is achieved by measuring the ratio of speedup (calculated by dividing the execution time of synchronous runs over asynchronous runs) at multiple solution quality thresholds. In all of the scenarios in which both our PGA and the synchronous version reached the specified solution quality thresholds, our PGA exhibited superior speedups. When using 16,384 processor cores, the speedup improvement was consistent across all of the solution quality thresholds. On average, the communication cost of our PGA was 15.5%, significantly lower than the synchronous version (54%). In the weak scaling experiment, the execution times of our PGA on 16,384 processors were consistently 60% less than the synchronous version at all of the solution quality thresholds as the global population size increased from 204,800 to 3,287,400 (Figure 3). The scalability enabled by the asynchronous migration strategy, in turn, greatly enhanced the problem-solving capabilities of the library to exploit massive computing power for solving large land use optimization instances. Several enhancements to our PGA library are developed for the land use optimization problem-solving. Specific GA encoding mechanism and operators for efficient land use pattern search and fitness evaluation based on formulated spatial and social economic constraints are developed to improve the numerical performance of our PGA. Our PGA library is extended to adapt to supercomputers of hybrid architecture (e.g., Stampede cluster with mixed CPU, Intel Many Integrated Core (MIC), and GPU architecture). Specifically, the asynchronous migration strategy is enhanced and a runtime PGA parameter tuning function is developed for the library to be adaptive to dramatically increased heterogeneity among PGA processes on such supercomputers in order to achieve desirable scalability and reliability. Land use optimization results on the study area of Champaign County, Illinois are presented.","PeriodicalId":426819,"journal":{"name":"Proceedings of the Conference on Extreme Science and Engineering Discovery Environment: Gateway to Discovery","volume":"42 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Large-scale land use optimization by enhancing a scalable parallel genetic algorithm library\",\"authors\":\"Yan Y. Liu, M. Guo, Shaowen Wang\",\"doi\":\"10.1145/2484762.2484824\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Optimization algorithms are often employed in spatial analysis and modeling to provide adaptive mechanisms at both individual and collective levels to enable decision-makers for the search of optimal solutions with respect to single/multiple objectives and constraints imposed by spatial configurations. This research aims to solve large-scale agricultural land use optimization problems by exploiting massive parallel computing resources provided by supercomputers such as those in XSEDE. The optimization of agricultural land use patterns finds an optimal assignment of crops (e.g., food and biofuel crops) on land parcels of a specified study area that maximizes the total yield and satisfies various competing constraints. These constraints often consider spatial factors such as contiguity and ownership, climate and land management factors (e.g., soil, precipitation, light, temperature, and ozone) and their effects on the productivity, suitability, and cost of assigning a crop on a land parcel. We have formulated the land use optimization problem as a classic combinatorial optimization problem - Generalized Assignment Problem (GAP) [2]. GAP is a well-known NP-hard problem [3]. When a landscape includes tens of thousands of land parcels (e.g., Figure 1), finding an exact optimal solution is computationally intractable. In our research, we develop a parallel heuristic algorithm by combining an attention to the idiosyncrasies of agricultural land use optimization problem with a scalable parallel genetic algorithm (PGA) [4] to produce near-optimal solutions through scalable and efficient PGA computation on a large number of processors. Our PGA parallelizes the GA computation by running a large number of PGA processes simultaneously, each process conducting independent GA computation with a migration strategy that exchanges solutions between any two directly connected PGA processes at regular intervals. On each PGA process, a set of solutions form a local population. Standard GA operators such as population initialization, selection, crossover, mutation, and replacement are tailored to facilitate the search for better land use patterns based on aforementioned spatial and social economic factors. The parallelism in PGA is straightforward and easily permits a large number of PGA processes to evolve independently by following different randomized search paths and exploring the solution space collectively through migration strategies [1]. Nonetheless, a significant challenge remains regarding how to devise PGAs that are able to scale to massively parallel computer architectures. Issues persist because 1) a common PGA design adopts synchronized migration, which becomes increasingly costly as a larger number of processors are involved in global synchronization in each iteration; and 2) asynchronous PGA design and associated performance evaluation are intricate since the stochastic nature of PGA results in computations that are not simply dependent on the problem size. We addressed this challenge by developing an asynchronous PGA library that implements a scalable asynchronous migration strategy [4]. A suite of non-blocking migration operators (i.e., export, import, and inject) and buffer-based communications are developed to not only remove the costly global synchronization on migration operations, but also to allow for the overlapping of GA computation and migration communication. Buffer overflow issues caused by inter-process communications are resolved through algorithmic analysis. As a result, the relationship between the configuration of asynchronous PGA parameters (i.e., migration intervals, migration rate, and topology attributes) and buffer sizes is established based on the underlying message passing communication library and supercomputer interconnect characteristics to avoid buffer overflow issues at both system and application levels. The scalability of our PGA library was evaluated by conducting strong and weak scaling experiments using up to 16,384 processor cores of the Ranger supercomputer at the Texas Advanced Computing Center. The design of these two experiments was tailored to evaluate the performance of asynchronous implementation of PGAs. Results indicated that our PGA library exhibited desirable speedups in the strong scaling experiment and impressive scalability to problem workload in the weak scaling experiment. Super-linear speedups were observed consistently as the number processor cores increased. The comparison between the asynchronous migration strategy and the corresponding synchronous implementation (Figure 2) is achieved by measuring the ratio of speedup (calculated by dividing the execution time of synchronous runs over asynchronous runs) at multiple solution quality thresholds. In all of the scenarios in which both our PGA and the synchronous version reached the specified solution quality thresholds, our PGA exhibited superior speedups. When using 16,384 processor cores, the speedup improvement was consistent across all of the solution quality thresholds. On average, the communication cost of our PGA was 15.5%, significantly lower than the synchronous version (54%). In the weak scaling experiment, the execution times of our PGA on 16,384 processors were consistently 60% less than the synchronous version at all of the solution quality thresholds as the global population size increased from 204,800 to 3,287,400 (Figure 3). The scalability enabled by the asynchronous migration strategy, in turn, greatly enhanced the problem-solving capabilities of the library to exploit massive computing power for solving large land use optimization instances. Several enhancements to our PGA library are developed for the land use optimization problem-solving. Specific GA encoding mechanism and operators for efficient land use pattern search and fitness evaluation based on formulated spatial and social economic constraints are developed to improve the numerical performance of our PGA. Our PGA library is extended to adapt to supercomputers of hybrid architecture (e.g., Stampede cluster with mixed CPU, Intel Many Integrated Core (MIC), and GPU architecture). Specifically, the asynchronous migration strategy is enhanced and a runtime PGA parameter tuning function is developed for the library to be adaptive to dramatically increased heterogeneity among PGA processes on such supercomputers in order to achieve desirable scalability and reliability. Land use optimization results on the study area of Champaign County, Illinois are presented.\",\"PeriodicalId\":426819,\"journal\":{\"name\":\"Proceedings of the Conference on Extreme Science and Engineering Discovery Environment: Gateway to Discovery\",\"volume\":\"42 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-07-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the Conference on Extreme Science and Engineering Discovery Environment: Gateway to Discovery\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/2484762.2484824\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Conference on Extreme Science and Engineering Discovery Environment: Gateway to Discovery","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2484762.2484824","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

优化算法通常用于空间分析和建模，以提供个人和集体层面的自适应机制，使决策者能够针对空间配置施加的单/多目标和约束寻找最佳解决方案。本研究旨在利用XSEDE等超级计算机提供的海量并行计算资源，解决大规模农业用地优化问题。农业土地利用模式的优化是在特定研究区域的地块上寻找作物(如粮食和生物燃料作物)的最佳分配，使总产量最大化，并满足各种竞争约束。这些限制通常考虑空间因素，如邻近性和所有权、气候和土地管理因素(如土壤、降水、光照、温度和臭氧)及其对生产力、适宜性和在一块土地上分配作物成本的影响。我们将土地利用优化问题表述为一个经典的组合优化问题——广义分配问题(GAP)[2]。GAP是一个众所周知的NP-hard问题[3]。当一个景观包含数以万计的地块时(例如，图1)，找到一个精确的最优解决方案在计算上是棘手的。在我们的研究中，我们开发了一种并行启发式算法，将对农业土地利用优化问题的特殊性的关注与可扩展的并行遗传算法(PGA)[4]相结合，通过在大量处理器上进行可扩展和高效的PGA计算来产生接近最优的解决方案。我们的PGA通过同时运行大量PGA进程来并行化GA计算，每个进程进行独立的GA计算，并采用一种迁移策略，在任意两个直接连接的PGA进程之间定期交换解。在每个PGA过程中，一组解决方案形成一个局部种群。标准的遗传算子，如种群初始化、选择、交叉、突变和替换，是为了便于基于上述空间和社会经济因素寻找更好的土地利用模式而定制的。PGA中的并行性很简单，可以通过遵循不同的随机搜索路径，并通过迁移策略共同探索解空间，从而使大量PGA进程独立进化[1]。然而，如何设计能够扩展到大规模并行计算机体系结构的pga仍然是一个重大的挑战。问题之所以持续存在，是因为1)常见的PGA设计采用同步迁移，随着每次迭代中涉及的全局同步处理器数量的增加，同步迁移的成本越来越高;2)异步PGA设计和相关的性能评估是复杂的，因为PGA的随机性导致计算不仅仅依赖于问题的大小。我们通过开发实现可伸缩异步迁移策略的异步PGA库来解决这一挑战[4]。开发了一套非阻塞迁移操作符(即导出、导入和注入)和基于缓冲区的通信，不仅消除了迁移操作上昂贵的全局同步，而且还允许遗传算法计算和迁移通信的重叠。通过算法分析解决了进程间通信引起的缓冲区溢出问题。因此，基于底层消息传递通信库和超级计算机互连特性，建立了异步PGA参数配置(即迁移间隔、迁移速率和拓扑属性)与缓冲区大小之间的关系，避免了系统级和应用级缓冲区溢出问题。我们的PGA库的可扩展性是通过在德克萨斯州高级计算中心的Ranger超级计算机上使用多达16,384个处理器内核进行强缩放和弱缩放实验来评估的。这两个实验的设计是为了评估异步实现的PGAs的性能。结果表明，我们的PGA库在强缩放实验中表现出理想的速度，在弱缩放实验中表现出令人印象深刻的问题工作负载可扩展性。随着处理器核数的增加，可以观察到超线性的加速。异步迁移策略和相应的同步实现之间的比较(图2)是通过测量多个解决方案质量阈值下的加速比率(通过将同步运行的执行时间除以异步运行的执行时间计算)来实现的。在我们的PGA和同步版本都达到指定的解决方案质量阈值的所有场景中，我们的PGA表现出优越的加速。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Large-scale land use optimization by enhancing a scalable parallel genetic algorithm library

Optimization algorithms are often employed in spatial analysis and modeling to provide adaptive mechanisms at both individual and collective levels to enable decision-makers for the search of optimal solutions with respect to single/multiple objectives and constraints imposed by spatial configurations. This research aims to solve large-scale agricultural land use optimization problems by exploiting massive parallel computing resources provided by supercomputers such as those in XSEDE. The optimization of agricultural land use patterns finds an optimal assignment of crops (e.g., food and biofuel crops) on land parcels of a specified study area that maximizes the total yield and satisfies various competing constraints. These constraints often consider spatial factors such as contiguity and ownership, climate and land management factors (e.g., soil, precipitation, light, temperature, and ozone) and their effects on the productivity, suitability, and cost of assigning a crop on a land parcel. We have formulated the land use optimization problem as a classic combinatorial optimization problem - Generalized Assignment Problem (GAP) [2]. GAP is a well-known NP-hard problem [3]. When a landscape includes tens of thousands of land parcels (e.g., Figure 1), finding an exact optimal solution is computationally intractable. In our research, we develop a parallel heuristic algorithm by combining an attention to the idiosyncrasies of agricultural land use optimization problem with a scalable parallel genetic algorithm (PGA) [4] to produce near-optimal solutions through scalable and efficient PGA computation on a large number of processors. Our PGA parallelizes the GA computation by running a large number of PGA processes simultaneously, each process conducting independent GA computation with a migration strategy that exchanges solutions between any two directly connected PGA processes at regular intervals. On each PGA process, a set of solutions form a local population. Standard GA operators such as population initialization, selection, crossover, mutation, and replacement are tailored to facilitate the search for better land use patterns based on aforementioned spatial and social economic factors. The parallelism in PGA is straightforward and easily permits a large number of PGA processes to evolve independently by following different randomized search paths and exploring the solution space collectively through migration strategies [1]. Nonetheless, a significant challenge remains regarding how to devise PGAs that are able to scale to massively parallel computer architectures. Issues persist because 1) a common PGA design adopts synchronized migration, which becomes increasingly costly as a larger number of processors are involved in global synchronization in each iteration; and 2) asynchronous PGA design and associated performance evaluation are intricate since the stochastic nature of PGA results in computations that are not simply dependent on the problem size. We addressed this challenge by developing an asynchronous PGA library that implements a scalable asynchronous migration strategy [4]. A suite of non-blocking migration operators (i.e., export, import, and inject) and buffer-based communications are developed to not only remove the costly global synchronization on migration operations, but also to allow for the overlapping of GA computation and migration communication. Buffer overflow issues caused by inter-process communications are resolved through algorithmic analysis. As a result, the relationship between the configuration of asynchronous PGA parameters (i.e., migration intervals, migration rate, and topology attributes) and buffer sizes is established based on the underlying message passing communication library and supercomputer interconnect characteristics to avoid buffer overflow issues at both system and application levels. The scalability of our PGA library was evaluated by conducting strong and weak scaling experiments using up to 16,384 processor cores of the Ranger supercomputer at the Texas Advanced Computing Center. The design of these two experiments was tailored to evaluate the performance of asynchronous implementation of PGAs. Results indicated that our PGA library exhibited desirable speedups in the strong scaling experiment and impressive scalability to problem workload in the weak scaling experiment. Super-linear speedups were observed consistently as the number processor cores increased. The comparison between the asynchronous migration strategy and the corresponding synchronous implementation (Figure 2) is achieved by measuring the ratio of speedup (calculated by dividing the execution time of synchronous runs over asynchronous runs) at multiple solution quality thresholds. In all of the scenarios in which both our PGA and the synchronous version reached the specified solution quality thresholds, our PGA exhibited superior speedups. When using 16,384 processor cores, the speedup improvement was consistent across all of the solution quality thresholds. On average, the communication cost of our PGA was 15.5%, significantly lower than the synchronous version (54%). In the weak scaling experiment, the execution times of our PGA on 16,384 processors were consistently 60% less than the synchronous version at all of the solution quality thresholds as the global population size increased from 204,800 to 3,287,400 (Figure 3). The scalability enabled by the asynchronous migration strategy, in turn, greatly enhanced the problem-solving capabilities of the library to exploit massive computing power for solving large land use optimization instances. Several enhancements to our PGA library are developed for the land use optimization problem-solving. Specific GA encoding mechanism and operators for efficient land use pattern search and fitness evaluation based on formulated spatial and social economic constraints are developed to improve the numerical performance of our PGA. Our PGA library is extended to adapt to supercomputers of hybrid architecture (e.g., Stampede cluster with mixed CPU, Intel Many Integrated Core (MIC), and GPU architecture). Specifically, the asynchronous migration strategy is enhanced and a runtime PGA parameter tuning function is developed for the library to be adaptive to dramatically increased heterogeneity among PGA processes on such supercomputers in order to achieve desirable scalability and reliability. Land use optimization results on the study area of Champaign County, Illinois are presented.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the Conference on Extreme Science and Engineering Discovery Environment: Gateway to Discovery

自引率

0.00%

发文量