Jae-Moo Heo, Hyun Yang, Youngje Park, Hee-Jeong Han
{"title":"并行处理技术在卫星海洋颜色数据处理中的应用","authors":"Jae-Moo Heo, Hyun Yang, Youngje Park, Hee-Jeong Han","doi":"10.54007/ijmaf.2020.12.2.1","DOIUrl":null,"url":null,"abstract":"Recent advances demand that remote-sensing satellites efficiently process massive amounts of ocean color data. This paper compares the open multi-processing (OpenMP), the open computing language (OpenCL), the Message Passing Interface (MPI), the hybrid MPI/OpenMP, and the hybrid MPI/OpenCL in the parallel implementation of ocean color processing algorithms using data from the Geostationary Ocean Color Imager (GOCI), which is the first ocean color remote sensor operated in geostationary orbit. Since 2010, GOCI has observed ocean color around the Korean Peninsula and has generated hundreds of terabytes of big data. When any of the data-processing algorithms are updated, all preexisting data is required to be reprocessed, which can take hundreds of days because GOCI data are currently processed sequentially. Therefore, we attempted to develop an efficient parallel processing methodology for GOCI data. We tested well-known GOCI dataprocessing algorithms, like the chlorophyll (CHL) and total suspended solid (TSS) concentration estimation algorithms, using a cluster system. This cluster uses the Red Hat Linux operating system with two Intel Xeon 8-core processors (CPU), an AMD Radeon HD 7970 (GPU), and InfiniBand 4x QDR (network). As a result of this study we were able to improve the GOCI ocean color algorithms' processing speeds for OpenMP, OpenCL, MPI, hybrid MPI/OpenMP, and hybrid MPI/OpenCL by 3.92, 2.56, 2.51 3.27, and 2.05 times, respectively, than that of when we run the data sequentially. Moreover, we confirmed that the OpenMP programming model is the most useful for real-time processing GOCI data, which involves large amounts of input data and relatively simple formulas. Also, the vast number of computational nodes helps reduce the time taken to reprocess all data.","PeriodicalId":278094,"journal":{"name":"KMI International Journal of Maritime Affairs and Fisheries","volume":"63 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Application of Parallel Processing Techniques to Satellite Ocean Color Data Processing\",\"authors\":\"Jae-Moo Heo, Hyun Yang, Youngje Park, Hee-Jeong Han\",\"doi\":\"10.54007/ijmaf.2020.12.2.1\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Recent advances demand that remote-sensing satellites efficiently process massive amounts of ocean color data. This paper compares the open multi-processing (OpenMP), the open computing language (OpenCL), the Message Passing Interface (MPI), the hybrid MPI/OpenMP, and the hybrid MPI/OpenCL in the parallel implementation of ocean color processing algorithms using data from the Geostationary Ocean Color Imager (GOCI), which is the first ocean color remote sensor operated in geostationary orbit. Since 2010, GOCI has observed ocean color around the Korean Peninsula and has generated hundreds of terabytes of big data. When any of the data-processing algorithms are updated, all preexisting data is required to be reprocessed, which can take hundreds of days because GOCI data are currently processed sequentially. Therefore, we attempted to develop an efficient parallel processing methodology for GOCI data. We tested well-known GOCI dataprocessing algorithms, like the chlorophyll (CHL) and total suspended solid (TSS) concentration estimation algorithms, using a cluster system. This cluster uses the Red Hat Linux operating system with two Intel Xeon 8-core processors (CPU), an AMD Radeon HD 7970 (GPU), and InfiniBand 4x QDR (network). As a result of this study we were able to improve the GOCI ocean color algorithms' processing speeds for OpenMP, OpenCL, MPI, hybrid MPI/OpenMP, and hybrid MPI/OpenCL by 3.92, 2.56, 2.51 3.27, and 2.05 times, respectively, than that of when we run the data sequentially. Moreover, we confirmed that the OpenMP programming model is the most useful for real-time processing GOCI data, which involves large amounts of input data and relatively simple formulas. Also, the vast number of computational nodes helps reduce the time taken to reprocess all data.\",\"PeriodicalId\":278094,\"journal\":{\"name\":\"KMI International Journal of Maritime Affairs and Fisheries\",\"volume\":\"63 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"KMI International Journal of Maritime Affairs and Fisheries\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.54007/ijmaf.2020.12.2.1\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"KMI International Journal of Maritime Affairs and Fisheries","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.54007/ijmaf.2020.12.2.1","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
最近的进展要求遥感卫星有效地处理大量的海洋颜色数据。利用首个在地球静止轨道运行的海洋颜色遥感器GOCI的数据,比较了开放多处理(OpenMP)、开放计算语言(OpenCL)、消息传递接口(MPI)、MPI/OpenMP混合模式和MPI/OpenCL混合模式在并行实现海洋颜色处理算法中的应用。GOCI从2010年开始观测韩半岛周围的海洋颜色,并产生了数百tb的大数据。当更新任何数据处理算法时,需要重新处理所有先前存在的数据,这可能需要数百天,因为GOCI数据目前是顺序处理的。因此,我们试图开发一种高效的GOCI数据并行处理方法。我们使用集群系统测试了众所周知的GOCI数据处理算法,如叶绿素(CHL)和总悬浮固体(TSS)浓度估计算法。本集群采用Red Hat Linux操作系统,CPU为Intel Xeon 8核,GPU为AMD Radeon HD 7970,网络为InfiniBand 4x QDR。通过本研究,我们能够将GOCI海洋颜色算法在OpenMP、OpenCL、MPI、混合MPI/OpenMP和混合MPI/OpenCL下的处理速度分别提高3.92倍、2.56倍、2.51倍、3.27倍和2.05倍。此外,我们证实了OpenMP编程模型对于实时处理GOCI数据最有用,这涉及到大量的输入数据和相对简单的公式。此外,大量的计算节点有助于减少重新处理所有数据所花费的时间。
Application of Parallel Processing Techniques to Satellite Ocean Color Data Processing
Recent advances demand that remote-sensing satellites efficiently process massive amounts of ocean color data. This paper compares the open multi-processing (OpenMP), the open computing language (OpenCL), the Message Passing Interface (MPI), the hybrid MPI/OpenMP, and the hybrid MPI/OpenCL in the parallel implementation of ocean color processing algorithms using data from the Geostationary Ocean Color Imager (GOCI), which is the first ocean color remote sensor operated in geostationary orbit. Since 2010, GOCI has observed ocean color around the Korean Peninsula and has generated hundreds of terabytes of big data. When any of the data-processing algorithms are updated, all preexisting data is required to be reprocessed, which can take hundreds of days because GOCI data are currently processed sequentially. Therefore, we attempted to develop an efficient parallel processing methodology for GOCI data. We tested well-known GOCI dataprocessing algorithms, like the chlorophyll (CHL) and total suspended solid (TSS) concentration estimation algorithms, using a cluster system. This cluster uses the Red Hat Linux operating system with two Intel Xeon 8-core processors (CPU), an AMD Radeon HD 7970 (GPU), and InfiniBand 4x QDR (network). As a result of this study we were able to improve the GOCI ocean color algorithms' processing speeds for OpenMP, OpenCL, MPI, hybrid MPI/OpenMP, and hybrid MPI/OpenCL by 3.92, 2.56, 2.51 3.27, and 2.05 times, respectively, than that of when we run the data sequentially. Moreover, we confirmed that the OpenMP programming model is the most useful for real-time processing GOCI data, which involves large amounts of input data and relatively simple formulas. Also, the vast number of computational nodes helps reduce the time taken to reprocess all data.