电磁散射有限元法的并行实现

Trans Black Sea Region Symposium on Applied Electromagnetism Pub Date : 1996-04-17 DOI:10.1109/AEM.1996.873072

P. Atlamazoglou, G. Pagiatakis, N. Uzunoglu

{"title":"电磁散射有限元法的并行实现","authors":"P. Atlamazoglou, G. Pagiatakis, N. Uzunoglu","doi":"10.1109/AEM.1996.873072","DOIUrl":null,"url":null,"abstract":"The finite element method for the computation of electromagnetic fields is gaining popularity in recent times due to its ability to handle arbitrary geometries and its versatility in modeling inhomogeneities and material discontinuities. For electrically large and complex structures, massively parallel computers have to be used in order to obtain sufficiently accurate solutions in acceptable time fiames. In this paper we present the implementation details and the performance evaluation of a parallel three dimensional finite element code for open domain electromagnetic problems. In the finite element method the computational domain is divided into smaller nonoverlapping subvolumes, in our implementation tetrahedra. Within each tetrahedron the scattered electric field is represented using edge based vector basis hnctions. The finite element mesh is truncated artificially at some distance fiom the scatterer with the use of a second order absorbing boundary condition [ 11. The whole mathematical procedure leads to a linear system with symmetric complex sparse matrix. Only the nonzero elements of the upper triangular part of this matrix are stored using the compressed row storage format. The linear system is solved with the conjugate orthogonal conjugate gradient method. The parallel computer we use is the Parsytec We13612 of the Athens High Performance Computing Laboratory. It is a distributed memory machine with message passing architecture. It consists of 512 T805 transputers arranged on a two dimensional grid. In order to parallelize the finite element code using np processors, we divided the global matrix and the vectors into np sections, and assign one section to each processor. The data decomposition is performed in a manner that reduces interprocessor communication, while balancing the load on each processor. We organize the np processors in a virtual ring topology, and employ asynchronous communication that allows us to overlap message exchange with computations for better efficiency. We tested the parallel code for the case of a plane wave incident on a dielectric sphere. The near field values were in good agreement with those from a Me series solution, although the absorbing boundary surface was placed only a fraction of a wavelength away fiom the scatterer. We observed significant speedups for large numbers of processors. This means that the finite element method is well suited for parallelization in a massively parallel environment. We firther noticed that for a given problem size, we can always find an upper boundary of processors above which performance deteriorates, as the increased communication overhead exceeds the time saved by parallel execution of computations. However we can set this boundary arbitrarily high by scaling up sufficiently the size of the problem. References","PeriodicalId":445510,"journal":{"name":"Trans Black Sea Region Symposium on Applied Electromagnetism","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1996-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A parallel implementation of the finite element method for electromagnetic scattering\",\"authors\":\"P. Atlamazoglou, G. Pagiatakis, N. Uzunoglu\",\"doi\":\"10.1109/AEM.1996.873072\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The finite element method for the computation of electromagnetic fields is gaining popularity in recent times due to its ability to handle arbitrary geometries and its versatility in modeling inhomogeneities and material discontinuities. For electrically large and complex structures, massively parallel computers have to be used in order to obtain sufficiently accurate solutions in acceptable time fiames. In this paper we present the implementation details and the performance evaluation of a parallel three dimensional finite element code for open domain electromagnetic problems. In the finite element method the computational domain is divided into smaller nonoverlapping subvolumes, in our implementation tetrahedra. Within each tetrahedron the scattered electric field is represented using edge based vector basis hnctions. The finite element mesh is truncated artificially at some distance fiom the scatterer with the use of a second order absorbing boundary condition [ 11. The whole mathematical procedure leads to a linear system with symmetric complex sparse matrix. Only the nonzero elements of the upper triangular part of this matrix are stored using the compressed row storage format. The linear system is solved with the conjugate orthogonal conjugate gradient method. The parallel computer we use is the Parsytec We13612 of the Athens High Performance Computing Laboratory. It is a distributed memory machine with message passing architecture. It consists of 512 T805 transputers arranged on a two dimensional grid. In order to parallelize the finite element code using np processors, we divided the global matrix and the vectors into np sections, and assign one section to each processor. The data decomposition is performed in a manner that reduces interprocessor communication, while balancing the load on each processor. We organize the np processors in a virtual ring topology, and employ asynchronous communication that allows us to overlap message exchange with computations for better efficiency. We tested the parallel code for the case of a plane wave incident on a dielectric sphere. The near field values were in good agreement with those from a Me series solution, although the absorbing boundary surface was placed only a fraction of a wavelength away fiom the scatterer. We observed significant speedups for large numbers of processors. This means that the finite element method is well suited for parallelization in a massively parallel environment. We firther noticed that for a given problem size, we can always find an upper boundary of processors above which performance deteriorates, as the increased communication overhead exceeds the time saved by parallel execution of computations. However we can set this boundary arbitrarily high by scaling up sufficiently the size of the problem. References\",\"PeriodicalId\":445510,\"journal\":{\"name\":\"Trans Black Sea Region Symposium on Applied Electromagnetism\",\"volume\":\"22 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1996-04-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Trans Black Sea Region Symposium on Applied Electromagnetism\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/AEM.1996.873072\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Trans Black Sea Region Symposium on Applied Electromagnetism","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AEM.1996.873072","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

电磁场计算的有限元方法近年来越来越受欢迎，因为它能够处理任意几何形状，并且在模拟非均匀性和材料不连续方面具有通用性。对于电力大而复杂的结构，为了在可接受的时间内获得足够精确的解，必须使用大规模并行计算机。本文给出了一个求解开域电磁问题的并行三维有限元程序的实现细节和性能评价。在有限元法中，计算域被划分为较小的非重叠子体，在我们的实现中是四面体。在每个四面体内，散射电场用基于边缘的向量基函数表示。利用二阶吸收边界条件在离散射体一定距离处人为截断有限元网格[11]。整个数学过程导致了一个具有对称复稀疏矩阵的线性系统。使用压缩行存储格式存储该矩阵的上三角形部分的非零元素。用共轭正交共轭梯度法求解线性系统。我们使用的并行计算机是雅典高性能计算实验室的Parsytec We13612。它是一种具有消息传递体系结构的分布式内存机。它由512个排列在二维网格上的T805转发器组成。为了使用np处理器并行化有限元代码，我们将全局矩阵和向量划分为np段，并为每个处理器分配一段。数据分解以减少处理器间通信的方式执行，同时平衡每个处理器上的负载。我们将np处理器组织在虚拟环拓扑中，并采用异步通信，使我们能够将消息交换与计算重叠以提高效率。我们对平面波入射到介电球上的并行代码进行了测试。近场值与Me系列溶液的值一致，尽管吸收边界表面距离散射体只有一个波长的一小部分。我们观察到大量处理器的显著加速。这意味着有限元方法非常适合于大规模并行环境中的并行化。我们还注意到，对于给定的问题规模，我们总能找到处理器的上限，超过这个上限，性能就会下降，因为增加的通信开销超过了并行执行计算所节省的时间。然而，我们可以通过充分放大问题的大小，将这个边界设置得任意高。参考文献

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A parallel implementation of the finite element method for electromagnetic scattering

The finite element method for the computation of electromagnetic fields is gaining popularity in recent times due to its ability to handle arbitrary geometries and its versatility in modeling inhomogeneities and material discontinuities. For electrically large and complex structures, massively parallel computers have to be used in order to obtain sufficiently accurate solutions in acceptable time fiames. In this paper we present the implementation details and the performance evaluation of a parallel three dimensional finite element code for open domain electromagnetic problems. In the finite element method the computational domain is divided into smaller nonoverlapping subvolumes, in our implementation tetrahedra. Within each tetrahedron the scattered electric field is represented using edge based vector basis hnctions. The finite element mesh is truncated artificially at some distance fiom the scatterer with the use of a second order absorbing boundary condition [ 11. The whole mathematical procedure leads to a linear system with symmetric complex sparse matrix. Only the nonzero elements of the upper triangular part of this matrix are stored using the compressed row storage format. The linear system is solved with the conjugate orthogonal conjugate gradient method. The parallel computer we use is the Parsytec We13612 of the Athens High Performance Computing Laboratory. It is a distributed memory machine with message passing architecture. It consists of 512 T805 transputers arranged on a two dimensional grid. In order to parallelize the finite element code using np processors, we divided the global matrix and the vectors into np sections, and assign one section to each processor. The data decomposition is performed in a manner that reduces interprocessor communication, while balancing the load on each processor. We organize the np processors in a virtual ring topology, and employ asynchronous communication that allows us to overlap message exchange with computations for better efficiency. We tested the parallel code for the case of a plane wave incident on a dielectric sphere. The near field values were in good agreement with those from a Me series solution, although the absorbing boundary surface was placed only a fraction of a wavelength away fiom the scatterer. We observed significant speedups for large numbers of processors. This means that the finite element method is well suited for parallelization in a massively parallel environment. We firther noticed that for a given problem size, we can always find an upper boundary of processors above which performance deteriorates, as the increased communication overhead exceeds the time saved by parallel execution of computations. However we can set this boundary arbitrarily high by scaling up sufficiently the size of the problem. References

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Trans Black Sea Region Symposium on Applied Electromagnetism

自引率

0.00%

发文量