{"title":"Accelerating GPU-Based Parallel FDTD With Advanced Operator Fusion","authors":"Siyi Huang;Yu Cheng;Raj Mittra;Xinyue Zhang;Xingqi Zhang","doi":"10.1109/LAWP.2025.3577055","DOIUrl":null,"url":null,"abstract":"The finite-difference time-domain (FDTD) method is one of the most widely used methods for solving Maxwell’s equations, but its efficiency is limited by the Courant–Friedrichs–Lewy stability condition. Recent research has extensively explored graphics processing unit (GPU)-based parallel implementations of the FDTD method to enhance computational performance. However, the inherent time-stepping and time-marching nature of the FDTD algorithm leads to frequent kernel launches and low memory efficiency on GPUs, still significantly impacting execution efficiency. This letter proposes a GPU-based FDTD framework enhanced with operator fusion to address this challenge. Within this framework, the FDTD algorithm is represented as a computation graph composed of operators. We classify these operators into different types based on their input and output relationships. Using this type of information, a rule-based strategy is developed to merge the operators into larger computational kernels, effectively enhancing GPU execution efficiency. Simulation results demonstrate that the proposed operator fusion framework does not introduce additional errors while achieving a 4× speedup compared to conventional GPU-based implementations.","PeriodicalId":51059,"journal":{"name":"IEEE Antennas and Wireless Propagation Letters","volume":"24 9","pages":"2914-2918"},"PeriodicalIF":4.8000,"publicationDate":"2025-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Antennas and Wireless Propagation Letters","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/11024191/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
The finite-difference time-domain (FDTD) method is one of the most widely used methods for solving Maxwell’s equations, but its efficiency is limited by the Courant–Friedrichs–Lewy stability condition. Recent research has extensively explored graphics processing unit (GPU)-based parallel implementations of the FDTD method to enhance computational performance. However, the inherent time-stepping and time-marching nature of the FDTD algorithm leads to frequent kernel launches and low memory efficiency on GPUs, still significantly impacting execution efficiency. This letter proposes a GPU-based FDTD framework enhanced with operator fusion to address this challenge. Within this framework, the FDTD algorithm is represented as a computation graph composed of operators. We classify these operators into different types based on their input and output relationships. Using this type of information, a rule-based strategy is developed to merge the operators into larger computational kernels, effectively enhancing GPU execution efficiency. Simulation results demonstrate that the proposed operator fusion framework does not introduce additional errors while achieving a 4× speedup compared to conventional GPU-based implementations.
期刊介绍:
IEEE Antennas and Wireless Propagation Letters (AWP Letters) is devoted to the rapid electronic publication of short manuscripts in the technical areas of Antennas and Wireless Propagation. These are areas of competence for the IEEE Antennas and Propagation Society (AP-S). AWPL aims to be one of the "fastest" journals among IEEE publications. This means that for papers that are eventually accepted, it is intended that an author may expect his or her paper to appear in IEEE Xplore, on average, around two months after submission.