Performance analysis and optimization of three-dimensional FDTD on GPU using roofline model

Ki Hwan Kim, Kyoungho Kim, Q. Han Park

Research output: Contribution to journal › Article › peer-review

45 Citations (Scopus)


The Finite-Difference Time-Domain (FDTD) method is commonly used for electromagnetic field simulations. Recently, successful hardware acceleration using Graphics Processing Units (GPUs) has been reported for large-scale FDTD simulations. In this paper, we present a performance analysis of three-dimensional (3D) FDTD on a GPU using the roofline model. We find that the theoretical predictions of maximum performance agree well with the experimental results. We also suggest suitable optimization methods for achieving the best FDTD performance on a GPU. In particular, the optimized 3D FDTD program on a GPU (NVIDIA GeForce GTX 480) is shown to be 64 times faster than a naively implemented program on a CPU (Intel Core i7 2600).
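As a rough illustration of the roofline model mentioned in the abstract, the sketch below computes the standard roofline bound: attainable performance is the smaller of the peak compute rate and the product of memory bandwidth and arithmetic intensity (floating-point operations per byte moved). This is not the authors' code. The function names and the hardware numbers (`PEAK_GFLOPS`, `BANDWIDTH_GBS`) are hypothetical round values chosen for readability, not figures from the paper or from any specific GPU.

```python
def attainable_gflops(peak_gflops, bandwidth_gbs, intensity):
    """Roofline bound in GFLOP/s.

    peak_gflops   -- peak compute rate of the device (GFLOP/s)
    bandwidth_gbs -- peak memory bandwidth of the device (GB/s)
    intensity     -- arithmetic intensity of the kernel (FLOP per byte)
    """
    if peak_gflops <= 0 or bandwidth_gbs <= 0 or intensity <= 0:
        raise ValueError("all arguments must be positive")
    # The memory roof grows linearly with intensity; the compute roof is flat.
    return min(peak_gflops, bandwidth_gbs * intensity)


def ridge_point(peak_gflops, bandwidth_gbs):
    """Arithmetic intensity (FLOP/byte) at which the two roofs meet."""
    return peak_gflops / bandwidth_gbs


# Hypothetical device parameters, for illustration only.
PEAK_GFLOPS = 1000.0
BANDWIDTH_GBS = 100.0

if __name__ == "__main__":
    ridge = ridge_point(PEAK_GFLOPS, BANDWIDTH_GBS)
    for intensity in (0.25, 1.0, 10.0, 40.0):
        bound = attainable_gflops(PEAK_GFLOPS, BANDWIDTH_GBS, intensity)
        regime = "memory-bound" if intensity < ridge else "compute-bound"
        print(f"{intensity:6.2f} FLOP/byte: at most {bound:7.1f} GFLOP/s ({regime})")
```

A kernel whose arithmetic intensity lies below the ridge point is limited by memory traffic, so under this model the effective optimizations are those that reduce the bytes moved per update rather than the arithmetic performed.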

Original language: English
Pages (from-to): 1201-1207
Number of pages: 7
Journal: Computer Physics Communications
Issue number: 6
Publication status: Published - 2011 Jun

Bibliographical note

Funding Information:
This work was supported by the National Research Foundation of Korea (NRF) Grant funded by the Korea Government (MEST) (Nos. 2010-0028713, 2010-0019171). This work was also supported by a Korea University Grant.


Keywords

  • CUDA
  • FDTD
  • GPU
  • Roofline

ASJC Scopus subject areas

  • Hardware and Architecture
  • General Physics and Astronomy

