Open Access. Powered by Scholars. Published by Universities.®
Biomedical Engineering and Bioengineering Commons™
Full-Text Articles in Biomedical Engineering and Bioengineering
Multi-Level Parallelism for Incompressible Flow Computations on GPU Clusters, Dana A. Jacobsen, Inanc Senocak
Mechanical and Biomedical Engineering Faculty Publications and Presentations
We investigate multi-level parallelism on GPU clusters with MPI-CUDA and hybrid MPI-OpenMP-CUDA parallel implementations, in which all computations are done on the GPU using CUDA. We explore efficiency and scalability of incompressible flow computations using up to 256 GPUs on a problem with approximately 17.2 billion cells. Our work addresses some of the unique issues faced when merging fine-grain parallelism on the GPU using CUDA with coarse-grain parallelism that uses either MPI or MPI-OpenMP for communications. We present three different strategies to overlap computations with communications, and systematically assess their impact on parallel performance on two different GPU clusters. Our …