Engineering | Open Access Articles | Digital Commons Network™

Mpi Within A Gpu, Bobby Dalton Young Jan 2009

Mpi Within A Gpu, Bobby Dalton Young

University of Kentucky Master's Theses

GPUs offer high-performance floating-point computation at commodity prices, but their usage is hindered by programming models which expose the user to irregularities in the current shared-memory environments and require learning new interfaces and semantics.

This thesis will demonstrate that the message-passing paradigm can be conceptually cleaner than the current data-parallel models for programming GPUs because it can hide the quirks of current GPU shared-memory environments, as well as GPU-specific features, behind a well-established and well-understood interface. This will be shown by demonstrating a proof-of-concept MPI implementation which provides cleaner, simpler code with a reasonable performance cost. This thesis will also …

Go to article

Collective Communication And Barrier Synchronization On Nvidia Cuda Gpu, Diego Alejandro Rivera-Polanco Jan 2009

Collective Communication And Barrier Synchronization On Nvidia Cuda Gpu, Diego Alejandro Rivera-Polanco

University of Kentucky Master's Theses

GPUs (Graphics Processing Units) employ a multi-threaded execution model using multiple SIMD cores. Compared to use of a single SIMD engine, this architecture can scale to more processing elements. However, GPUs sacrifice the timing properties which made barrier synchronization implicit and collective communication operations fast.

This thesis demonstrates efficient methods by which these aggregate functions can be implemented using unmodified NVIDIA CUDA GPUs. Although NVIDIA's highest “compute capability" GPUs provide atomic memory functions, they have order N execution time. In contrast, the methods proposed here take advantage of basic properties of the GPU architecture to make implementations that are both …

Go to article

Engineering Commons^™

Full-Text Articles in Engineering

Mpi Within A Gpu, Bobby Dalton Young

University of Kentucky Master's Theses

Collective Communication And Barrier Synchronization On Nvidia Cuda Gpu, Diego Alejandro Rivera-Polanco

University of Kentucky Master's Theses