Generating data transfers for distributed GPU parallel programs |
| |
Authors: | F. Silber-Chaussumier, A. Muller, R. Habel |
| |
Affiliation: | Institut Mines Télécom, Télécom SudParis, Computer Science Department, 91011 Évry, France |
| |
Abstract: | High-performance applications now exploit multi-level architectures, due to the presence of hardware accelerators such as GPUs inside each computing node. Data transfers occur at two levels: inside the computing node, between the CPU and the accelerators, and between computing nodes. We consider the case where intra-node parallelism is handled with HMPP compiler directives and inter-node communication is programmed with MPI message passing. Programming such a heterogeneous architecture this way is costly and error-prone. In this paper, we demonstrate the transformation of HMPP programs designed for a single computing node equipped with a GPU into heterogeneous HMPP + MPI programs that exploit multiple GPUs located on different computing nodes. |
| |
Keywords: | Distributed memory; Data transfer; Source-to-source transformation; Parallel execution; Compiler directives; GPU |
This article is indexed in ScienceDirect and other databases.