Generating data transfers for distributed GPU parallel programs

Authors:	F. Silber-Chaussumier, A. Muller, R. Habel

Affiliation:	Institut Mines Télécom, Télécom SudParis, Computer Science Department, 91011 Évry, France

Abstract:	Nowadays, high-performance applications exploit multi-level architectures, owing to the presence of hardware accelerators such as GPUs inside each computing node. Data transfers occur at two different levels: inside the computing node, between the CPU and the accelerators, and between computing nodes. We consider the case where intra-node parallelism is handled with HMPP compiler directives and inter-node communications are programmed with MPI message passing. Programming such a heterogeneous architecture in this way is costly and error-prone. In this paper, we specifically demonstrate the transformation of HMPP programs designed to exploit a single computing node equipped with a GPU into heterogeneous HMPP + MPI programs exploiting multiple GPUs located on different computing nodes.

Keywords:	Distributed memory; Data transfer; Source-to-source transformation; Parallel execution; Compiler directives; GPU
This article is indexed in ScienceDirect and other databases.