首页 | 官方网站   微博 | 高级检索  
     


Generating data transfers for distributed GPU parallel programs
Authors:F. Silber-Chaussumier  A. Muller  R. Habel
Affiliation:Institut Mines Télécom, Télécom SudParis, Computer Science Department, 91011 Évry, France
Abstract:Nowadays, high performance applications exploit multiple level architectures, due to the presence of hardware accelerators like GPUs inside each computing node. Data transfers occur at two different levels: inside the computing node between the CPU and the accelerators and between computing nodes. We consider the case where the intra-node parallelism is handled with HMPP compiler directives and message-passing programming with MPI is used to program the inter-node communications. This way of programming on such an heterogeneous architecture is costly and error-prone. In this paper, we specifically demonstrate the transformation of HMPP programs designed to exploit a single computing node equipped with a GPU into an heterogeneous HMPP + MPI exploiting multiple GPUs located on different computing nodes.
Keywords:Distributed memory   Data transfer   Source-to-source transformation   Parallel execution   Compiler directives   GPU
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号