StarPU-MPI: Task Programming over Clusters of Machines Enhanced with Accelerators

GPUs have largely entered HPC clusters, as shown by the top entries of the latest top500 issue. Exploiting such machines is however very challenging, not only because of combining two separate paradigms, MPI and CUDA or OpenCL, but also because nodes are heterogeneous and thus require careful load balancing within nodes themselves. The current paradigms are usually limited to only offloading parts of the computation and leaving CPUs idle, or they require static work partitioning between CPUs and GPUs. To handle single-node architecture heterogeneity, we have previously proposed StarPU, a runtime system capable of dynamically scheduling tasks in an optimized way on such machines. We show here how the task paradigm of StarPU has been combined with MPI communications, and how we extended the task paradigm itself to allow mapping the task graph on MPI clusters such as to automatically achieve an optimized distributed execution. We show how a sequential-like Cholesky source code can be easily extended into a scalable distributed parallel execution, and already exhibits a speedup of 5 on 6 nodes.

Data and Resources

Additional Info

Field Value
Source https://inria.hal.science/hal-00992208
Author Augonnet, Cédric, Aumage, Olivier, Furmento, Nathalie, Thibault, Samuel, Namyst, Raymond
Maintainer CCSD
Last Updated May 5, 2026, 10:17 (UTC)
Created May 5, 2026, 10:17 (UTC)
Identifier Report N°: RR-8538
Language en
Rights https://about.hal.science/hal-authorisation-v1/
contributor Efficient runtime systems for parallel architectures (RUNTIME) ; Centre Inria de l'Université de Bordeaux ; Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Bordeaux (UB)-Centre National de la Recherche Scientifique (CNRS)
creator Augonnet, Cédric
date 2014-05-16T00:00:00
harvest_object_id 34e5200d-dc92-4983-aa8a-6e7f631f45d5
harvest_source_id 3374d638-d20b-4672-ba96-a23232d55657
harvest_source_title test moissonnage SELUNE
metadata_modified 2025-05-26T00:00:00
set_spec type:REPORT