Task-based Conjugate-Gradient for multi-GPUs platforms

Whereas most today parallel High Performance Computing (HPC) software is written as highly tuned code taking care of low-level details, the advent of the manycore area forces the community to consider modular programming paradigms and delegate part of the work to a third party software. That latter approach has been shown to be very productive and efficient with regular algorithms, such as dense linear algebra solvers. In this paper we show that such a model can be efficiently applied to a much more irregular and less compute intensive algorithm. We illustrate our discussion with the standard unpreconditioned Conjugate Gradient (CG) that we carefully express as a task-based algorithm. We use the StarPU runtime system to assess the efficiency of the approach on a computational platform consisting of three NVIDIA Fermi GPUs. We show that almost optimum speed up (up to 2.89) may be reached (relatively to a mono-GPU execution) when processing large matrices and that the performance is portable when changing the low-level memory transfer mechanism.

Data and Resources

Additional Info

Field Value
Source https://inria.hal.science/hal-00767368
Author Agullo, Emmanuel, Giraud, Luc, Guermouche, Abdou, Nakov, Stojce, Roman, Jean
Maintainer CCSD
Last Updated May 30, 2026, 02:12 (UTC)
Created May 30, 2026, 02:12 (UTC)
Identifier Report N°: RR-8192
Language en
Rights https://about.hal.science/hal-authorisation-v1/
contributor High-End Parallel Algorithms for Challenging Numerical Simulations (HiePACS) ; Laboratoire Bordelais de Recherche en Informatique (LaBRI) ; Université de Bordeaux (UB)-École Nationale Supérieure d'Électronique, Informatique et Radiocommunications de Bordeaux (ENSEIRB)-Centre National de la Recherche Scientifique (CNRS)-Université de Bordeaux (UB)-École Nationale Supérieure d'Électronique, Informatique et Radiocommunications de Bordeaux (ENSEIRB)-Centre National de la Recherche Scientifique (CNRS)-Centre Inria de l'Université de Bordeaux ; Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)
creator Agullo, Emmanuel
date 2012-05-30T00:00:00
harvest_object_id c185d4f4-7eb8-43cb-978b-29b26fc2ec86
harvest_source_id 3374d638-d20b-4672-ba96-a23232d55657
harvest_source_title test moissonnage SELUNE
metadata_modified 2025-05-26T00:00:00
set_spec type:REPORT