-
Towards Scalable Checkpoint Restart: A Collective Inline Memory Contents Dedu...
International audience -
BlobCR: Virtual Disk Based Checkpoint-Restart for HPC Applications on IaaS Cl...
International audience -
Fault tolerance in the parallel and distributed environments : optimizing the...
The parallel computing platforms available today are increasingly larger. Typically the emerging parallel platforms will be composed of several millions of CPU cores...
