Treffer: symPACK: A GPU-Capable Fan-Out Sparse Cholesky Solver

Title:
symPACK: A GPU-Capable Fan-Out Sparse Cholesky Solver
Source:
ACM International Conference Proceeding Series
Publisher Information:
eScholarship, University of California
Publication Year:
2023
Collection:
University of California: eScholarship
Time:
1171 - 1184
Document Type:
Fachzeitschrift article in journal/newspaper
File Description:
application/pdf
Language:
unknown
DOI:
10.1145/3624062.3624600
Rights:
CC-BY
Accession Number:
edsbas.6C1B2E43
Database:
BASE

Weitere Informationen

Sparse symmetric positive definite systems of equations are ubiquitous in scientific workloads and applications. Parallel sparse Cholesky factorization is the method of choice for solving such linear systems. Therefore, the development of parallel sparse Cholesky codes that can efficiently run on today’s large-scale heterogeneous distributed-memory platforms is of vital importance. Modern supercomputers offer nodes that contain a mix of CPUs and GPUs. To fully utilize the computing power of these nodes, scientific codes must be adapted to offload expensive computations to GPUs. We present symPACK, a GPU-capable parallel sparse Cholesky solver that uses one-sided communication primitives and remote procedure calls provided by the UPC++ library. We also utilize the UPC++ “memory kinds” feature to enable efficient communication of GPU-resident data. We show that on a number of large problems, symPACK outperforms comparable state-of-the-art GPU-capable Cholesky factorization codes by up to 14x on the NERSC Perlmutter supercomputer.