-
Notifications
You must be signed in to change notification settings - Fork 50
Benchmark Sorting performance
Sam Reeve edited this page May 17, 2023
·
4 revisions
The plot below shows performance of sorting and binning of particle AoSoA
data on the ORNL Summit supercomputer. Both CPU and GPU performance are compared as a function of total particle count (with a single MPI rank).
Creation building the permutation vector, while permuting refers to actually sorting (or binning) the particles. Each line represents a single hardware and binning density (if binning).
Default parameters with the commandline "large" setting were used for these results.
Cabana - A Co-Designed Library for Exascale Particle Simulations