Skip to content

Benchmark Sorting performance

Sam Reeve edited this page May 17, 2023 · 4 revisions

The plot below shows performance of sorting and binning of particle AoSoA data on the ORNL Summit supercomputer. Both CPU and GPU performance are compared as a function of total particle count (with a single MPI rank).

Creation building the permutation vector, while permuting refers to actually sorting (or binning) the particles. Each line represents a single hardware and binning density (if binning).

Sorting/binning performance

Implementation

Default parameters with the commandline "large" setting were used for these results.

Clone this wiki locally